Virtual screening is the most practical method to leverage ligand and protein structures for lead discovery. Unfortunately, both ligand-based and docking techniques are inaccessible to most investigators. A key result from the first period, the ZINC database, has lowered the barrier to entry for docking through public access 3D screening libraries. The Similarity Ensemble Approach (SEA), also developed in the first period, has shown early promise for ligand-based target identification. Still, virtual screening remains difficult to use for most investigators. To lower these barriers still further we will develop databases and automated tools for use by the general community, and investigate their usefulness in proof-of-concept studies.
The specific aims are: 1. To develop databases that derive from and enable virtual screening. A. We will develop a database of pre-calculated docking hits that can simply be looked up and purchased for about 1,000 protein targets. This will rely on automated tools for docking, hit evaluation, and comparisons among targets (aim 2). We will also improve databases for virtual screening developed in the first period. These include: B. Expanding ZINC, adding more commercially available compounds and improving the structures represented in it. C. Improving the robustness of DUD, a general benchmarking set for virtual screening. D. Expanding the database of high energy intermediates (HEI) developed in the first period for protein function prediction. 2. To create simple web-based tools for ligand-based and protein-based virtual screening. We will develop and refine two web-based tools to enable non-specialists to discover ligands for their targets. A. For structure-based docking, a simple-looking web-interface to docking that guides the user, selects parameters, calibrates the model, and manages the calculation on our cluster. We will develop automated tools to evaluate the reliability of docking results. B. The second virtual screening tool is ligand based, for use when the structure of the target is unknown but many ligands are available, or when one wants to explore alternate targets for a known drug or reagent. We further develop a novel cheminformatic method SEA introduced in the last period to predict target relationships and off-target effects. This approach has had precocious success in identifying interesting polypharmacology, and we will also use it ourselves to predict- and-test off-target, clinically relevant effects of 50 to 100 FDA drugs, and identify the targets of the ~10% of FDA drugs for which a target is unknown.

Public Health Relevance

Virtual screening is widely used to discover new molecular leads for drug discovery and reagents to understand biological processes. Unfortunately, the technique remains difficult to use, and has thus been restricted to a few expert laboratories, limiting its usefulness. In this proposal, we create databases and tools to bring virtual screening to a wide biological audience, much expanding its impact and usefulness.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Project #
Application #
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Preusch, Peter C
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California San Francisco
Schools of Pharmacy
San Francisco
United States
Zip Code
Irwin, John J; Gaskins, Garrett; Sterling, Teague et al. (2018) Predicted Biological Activity of Purchasable Chemical Space. J Chem Inf Model 58:148-164
Maciejewski, Mateusz; Lounkine, Eugen; Whitebread, Steven et al. (2017) Reverse translation of adverse event reports paves the way for de-risking preclinical off-targets. Elife 6:
O'Meara, Matthew J; Ballouz, Sara; Shoichet, Brian K et al. (2016) Ligand Similarity Complements Sequence, Physical Interaction, and Co-Expression for Gene Function Prediction. PLoS One 11:e0160098
Farrell, Martilias S; McCorvy, John D; Huang, Xi-Ping et al. (2016) In Vitro and In Vivo Characterization of the Alkaloid Nuciferine. PLoS One 11:e0150602
Irwin, John J; Shoichet, Brian K (2016) Docking Screens for Novel Ligands Conferring New Biology. J Med Chem 59:4103-20
Sterling, Teague; Irwin, John J (2015) ZINC 15--Ligand Discovery for Everyone. J Chem Inf Model 55:2324-37
Barelier, Sarah; Sterling, Teague; O'Meara, Matthew J et al. (2015) The Recognition of Identical Ligands by Unrelated Proteins. ACS Chem Biol 10:2772-84
Huang, Xi-Ping; Karpiak, Joel; Kroeze, Wesley K et al. (2015) Allosteric ligands for the pharmacologically dark receptors GPR68 and GPR65. Nature 527:477-83
Yee, Sook Wah; Lin, Lawrence; Merski, Matthew et al. (2015) Prediction and validation of enzyme and transporter off-targets for metformin. J Pharmacokinet Pharmacodyn 42:463-75
Pimentel-Elardo, Sheila M; Sørensen, Dan; Ho, Louis et al. (2015) Activity-Independent Discovery of Secondary Metabolites Using Chemical Elicitation and Cheminformatic Inference. ACS Chem Biol 10:2616-23

Showing the most recent 10 out of 43 publications