Annotating functional sites in 3D biological structures

Altman, Russ

Abstract

High-throughput data collection methods have revolutionized many areas of biology and medicine. The National Library of Medicine has targeted the representation, management, and manipulation of biological structure as a key element of its mission. Following upon the success of genome sequencing and functional genomics projects, the structural biology community is creating technologies to streamline the process of determining three-dimensional biological structures, notably through efforts in structural genomics. As with other high-throughput efforts, a major challenge is the appropriate annotation and indexing of structures for retrieval and analysis by biologists who are trying to understand molecular function at atomic detail: Where are the important functional sites, and how confident are we in their location? In this proposal, we plan to develop and apply methods for annotating biological structures, so that active sites, binding sites, and interaction sites can be automatically identified and annotated. Our novel computational representation of functional sites has been successful in characterizing these sites and in recognizing them based on their biochemical and biophysical signature, a 3D motif. We propose to improve the performance of our method with basic research in the representations and algorithms used for our site models. Because our site models are manually created, our library of available models has grown slowly. We therefore further propose to accelerate the growth of our model library using a combination of supervised and unsupervised machine learning methods. First, we will use known 1D sequence motifs as "seeds" to create corresponding 3D motifs. Second, we will develop techniques for discovering entirely new motifs using clustering techniques.
We will evaluate our models and resulting predictions through analysis of known structural sites, follow-up and dissemination of predictions with the structural genomics community, and large-scale evaluation on decoy and predicted structures. We will make the resulting models available on the Web for real-time structural annotation, and will distribute the software for open source development.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Library of Medicine (NLM)
Type: Research Project (R01)
Project #: 2R01LM005652-09
Application #: 6825272
Study Section: Special Emphasis Panel (ZLM1-HS-A (M3))
Program Officer: Ye, Jane

Project Start: 1994-07-01
Project End: 2009-08-31
Budget Start: 2004-09-01
Budget End: 2005-08-31
Support Year: 9
Fiscal Year: 2004
Total Cost: $396,367
Indirect Cost

Institution

Name: Stanford University
Department: Genetics
Type: Schools of Medicine
DUNS #: 009214214

City: Stanford
State: CA
Country: United States
Zip Code: 94305

Publications

Zhou, Weizhuang; Altman, Russ B (2018) Data-driven human transcriptomic modules determined by independent component analysis. BMC Bioinformatics 19:327

Lo, Yu-Chen; Rensi, Stefano E; Torng, Wen et al. (2018) Machine learning in chemoinformatics and drug discovery. Drug Discov Today 23:1538-1546

Previde, Paul; Thomas, Brook; Wong, Mike et al. (2018) GeneDive: A gene interaction search and visualization tool to facilitate precision medicine. Pac Symp Biocomput 23:590-601

Liu, Tianyun; Ish-Shalom, Shirbi; Torng, Wen et al. (2018) Biological and functional relevance of CASP predictions. Proteins 86 Suppl 1:374-386

Percha, Bethany; Altman, Russ B (2018) A global network of biomedical relationships derived from text. Bioinformatics 34:2614-2624

Lavertu, Adam; McInnes, Greg; Daneshjou, Roxana et al. (2018) Pharmacogenomics and big genomic data: from lab to clinic and back again. Hum Mol Genet 27:R72-R78

Petkovic, Dragutin; Altman, Russ; Wong, Mike et al. (2018) Improving the explainability of Random Forest classifier - user centered approach. Pac Symp Biocomput 23:204-215

Mallory, Emily K; Acharya, Ambika; Rensi, Stefano E et al. (2018) Chemical reaction vector embeddings: towards predicting drug metabolism in the human gut microbiome. Pac Symp Biocomput 23:56-67

Gottlieb, Assaf; Daneshjou, Roxana; DeGorter, Marianne et al. (2017) Cohort-specific imputation of gene expression improves prediction of warfarin dose for African Americans. Genome Med 9:98

Torng, Wen; Altman, Russ B (2017) 3D deep convolutional neural networks for amino acid environment similarity analysis. BMC Bioinformatics 18:302

Showing the most recent 10 out of 64 publications
