This work aims to develop methods to identify functional sites in protein structures and to characterize protein function on a genomic scale. The approach is predicated on the Evolutionary Trace method (ET) to locate functional sites in structures. Preliminary studies enabled us to automate the basic steps towards a complete, automated functional annotation pipeline, namely, functional site analysis with ET;extraction from ET analysis of SD-templates that describe composition and conformation of key residues involved in binding or catalytic function;the search in other structures for geometric matches to these 3D-templates;and the analysis of which of those matched are most biologically relevant. We now seek to increase the sensitivity and specificity of the annotation pipeline by optimizing the definition of 3-D templates, by adding new template features to better judge whether molecular mimicry underlies functional similarity;and by developing novel strategies that use multiple templates to identify function. The result will reveal which regions of proteins are most biologically relevant, and hence logical targets for protein engineering and drug design, and it will extend to three dimensions a functional annotation strategy traditionally based on one- dimensional pattern matching in protein sequences. In so doing, this work addresses a fundamental NIH roadmap problem in """"""""post-genomic biology"""""""": linking massive and exponentially growing amounts of raw sequence and structure data to the molecular basis of biological function.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM079656-04
Application #
7786185
Study Section
Special Emphasis Panel (ZRG1-BCMB-Q (90))
Program Officer
Wehrle, Janna P
Project Start
2007-04-01
Project End
2011-09-29
Budget Start
2010-04-01
Budget End
2011-09-29
Support Year
4
Fiscal Year
2010
Total Cost
$278,531
Indirect Cost
Name
Baylor College of Medicine
Department
Genetics
Type
Schools of Medicine
DUNS #
051113330
City
Houston
State
TX
Country
United States
Zip Code
77030
Chun, Yun Shin; Passot, Guillaume; Yamashita, Suguru et al. (2017) Deleterious Effect of RAS and Evolutionary High-risk TP53 Double Mutation in Colorectal Liver Metastases. Ann Surg :
Gallion, Jonathan; Koire, Amanda; Katsonis, Panagiotis et al. (2017) Predicting phenotype from genotype: Improving accuracy through more robust experimental and computational modeling. Hum Mutat 38:569-580
Wilson, Stephen J; Wilkins, Angela D; Lin, Chih-Hsu et al. (2017) DISCOVERY OF FUNCTIONAL AND DISEASE PATHWAYS BY COMMUNITY DETECTION IN PROTEIN-PROTEIN INTERACTION NETWORKS. Pac Symp Biocomput 22:336-347
Koire, Amanda; Kim, Young Won; Wang, Jarey et al. (2017) Codon-level co-occurrences of germline variants and somatic mutations in cancer are rare but often lead to incorrect variant annotation and underestimated impact prediction. PLoS One 12:e0174766
Katsonis, Panagiotis; Lichtarge, Olivier (2017) Objective assessment of the evolutionary action equation for the fitness effect of missense mutations across CAGI-blinded contests. Hum Mutat 38:1072-1084
Xu, Qifang; Tang, Qingling; Katsonis, Panagiotis et al. (2017) Benchmarking predictions of allostery in liver pyruvate kinase in CAGI4. Hum Mutat 38:1123-1131
Schönegge, Anne-Marie; Gallion, Jonathan; Picard, Louis-Philippe et al. (2017) Evolutionary action and structural basis of the allosteric switch controlling ?2AR functional selectivity. Nat Commun 8:2169
Gallion, Jonathan; Wilkins, Angela D; Lichtarge, Olivier (2017) HUMAN KINASES DISPLAY MUTATIONAL HOTSPOTS AT COGNATE POSITIONS WITHIN CANCER. Pac Symp Biocomput 22:414-425
Cancer Genome Atlas Research Network. Electronic address: wheeler@bcm.edu; Cancer Genome Atlas Research Network (2017) Comprehensive and Integrative Genomic Characterization of Hepatocellular Carcinoma. Cell 169:1327-1341.e23
Lua, Rhonald C; Wilson, Stephen J; Konecki, Daniel M et al. (2016) UET: a database of evolutionarily-predicted functional determinants of protein sequences that cluster as functional sites in protein structures. Nucleic Acids Res 44:D308-12

Showing the most recent 10 out of 60 publications