The project will provide new technology to accelerate the discovery of protein structures and assemblies from humans and other species. Knowing the 3D structures of proteins has important biomedical implications for the development of protein-based therapies and targeted therapeutic drugs, but the 3D structures of proteins of thousands of important protein types remain unsolved. The project aims to close this gap, based on two recent advances: (1) the rapid development of new DNA sequencing technologies and (2) a recent breakthrough in protein 3D structure prediction using statistical physics and bio-molecular computation. The new structure prediction method, developed by the applicant team, extracts evolutionary residue-residue couplings from multiple sequence alignments, using a maximum entropy method. The team will use the evolutionary couplings as distance constraints to predict the structure of many single domains, of multidomain proteins and of protein complexes, and to map functional sites on known and predicted structures, with potentially broad impact on diverse biological research areas. The team will also aim to aid the development of hybrid computational- experimental technologies for structure determination. For X-ray crystallography, the aim is bridge the gap between the predicted 3D structures and the basin of convergence for molecular replacement, allowing structure determination from a single native data set without the need for anomalous or derivative diffraction datasets. For NMR, the aim is to add evolutionary couplings to NMR-derived backbone and residue-residue distance information and thus reduce experimental effort and/or permit the solution of larger structures. The project is a close collaboration between the Computational Biology Program at Memorial Sloan-Kettering Cancer Center and the Department of Systems Biology at Harvard Medical School. Experimental collaborations with PSI:Biology centers and the international structural genomics effort will aim to implement a more efficient technology for the determination of biomedically relevant protein structures.
Knowing the 3D structures of proteins has important biomedical implications for the development of protein-based therapies and targeted therapeutic drugs. Currently, the 3D structures of proteins of thousands of important protein types remain unsolved. The project will provide new technology, by combining advances in genomic sequencing with statistical physics and biomolecular computation, to accelerate the discovery of protein structures from humans, as well as pathogenic microorganisms, such as bacteria, viruses and fungi.
|Grad, Yonatan H; Harris, Simon R; Kirkcaldy, Robert D et al. (2016) Genomic Epidemiology of Gonococcal Resistance to Extended-Spectrum Cephalosporins, Macrolides, and Fluoroquinolones in the United States, 2000-2013. J Infect Dis 214:1579-1587|
|Weinreb, Caleb; Riesselman, Adam J; Ingraham, John B et al. (2016) 3D RNA and Functional Interactions from Evolutionary Couplings. Cell 165:963-75|
|Nicoludis, John M; Vogt, Bennett E; Green, Anna G et al. (2016) Antiparallel protocadherin homodimers use distinct affinity- and specificity-mediating regions in cadherin repeats 1-4. Elife 5:|
|Toth-Petroczy, Agnes; Palmedo, Perry; Ingraham, John et al. (2016) Structured States of Disordered Proteins from Genomic Sequences. Cell 167:158-170.e12|
|Hayat, Sikander; Sander, Chris; Marks, Debora S et al. (2015) All-atom 3D structure prediction of transmembrane Î²-barrel proteins from sequences. Proc Natl Acad Sci U S A 112:5413-8|
|Hopf, Thomas A; Morinaga, Satoshi; Ihara, Sayoko et al. (2015) Amino acid coevolution reveals three-dimensional structure and functional domains of insect odorant receptors. Nat Commun 6:6077|
|Tang, Yuefeng; Huang, Yuanpeng Janet; Hopf, Thomas A et al. (2015) Protein structure determination by combining sparse NMR data with evolutionary couplings. Nat Methods 12:751-4|
|Stein, Richard R; Marks, Debora S; Sander, Chris (2015) Inferring Pairwise Interactions from Biological Data Using Maximum-Entropy Probability Models. PLoS Comput Biol 11:e1004182|
|KajÃ¡n, LÃ¡szlÃ³; Hopf, Thomas A; KalaÅ¡, MatÃºÅ¡ et al. (2014) FreeContact: fast and free software for protein contact prediction from residue co-evolution. BMC Bioinformatics 15:85|
|Michel, Mirco; Hayat, Sikander; Skwark, Marcin J et al. (2014) PconsFold: improved contact predictions improve protein models. Bioinformatics 30:i482-8|
Showing the most recent 10 out of 12 publications