Recent years'CASP experiments have seen significant progress on computer-based protein tertiary structure prediction. The state-of-the-art algorithms, such as TASSER, can generate correct folds for 2/3 of single domain proteins without using homologous templates. Most of the models are of low resolution with a root mean square deviation (RMSD) 3-6 Angstroms, which may be useful for protein topology analysis and some level of biological function inference. However, such models are not suitable for reliable ligand screening. The proposed research seeks to develop new methodologies for the atomic-level high-resolution protein structure modeling.
The specific aims are: (1) development of a new algorithm, atom-TASSER, for atomic-level protein structure modeling;(2) development of a two-level composite potential for both the low-resolution topology construction and the high-resolution refinement;(3) force field optimization in both structure and sequence phase spaces;(4) large-scale benchmarking of atom-TASSER algorithm. Overall, these studies are designed to extend the range of validity of the reduced modeling approaches, and provide significant improvements in the state-of-the-art of tertiary structure prediction. The goal is to push up the level of accuracy of computer-based modeling, especially for the distant/weakly homologous proteins, so that they can be of real use to new drug screening and biochemical functional inference.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Project #
Application #
Study Section
Macromolecular Structure and Function D Study Section (MSFD)
Program Officer
Wehrle, Janna P
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Michigan Ann Arbor
Biostatistics & Other Math Sci
Schools of Medicine
Ann Arbor
United States
Zip Code
Quan, Shu; Wang, Lili; Petrotchenko, Evgeniy V et al. (2014) Super Spy variants implicate flexibility in chaperone action. Elife 3:e01584
Szilagyi, Andras; Zhang, Yang (2014) Template-based structure modeling of protein-protein interactions. Curr Opin Struct Biol 24:10-23
Zhang, Yang (2014) Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10. Proteins 82 Suppl 2:175-87
Yang, Jianyi; Roy, Ambrish; Zhang, Yang (2013) BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions. Nucleic Acids Res 41:D1096-103
Xue, Zhidong; Xu, Dong; Wang, Yan et al. (2013) ThreaDom: extracting protein domain boundary information from multiple threading alignments. Bioinformatics 29:i247-56
Mitra, Pralay; Shultis, David; Zhang, Yang (2013) EvoDesign: De novo protein design based on structural and evolutionary profiles. Nucleic Acids Res 41:W273-80
Xu, Dong; Zhang, Yang (2013) Toward optimal fragment generations for ab initio protein structure assembly. Proteins 81:229-39
Xu, Dong; Li, Hua; Zhang, Yang (2013) Protein depth calculation and the use for improving accuracy of protein fold recognition. J Comput Biol 20:805-16
Fan, Yong-Xian; Zhang, Yang; Shen, Hong-Bin (2013) LabCaS: labeling calpain substrate cleavage sites from amino acid sequence using conditional random fields. Proteins 81:622-34
Mitra, Pralay; Shultis, David; Brender, Jeffrey R et al. (2013) An evolution-based approach to De Novo protein design and case study on Mycobacterium tuberculosis. PLoS Comput Biol 9:e1003298

Showing the most recent 10 out of 51 publications