Integrated Prediction of Protein Struture at 1D, 2D and 3D Levels

Cheng, Jianlin

Abstract

Computational prediction of protein structure from the amino acid sequence is one of the most important and challenging problems in bioinformatics and computational biology. With the exponential growth of protein sequences without solved protein structures in the post-genomic era, accurate protein structure prediction methods and tools are in urgent need. Here, we propose to develop an integrated approach to advance protein structure prediction at the 1-dimensional (1D), 2-dimensional (2D) and 3-dimensional (3D) levels. At the 1D level, novel information such as domain evolution signals, alternative gene splicing sites, and 2D protein contact map will be used to predict protein domain boundaries from the sequences. At the 2D level, new methods such as residue contact propagation, machine learning boosting, linear programming, and Markov Chain Monte Carlo simulations will be used to advance residue-residue contact prediction for a domain, or a protein. At the 3D level, 2D contact prediction, fold recognition via machine learning, and multi-template combination will be used to enhance both template-based and ab initio structure prediction. Finally, knowledge-based statistical machine learning methods and model combination algorithms will be developed to reliably evaluate and refine the quality of predicted protein structural models. One of several innovative aspects of this approach is to integrate 1D, 2D, and 3D predictions in order to improve each other through protein structural unit - domains. The 1D, 2D, and 3D protein structure prediction methods will be implemented as user-friendly software packages and web services released to the scientific community. These tools and web services will be useful for protein structure prediction, structure determination, functional analysis, protein engineering, protein mutagenesis analysis, and protein design.

Public Health Relevance

The project will develop accurate computational methods and tools for basic biomedical research such as protein structure prediction, protein function analysis, protein design, protein engineering, and structure-based drug design.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 1R01GM093123-01
Application #: 7863766
Study Section: Biodata Management and Analysis Study Section (BDMA)
Program Officer: Brazhnik, Paul

Project Start: 2010-06-01
Project End: 2014-05-31
Budget Start: 2010-06-01
Budget End: 2011-05-31
Support Year: 1
Fiscal Year: 2010
Total Cost: $293,715
Indirect Cost

Institution

Name: University of Missouri-Columbia
Department: Biostatistics & Other Math Sci
Type: Schools of Engineering
DUNS #: 153890272

City: Columbia
State: MO
Country: United States
Zip Code: 65211

Related projects


NIH 2020 R01 GM	Distance-based ab initio protein structure prediction Cheng, Jianlin / University of Missouri-Columbia
NIH 2018 R01 GM	Integrated Prediction and Validation of Protein Structures Cheng, Jianlin; Tanner, John J. / University of Missouri-Columbia
NIH 2017 R01 GM	Integrated Prediction and Validation of Protein Structures Cheng, Jianlin; Tanner, John J. / University of Missouri-Columbia
NIH 2016 R01 GM	Integrated Prediction and Validation of Protein Structures Cheng, Jianlin; Tanner, John J. / University of Missouri-Columbia
NIH 2015 R01 GM	Integrated Prediction and Validation of Protein Structures Cheng, Jianlin; Tanner, John J. / University of Missouri-Columbia	$326,428
NIH 2013 R01 GM	Integrated Prediction of Protein Struture at 1D, 2D and 3D Levels Cheng, Jianlin / University of Missouri-Columbia	$283,648
NIH 2012 R01 GM	Integrated Prediction of Protein Struture at 1D, 2D and 3D Levels Cheng, Jianlin / University of Missouri-Columbia	$294,112
NIH 2011 R01 GM	Integrated Prediction of Protein Struture at 1D, 2D and 3D Levels Cheng, Jianlin / University of Missouri-Columbia	$290,613
NIH 2010 R01 GM	Integrated Prediction of Protein Struture at 1D, 2D and 3D Levels Cheng, Jianlin / University of Missouri-Columbia	$293,715

Publications

Keasar, Chen; McGuffin, Liam J; Wallner, Björn et al. (2018) An analysis and evaluation of the WeFold collaborative for protein structure prediction and its pipelines in CASP11 and CASP12. Sci Rep 8:9939

Korasick, David A; White, Tommi A; Chakravarthy, Srinivas et al. (2018) NAD+ promotes assembly of the active tetramer of aldehyde dehydrogenase 7A1. FEBS Lett 592:3229-3238

Adhikari, Badri; Hou, Jie; Cheng, Jianlin (2018) DNCON2: improved protein contact prediction using two-level deep convolutional neural networks. Bioinformatics 34:1466-1472

Hou, Jie; Adhikari, Badri; Cheng, Jianlin (2018) DeepSF: deep convolutional neural network for mapping protein sequences to folds. Bioinformatics 34:1295-1303

Liu, Li-Kai; Tanner, John J (2018) Crystal Structure of Aldehyde Dehydrogenase 16 Reveals Trans-Hierarchical Structural Similarity and a New Dimer. J Mol Biol :

Adhikari, Badri; Cheng, Jianlin (2018) CONFOLD2: improved contact-driven ab initio protein structure modeling. BMC Bioinformatics 19:22

Korasick, David A; Kon?itíková, Radka; Kope?ná, Martina et al. (2018) Structural and Biochemical Characterization of Aldehyde Dehydrogenase 12, the Last Enzyme of Proline Catabolism in Plants. J Mol Biol :

Adhikari, Badri; Hou, Jie; Cheng, Jianlin (2018) Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning. Proteins 86 Suppl 1:84-96

Cao, Renzhi; Adhikari, Badri; Bhattacharya, Debswapna et al. (2017) QAcon: single model quality assessment using protein structural and contact information with machine learning techniques. Bioinformatics 33:586-588

Adhikari, Badri; Bhattacharya, Debswapna; Cao, Renzhi et al. (2017) Assessing Predicted Contacts for Building Protein Three-Dimensional Models. Methods Mol Biol 1484:115-126

Showing the most recent 10 out of 77 publications

Comments

Be the first to comment on Jianlin Cheng's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: