Integrated Prediction and Validation of Protein Structures

Cheng, Jianlin; Tanner, John

Abstract

Knowledge of three-dimensional protein structure is indispensable in biomedical research. Protein structure and function are intimately linked, and thus structure facilitates drug discovery, aids investigations of protein-protein interactions, informs mutagenesis analysis, guides protein engineering and the design of new proteins, and provides a foundation for understanding the molecular basis of disease. However, the number of protein sequences available in the genomic era far exceeds the capacity of the main experimental structure determination techniques of X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy, resulting in a substantial sequence- structure gap. We address this ever-widening gap by developing and disseminating novel protein structure modeling tools. This renewal project is a new collaboration between experts in computational modeling (Cheng) and experimental structural biology (Tanner). We plan to develop innovative, integrated machine learning (e.g., deep learning), data mining and statistical modeling methods to address major challenges in both template-based structure modeling and template-free (ab initio) structure modeling. We will apply these tools to enzymes in the aldehyde dehydrogenase (ALDH) superfamily, a group of enzymes that are involved in numerous important biological processes and implicated in many diseases due to mutations. The ALDH models will be experimentally validated using X-ray crystallography and biochemical assays. Furthermore, we will combine the modeling power of our structural Input-Output hidden Markov model with experimental small- angle X-ray scattering (SAXS) to predict the tertiary structures of large multi-domain proteins. The integration of computational and experimental sciences in this project positions us uniquely in structure modeling space.

Public Health Relevance

Three-dimensional protein structure information is indispensable in modern biomedical research. However, gene sequencing technology has far exceeded the capacity of experimental protein structure determination methods, giving rise to an ever-widening sequence-structure gap. This project addresses the gap by developing new computational methods for predicting protein structure, validating these methods with experiments, and disseminating the methods freely through user-friendly tools and web services.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 5R01GM093123-06
Application #: 9119094
Study Section: Biodata Management and Analysis Study Section (BDMA)
Program Officer: Krepkiy, Dmitriy

Project Start: 2010-06-01
Project End: 2019-07-31
Budget Start: 2016-08-01
Budget End: 2017-07-31
Support Year: 6
Fiscal Year: 2016
Total Cost
Indirect Cost

Institution

Name: University of Missouri-Columbia
Department: Biostatistics & Other Math Sci
Type: Biomed Engr/Col Engr/Engr Sta
DUNS #: 153890272

City: Columbia
State: MO
Country: United States
Zip Code: 65211

Related projects


NIH 2020 R01 GM	Distance-based ab initio protein structure prediction Cheng, Jianlin / University of Missouri-Columbia
NIH 2018 R01 GM	Integrated Prediction and Validation of Protein Structures Cheng, Jianlin; Tanner, John J. / University of Missouri-Columbia
NIH 2017 R01 GM	Integrated Prediction and Validation of Protein Structures Cheng, Jianlin; Tanner, John J. / University of Missouri-Columbia
NIH 2016 R01 GM	Integrated Prediction and Validation of Protein Structures Cheng, Jianlin; Tanner, John J. / University of Missouri-Columbia
NIH 2015 R01 GM	Integrated Prediction and Validation of Protein Structures Cheng, Jianlin; Tanner, John J. / University of Missouri-Columbia	$326,428
NIH 2013 R01 GM	Integrated Prediction of Protein Struture at 1D, 2D and 3D Levels Cheng, Jianlin / University of Missouri-Columbia	$283,648
NIH 2012 R01 GM	Integrated Prediction of Protein Struture at 1D, 2D and 3D Levels Cheng, Jianlin / University of Missouri-Columbia	$294,112
NIH 2011 R01 GM	Integrated Prediction of Protein Struture at 1D, 2D and 3D Levels Cheng, Jianlin / University of Missouri-Columbia	$290,613
NIH 2010 R01 GM	Integrated Prediction of Protein Struture at 1D, 2D and 3D Levels Cheng, Jianlin / University of Missouri-Columbia	$293,715

Publications

Korasick, David A; White, Tommi A; Chakravarthy, Srinivas et al. (2018) NAD+ promotes assembly of the active tetramer of aldehyde dehydrogenase 7A1. FEBS Lett 592:3229-3238

Adhikari, Badri; Hou, Jie; Cheng, Jianlin (2018) DNCON2: improved protein contact prediction using two-level deep convolutional neural networks. Bioinformatics 34:1466-1472

Hou, Jie; Adhikari, Badri; Cheng, Jianlin (2018) DeepSF: deep convolutional neural network for mapping protein sequences to folds. Bioinformatics 34:1295-1303

Liu, Li-Kai; Tanner, John J (2018) Crystal Structure of Aldehyde Dehydrogenase 16 Reveals Trans-Hierarchical Structural Similarity and a New Dimer. J Mol Biol :

Adhikari, Badri; Cheng, Jianlin (2018) CONFOLD2: improved contact-driven ab initio protein structure modeling. BMC Bioinformatics 19:22

Korasick, David A; Kon?itíková, Radka; Kope?ná, Martina et al. (2018) Structural and Biochemical Characterization of Aldehyde Dehydrogenase 12, the Last Enzyme of Proline Catabolism in Plants. J Mol Biol :

Adhikari, Badri; Hou, Jie; Cheng, Jianlin (2018) Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning. Proteins 86 Suppl 1:84-96

Keasar, Chen; McGuffin, Liam J; Wallner, Björn et al. (2018) An analysis and evaluation of the WeFold collaborative for protein structure prediction and its pipelines in CASP11 and CASP12. Sci Rep 8:9939

Cao, Renzhi; Adhikari, Badri; Bhattacharya, Debswapna et al. (2017) QAcon: single model quality assessment using protein structural and contact information with machine learning techniques. Bioinformatics 33:586-588

Adhikari, Badri; Bhattacharya, Debswapna; Cao, Renzhi et al. (2017) Assessing Predicted Contacts for Building Protein Three-Dimensional Models. Methods Mol Biol 1484:115-126

Showing the most recent 10 out of 77 publications

Comments

Be the first to comment on Jianlin Cheng's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: