While automation is revolutionizing many aspects of biology, the determination of three-dimensional protein structure remains a long, hard, and expensive task. High-throughput structural genomics is required in order to apply modem techniques such as structure-based drug design on a much larger scale. Traditional (semi-) automated approaches to protein structure determination through nuclear magnetic resonance (NMR) spectroscopy require dozens of experiments and months of spectrometer time, making them unsuitable for high-throughput automation. One of the main bottlenecks in the determination of three-dimensional protein structures by NMR is the assignment of chemical shifts to atoms in a biopolymer. Therefore, high-throughput structure determination using NMR requires a systematic attack on the assignment problem. Novel algorithmic techniques are proposed for automated assignment and protein structure determination from sparse, unassigned NMR data, based on an approach called Jigsaw. The proposed research aims to minimize the number and types of NMR experiments that must be performed and the amount of human effort required to interpret the experimental results, while still producing an accurate analysis of the protein structure. To enable high-throughput data collection, the proposed methods utilize only a few fast, cheap NMIR experiments. The research will build on Jigsaw to develop a minimalist approach, demonstrating the large amount of information available in a few key spectra, and how it can be extracted using a combination of combinatorial and geometric algorithms. New algorithms and computer systems will be developed for determining protein structure from only four NMR spectra. The system will use algorithms similar to and adapted from physical geometric algorithms, pattern recognition and machine vision, signal processing, and robotics, in order to analyze spectra, assign spectral peaks to atom interactions, compute secondary structure, and estimate the global fold. Jigsaw will be extended to work on larger proteins, and tested on experimental NMR data. A novel probabilistic framework will be implemented to handle the increased spectral complexity and sparser information content obtained both for larger proteins, and in high-throughput NMR protocols.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM065982-05
Application #
7089793
Study Section
Molecular and Cellular Biophysics Study Section (BBCA)
Program Officer
Edmonds, Charles G
Project Start
2002-07-01
Project End
2006-09-15
Budget Start
2006-07-01
Budget End
2006-09-15
Support Year
5
Fiscal Year
2006
Total Cost
$76,329
Indirect Cost
Name
Dartmouth College
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
041027822
City
Hanover
State
NH
Country
United States
Zip Code
03755
Martin, Jeffrey W; Zhou, Pei; Donald, Bruce R (2015) Systematic solution to homo-oligomeric structures determined by NMR. Proteins 83:651-61
Chandola, Himanshu; Williamson, Tim E; Craig, Bruce A et al. (2014) Stoichiometries and affinities of interacting proteins from concentration series of solution scattering data: decomposition by least squares and quadratic optimization. J Appl Crystallogr 47:899-914
Reardon, Patrick N; Sage, Harvey; Dennison, S Moses et al. (2014) Structure of an HIV-1-neutralizing antibody target, the lipid-bound gp41 envelope membrane proximal region trimer. Proc Natl Acad Sci U S A 111:1391-6
Zeng, Jianyang; Zhou, Pei; Donald, Bruce Randall (2013) HASH: a program to accurately predict protein H? shifts from neighboring backbone shifts. J Biomol NMR 55:105-18
Gainza, Pablo; Roberts, Kyle E; Georgiev, Ivelin et al. (2013) OSPREY: protein design with ensembles, flexibility, and provable algorithms. Methods Enzymol 523:87-107
Donald, Bruce R; Levey, Christopher G; Paprotny, Igor et al. (2013) Planning and Control for Microassembly of Structures Composed of Stress-Engineered MEMS Microrobots. Int J Rob Res 32:218-246
Tripathy, Chittaranjan; Zeng, Jianyang; Zhou, Pei et al. (2012) Protein loop closure using orientational restraints from NMR data. Proteins 80:433-53
Chandola, Himanshu; Yan, Anthony K; Potluri, Shobha et al. (2011) NMR structural inference of symmetric homo-oligomers. J Comput Biol 18:1757-75
Martin, Jeffrey W; Yan, Anthony K; Bailey-Kellogg, Chris et al. (2011) A graphical method for analyzing distance restraints using residual dipolar couplings for structure determination of symmetric protein homo-oligomers. Protein Sci 20:970-85
Zeng, Jianyang; Roberts, Kyle E; Zhou, Pei et al. (2011) A Bayesian approach for determining protein side-chain rotamer conformations using unassigned NOE data. J Comput Biol 18:1661-79

Showing the most recent 10 out of 45 publications