Synthetic protein families powerfully complement natural genetic diversity in efforts to understand fundamental sequence-structure-function relationships. We propose to use a unique family of chimeric cytochrome P450s made by structure-guided recombination to make advances in modeling P450 structure, stability and specificity. Differing from one another by many dozens of amino acids yet still likely to fold into highly similar structures, these diverse sequences have different stabilities and exhibit an array of activities and specificities. We seek to maximize the number of accurate structure, stability, and specificity predictions given a set number of experimental measurements that are relatively easily performed on these laboratory-generated sequences. We propose to use this unique library of P450 chimeras to study enzyme structural modularity and exploit that modularity to simplify structure-function predictions. We will determine the X-ray crystal structures for several chimeric P450 heme domains. The crystal structures will assist our concurrent efforts to computationally predict accurate chimera structures. We will test new algorithms that recombine optimized enzyme substructures to sample relevant protein backbone conformations. Our goal is to bridge the gap between low resolution homology models and accurate high resolution structures. The chimeric enzymes are also a rich resource for investigating sequence-structure-stability relationships. The chimeric proteins have diverse stabilities;many are more stable than their parents. We plan to measure stability for selected chimeras and construct and test additive models for protein stability and quantify non- additive coupling effects. Recombination creates sequence intermediates between parent proteins and allows us to reconcile the effects of swapping blocks of sequence with the action of individual mutations. This work will contribute key data and insight into the effects of long-range interactions and coupling. Finally, we plan to explore the functional diversity of the chimeric cytochrome P450s, in the context of their applications in drug discovery and drug metabolism. These soluble, well-expressed chimeric P450s are a viable platform for production of authentic human metabolites of drugs and for lead diversification. Preliminary results indicate that the diverse sequences exhibit activity on a wide range of substrates. We will perform a systematic analysis of the activities of a set of chimeric P450s towards pharmaceutically relevant scaffolds, using clustering and regression to determine whether activity and specificity can be predicted from data on selected chimeras. We will attempt to rationalize the P450 specificity results in terms of structural P450- substrate complex models. This research continues an effective interdisciplinary collaboration that partners powerful experimental and theoretical capabilities.

Public Health Relevance

We propose to demonstrate how a laboratory-generated family of cytochrome P450s can be used to study the basis of P450 structure, stability, and function. We will test a new strategy for predicting the interactions between drugs and P450 enzymes, a potential breakthrough that would allow us to predict drug metabolism and interactions and to efficiently produce drug metabolites for toxicity studies and drug discovery. Finally, this research represents a fundamentally new approach to study protein stability using recombination, with results that can be applied to understanding enzymes in general.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
2R01GM068664-05A1
Application #
7528039
Study Section
Macromolecular Structure and Function A Study Section (MSFA)
Program Officer
Wehrle, Janna P
Project Start
2003-07-15
Project End
2011-06-30
Budget Start
2009-07-16
Budget End
2010-06-30
Support Year
5
Fiscal Year
2009
Total Cost
$323,241
Indirect Cost
Name
California Institute of Technology
Department
Chemistry
Type
Schools of Engineering
DUNS #
009584210
City
Pasadena
State
CA
Country
United States
Zip Code
91125
Romero, Philip A; Shapiro, Mikhail G; Arnold, Frances H et al. (2013) Directed evolution of protein-based neurotransmitter sensors for MRI. Methods Mol Biol 995:193-205
Heinzelman, Pete; Romero, Philip A; Arnold, Frances H (2013) Efficient sampling of SCHEMA chimera families to identify useful sequence elements. Methods Enzymol 523:351-68
Romero, Philip A; Arnold, Frances H (2012) Random field model reveals structure of the protein recombinational landscape. PLoS Comput Biol 8:e1002713
Romero, Philip A; Stone, Everett; Lamb, Candice et al. (2012) SCHEMA-designed variants of human Arginase I and II reveal sequence elements important to stability and catalysis. ACS Synth Biol 1:221-8
Jung, Sang Taek; Lauchli, Ryan; Arnold, Frances H (2011) Cytochrome P450: taming a wild type enzyme. Curr Opin Biotechnol 22:809-17
Lewis, Jared C; Coelho, Pedro S; Arnold, Frances H (2011) Enzymatic functionalization of carbon-hydrogen bonds. Chem Soc Rev 40:2003-21
Lewis, Jared C; Mantovani, Simone M; Fu, Yu et al. (2010) Combinatorial alanine substitution enables rapid optimization of cytochrome P450BM3 for selective hydroxylation of large substrates. Chembiochem 11:2502-5
Jackel, Christian; Bloom, Jesse D; Kast, Peter et al. (2010) Consensus protein design without phylogenetic bias. J Mol Biol 399:541-6
Acar, Murat; Pando, Bernardo F; Arnold, Frances H et al. (2010) A general mechanism for network-dosage compensation in gene circuits. Science 329:1656-60
Shapiro, Mikhail G; Westmeyer, Gil G; Romero, Philip A et al. (2010) Directed evolution of a magnetic resonance imaging contrast agent for noninvasive imaging of dopamine. Nat Biotechnol 28:264-70

Showing the most recent 10 out of 24 publications