Decoding the information in the primary sequence of a protein is one of the most fundamental challenges in modern biology. A protein's sequence encodes more than just the native structure; it encodes the entire energy landscape - an ensemble of conformations whose energetics and dynamics are finely tuned. The goal of this proposal is a molecular, quantitative, and predictive understanding of the relationship between sequence and the energy landscape together with an understanding of how the environment modulates this landscape. A major hurdle in going from sequence to function is our lack of understanding of the non-native or high- energy regions of the landscape and how they are modulated by the environment. High-energy conformations are important for directing the stability and folding of a protein, and modulations of this ensemble play a role in misfolding, protein signaling, catalytic activity, and allostery. While, many sequences can encode the same structure, their function and dynamics can vary dramatically - due to changes in the landscape. Small variations in a sequence can have effects that range from undetectable to pathological. Soon we will have access to thousands of human genomes, and without our ability to interpret variation, the potential of these data to impact medicine and human health will never be fully appreciated. It is imperative, therefore, that we have an understanding and control over the relationship between sequence and the energy landscape. Modulations of the energy landscape are not easily detected due to the small populations and transient nature of the high-energy species. The experiments outlined here are aimed at understanding how changes in the sequence and the environment affect the energy landscape.
Aim 1 : Quantitative measures of protein folding and stability in complex environments a. Develop a quantitative bench-top measure of protein stability on the ribosome and other complex mixtures. b. Measure conformational dynamics of ribosome-bound polypeptide chains c. Monitor translational coupled folding using HaloTag as a model system Aim 2: Probe the energy landscape through evolution and sequence modulation a. Use Ancestral Sequence Reconstruction (ASR) to explore changes in the landscape of RNase H over time. We will evaluate the energy landscapes of these resurrected proteins to determine how optimizations of the energy landscape, and thus function/fitness, occur over evolutionary time. b. Use Ancestral Sequence Reconstruction to evaluate the rate-limiting step by using the alpha-lytic protease family of both kinetically stable and thermodynamically stable proteases Aim 3: Probe the energy landscape through single molecule mechanical unfolding a. Single molecule mechanical studies to probe the energy barriers in protein folding

Public Health Relevance

Decoding the information in a protein's primary sequence is one of the most fundamental challenges in modern biology; this sequence of amino acids specifies the energy landscape, which describes all of a protein' conformations and dynamics and is critical for function. Without our ability to interpret the effects of sequence variation onthe energy landscape, the impact of the explosion in human genome sequences will never be realized. The goal of this proposal is a molecular, quantitative, and predictive understanding of the relationship between sequence and the energy landscape, together with a predictive understanding of how the environment modulates this landscape.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM050945-23
Application #
9152809
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Wehrle, Janna P
Project Start
1994-05-01
Project End
2019-08-31
Budget Start
2016-09-01
Budget End
2017-08-31
Support Year
23
Fiscal Year
2016
Total Cost
Indirect Cost
Name
University of California Berkeley
Department
Biochemistry
Type
Schools of Arts and Sciences
DUNS #
124726725
City
Berkeley
State
CA
Country
United States
Zip Code
94704
Guinn, Emily J; Tian, Pengfei; Shin, Mia et al. (2018) A small single-domain protein folds through the same pathway on and off the ribosome. Proc Natl Acad Sci U S A 115:12206-12211
Guinn, Emily J; Marqusee, Susan (2018) Exploring the Denatured State Ensemble by Single-Molecule Chemo-Mechanical Unfolding: The Effect of Force, Temperature, and Urea. J Mol Biol 430:450-464
Lim, Shion An; Bolin, Eric Richard; Marqusee, Susan (2018) Tracing a protein's folding pathway over evolutionary time using ancestral sequence reconstruction and hydrogen exchange. Elife 7:
Lim, Shion A; Marqusee, Susan (2018) The burst-phase folding intermediate of ribonuclease H changes conformation over evolutionary history. Biopolymers 109:e23086
Samelson, Avi J; Bolin, Eric; Costello, Shawn M et al. (2018) Kinetic and structural comparison of a protein's cotranslational folding and refolding pathways. Sci Adv 4:eaas9098
Zhang, Yongli; Ha, Taekjip; Marqusee, Susan (2018) Editorial Overview: Single-Molecule Approaches up to Difficult Challenges in Folding and Dynamics. J Mol Biol 430:405-408
Hamadani, Kambiz M; Howe, Jesse; Jensen, Madeleine K et al. (2017) An in vitro tag-and-modify protein sample generation method for single-molecule fluorescence resonance energy transfer. J Biol Chem 292:15636-15648
Samelson, Avi J; Jensen, Madeleine K; Soto, Randy A et al. (2016) Quantitative determination of ribosome nascent chain stability. Proc Natl Acad Sci U S A 113:13402-13407
Wheeler, Lucas C; Lim, Shion A; Marqusee, Susan et al. (2016) The thermostability and specificity of ancient proteins. Curr Opin Struct Biol 38:37-43
Lim, Shion A; Hart, Kathryn M; Harms, Michael J et al. (2016) Evolutionary trend toward kinetic stability in the folding trajectory of RNases H. Proc Natl Acad Sci U S A 113:13045-13050

Showing the most recent 10 out of 39 publications