The goal of this proposal is a molecular level description of protein unfolding, and by extension folding, using realistic molecular dynamics (MD) simulations in solution. Both the general and sequence-specific rules of unfolding will be pursued. The general rules will be investigated by making use of a large database of protein unfolding trajectories that already exist in the lab. In addition, new trajectories will be added. So far, this database contains nearly 11,000 simulations of more than 2200 protein and peptide systems. This repository represents the largest collection of protein simulations and protein structures in the world. The simulations were designed so that representatives of all proteins folds will eventually be investigated, working from the most to least populated folds. The current set represents over 80% of all known protein structures. We have already developed a novel relational/multidimensional database to house these data.
Specific Aim 1 of this proposal seeks to determine the general rules of protein unfolding by mining this database. In addition, multiple representatives of highly populated folds are being investigated to determine sequence-specific effects in Specific Aim 2. Our hypothesis is that all-atom molecular dynamics simulations of isolated proteins in solution can provide continuous and realistic protein unfolding pathways and that the general rules for unfolding and folding can be determined once a large number of protein folds have been simulated. While most relatives within a fold family fold by the same mechanism based on experimental studies, there are some exceptions. Consequently, sequence-specific effects will be determined by investigating multiple members of four common fold families with different architectures.
Specific Aim 3 focuses on the unfolding of structural motifs in isolation and in different structural contexts, i.e., within different structures. Finally, Specific Aim 4 focuses on characterizing the unfolding behavior and sequence determinants of unfolding for a pair of designed proteins with high sequence identity but they adopt different folds with different functions.

Public Health Relevance

Protein folding remains one of the most important unsolved problems in molecular biology, and it represents an important missing link necessary for full utilization of the information becoming available from the mapping of genomic sequences. Characterization of the unfolding process is equally important, both from the perspective of fully understanding a fundamental biochemical phenomenon and for the light shed on the folding process. An understanding of protein folding/unfolding also has important implications for all biological processes, including protein degradation, protein translocation, aging, and many human diseases, including amyloid diseases and single-nucleotide polymorphism associated disorders.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Project #
Application #
Study Section
Macromolecular Structure and Function B Study Section (MSFB)
Program Officer
Wehrle, Janna P
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Washington
Biomedical Engineering
Schools of Engineering
United States
Zip Code
Rysavy, Steven J; Bromley, Dennis; Daggett, Valerie (2014) DIVE: a graph-based visual-analytics framework for big data. IEEE Comput Graph Appl 34:26-37
Merkley, Eric D; Rysavy, Steven; Kahraman, Abdullah et al. (2014) Distance restraints from crosslinking mass spectrometry: mining a molecular dynamics simulation database to evaluate lysine-lysine distances. Protein Sci 23:747-59
Bromley, Dennis; Rysavy, Steven J; Su, Robert et al. (2014) DIVE: a data intensive visualization engine. Bioinformatics 30:593-5
Bromley, Dennis; Anderson, Peter C; Daggett, Valerie (2013) Structural consequences of mutations to the *-tocopherol transfer protein associated with the neurodegenerative disease ataxia with vitamin E deficiency. Biochemistry 52:4264-73
Wang, Dan; McCully, Michelle E; Luo, Zhaoxiong et al. (2013) Structural and functional consequences of cardiac troponin C L57Q and I61Q Ca(2+)-desensitizing variants. Arch Biochem Biophys 535:68-75
Rizzuti, Bruno; Daggett, Valerie (2013) Using simulations to provide the framework for experimental protein folding studies. Arch Biochem Biophys 531:128-35
McCully, Michelle E; Beck, David A C; Daggett, Valerie (2013) Promiscuous contacts and heightened dynamics increase thermostability in an engineered variant of the engrailed homeodomain. Protein Eng Des Sel 26:35-45
Schmidlin, Tom; Ploeger, Ken; Jonsson, Amanda L et al. (2013) Early steps in thermal unfolding of superoxide dismutase 1 are similar to the conformational changes associated with the ALS-associated A4V mutation. Protein Eng Des Sel 26:503-13
Dar, Tanveer A; Schaeffer, R Dustin; Daggett, Valerie et al. (2011) Manifestations of native topology in the denatured state ensemble of Rhodopseudomonas palustris cytochrome c'. Biochemistry 50:1029-41
Calhoun, Sara; Daggett, Valerie (2011) Structural effects of the L145Q, V157F, and R282W cancer-associated mutations in the p53 DNA-binding core domain. Biochemistry 50:5345-53

Showing the most recent 10 out of 28 publications