This proposal describes the development and application of rare event methods that will open the door to treating classes of biomolecular problems that are challenging for existing methods owing to their multipathway character. These include problems in protein folding and aggregation that remain outstanding, despite enormous progress in those areas in recent years. The key idea is to rigorously characterize the error behavior of the methods (in particular in the low temperature limit) to derive criteria tat can be used to adaptively refocus sampling for maximal efficiency. The properties of the methods that we seek are as follows: (i) they will be capable of operating with more than a few reaction coordinates to allow treating problems without obvious choices of descriptors, (ii) they will efficiently sample reaction mechanisms that proceed via a complex network of transitions rather than a single dominant pathway, (iii) they will be fully non-equilibrium, i.e., they will generate reactive trajectories and the statistical properties of those trajectories including rates and other dynamical quantities such as time correlation functions, and (iv) the results that they provide will not depend on the particular choice of reaction coordinates (though the convergence rate typically will) or on any Markov assumptions. The proposed methods will be applied to two protein folding problems that are beyond present enhanced sampling methods. The first system is beta3s, a 20-residue designed three-stranded antiparallel beta-sheet miniprotein. We choose this system because data on its folding and unfolding exists from extensive physically weighted molecular dynamics simulations, and analyses by several groups show that it has a well-defined native state and complex dynamics that are characteristic of much larger proteins. The second system is much less well characterized, but improved means for modeling it can provide information of major biomedical significance: we will map the states and transitions accessible to monomers and oligomers of alpha-synuclein, a 140-residue intrinsically disordered protein that is linked to a number of neurodegenerative diseases, including Parkinson's Disease and Dementia with Lewy Bodies.

Public Health Relevance

The protein folding and aggregation processes underlying many neurodegenerative diseases are not amenable to modeling by present computational techniques. We will use rigorous mathematical error analysis to develop a new class of computational methods for molecular simulation that promises to overcome the shortcommings of existing methods. The resulting techniques will be applied to studying the dynamics of proteins linked to Parkinson's Disease and Dementia with Lewy Bodies.

National Institute of Health (NIH)
National Institute of General Medical Sciences (NIGMS)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZGM1-BBCB-5 (BM))
Program Officer
Wehrle, Janna P
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Chicago
Biostatistics & Other Math Sci
Schools of Arts and Sciences
United States
Zip Code
Chen, Yunjie; Kale, Seyit; Weare, Jonathan et al. (2016) Multiple Time-Step Dual-Hamiltonian Hybrid Molecular Dynamics - Monte Carlo Canonical Propagation Algorithm. J Chem Theory Comput 12:1449-58
Thiede, Erik H; Van Koten, Brian; Weare, Jonathan et al. (2016) Eigenvector method for umbrella sampling enables error analysis. J Chem Phys 145:084115
Thiede, Erik; VAN Koten, Brian; Weare, Jonathan (2015) SHARP ENTRYWISE PERTURBATION BOUNDS FOR MARKOV CHAINS. SIAM J Matrix Anal Appl 36:917-941
Kale, Seyit; Sode, Olaseni; Weare, Jonathan et al. (2014) Finding Chemical Reaction Paths with a Multilevel Preconditioning Protocol. J Chem Theory Comput 10:5467-5475