Eukaryotic cells respond to stress (heat, oxidation, starvation) by reorganizing proteins and RNA into massive, reversible assemblies?termed stress granules?thought to play important roles in stress survival. Stress granules contain dozens of RNA-binding proteins, most of which contain disordered, quasi-repetitive low- complexity regions (LCRs). Recent, revolutionary work has demonstrated that these low-complexity regions mediate or modulate the formation of large assemblies by specific biophysical processes: phase separation and hydrogel formation. The resulting liquid/gel assemblies lack the fixed stoichiometry of quaternary structures and have been termed quinary assemblies. Disease-associated mutations in LCRs perturb quinary assemblies, producing pathological fibrillarization associated with these mutations in human disease. Yet the constraints on LCRs, and how LCR sequences encode quinary behaviors, remain unclear. Evolutionary analyses grounded in the biophysics of quinary assembly are urgently needed. How do low-complexity sequences encode quinary assembly behavior? Which features of these highly variable sequences are under selection, and which reflect mutational processes long known to give rise to repetitive low-complexity sequences? How can we uncover coevolution between features in interacting LCRs, in the absence of reliable site-specific information? And how can we experimentally validate putative evolutionary constraints? We propose an integrative approach to answering these questions. This approach builds on our deep expertise in analyzing evolutionary constraints on protein sequences, and in evolutionary and empirical studies of protein aggregation, coupled with our recent work identifying and isolating stress-triggered phase-separation behavior in RNA binding proteins. We have recently discovered that poly(A)-binding protein (Pab1 in budding yeast), an abundant, conserved RNA-binding protein with a highly variable LCR, phase-separates in response to heat and pH stress. We discovered novel patterns of evolutionary constraint in this LCR, and showed that making mutations which systematically perturbing the conserved composition of this LCR result in coordinately perturbed phase-separation, gel formation by Pab1, and altered stress survival by the organism.
In Aim 1, we propose methods for quantifying these and related evolutionary constraints which generalizes broadly to sequences showing selection on amino-acid composition with weak or negligible selection on the ordering of amino acids. Based on substantial preliminary work, we propose specific methods to quantify selection linked to the biophysical features needed to promote phase separation while preventing pathological aggregation.
In Aim 2, we describe an experimental system for assessing the in vitro and in vivo effects of perturbing evolutionarily constrained LCR features.
In Aim 3, we propose new methods for quantifying covariation within and between LCRs. Together, this study will elucidate new, general, empirically validated methods for extracting information from low-complexity sequences which take into account their unique biology.

Public Health Relevance

Sudden stressful situations require a rapid response by the organism and, in many cases, by its cells. Cell stress causes proteins to form large clumps, a process also associated with many human disorders such as frontotemporal dementia. We are studying how protein sequences cause this behavior, using evolutionary information to understand what features produce helpful or harmful clumping.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
1R01GM127406-01
Application #
9498977
Study Section
Genetic Variation and Evolution Study Section (GVE)
Program Officer
Janes, Daniel E
Project Start
2018-05-01
Project End
2022-02-28
Budget Start
2018-05-01
Budget End
2019-02-28
Support Year
1
Fiscal Year
2018
Total Cost
Indirect Cost
Name
University of Chicago
Department
Biochemistry
Type
Schools of Medicine
DUNS #
005421136
City
Chicago
State
IL
Country
United States
Zip Code
60637