RNA molecules guide cellular processes at many levels of gene expression. They do so through specific molecular interactions with RNA-binding proteins (RBPs) to form multi-protein ribonucleoprotein complexes (RNPs). Each RBP exhibits a preference for binding to a particular set of RNA sequences, but this set could be many. We will establish the biophysical parameters governing the molecular interactions that take place between specific RBPs and their specific RNA sequence targets in mammalian cells. This specificity is typically a continuous function, with the tightest binders accompanied by less and less tightly bound sequences. These latter are likely to compete for RBPs in the cell and indeed their weaker binding may be important to their function, as for example a transient binding that is a prerequisite in the assembly of a more stable multi-protein complex. Through the use of high throughput DNA sequencing technology (Illumina) we will measure biophysical binding constants for all 65,536 possible RNA sequences of 8 bases;a length of 8 is somewhat larger than a typical RNA target. These binding parameters will be comprised of the equilibrium dissociation constant (Kd) to measure overall tightness, and the important kinetic constants of the on-rate and the off-rate, a measure of accessibility and stability, respectively. Recombinant RBPs will be expressed in cultured mammalian cells and then purified and covalently immobilized on beads in a single step following transient transfection. The beads will be exposed to a library of RNA molecules comprising all possible 8-mers. The bead-bound molecules will be isolated after various times as well as at equilibrium and the number of molecules of each RNA sequence bound will be determined by deep sequencing. After establishment of the methodology using one model RBP (the pre- mRNA splicing factor Fox-2), the method will be extended to additional related splicing factors. As this sort of data is accumulated for additional RBPs, we will be able to model RNA-protein interactions that occur in the cell and better understand the combinatorial factors that make these RNPs so complex. In addition, such fundamental information will allow us to better predict how disturbances in RNA metabolism, commonly seen in genetic diseases and in cancer, can affect cell behavior.

Public Health Relevance

RNA molecules in complexes with proteins carry out a myriad of cellular functions, most having to do with gene expression, and these functions are often defective in genetic disease and in cancer cells. In this proposal we seek to understand how specific RNA molecules combine with their specific protein partners to form these functional complexes. Using high throughput technology, we will determine the basis of this specificity by quantifying the tightness of this interaction for tens of thousands of RNA sequences in a single experiment.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
3R01GM072740-07S1
Application #
8573046
Study Section
Special Emphasis Panel (ZGM1-CBB-0 (MI))
Program Officer
Bender, Michael T
Project Start
2005-08-01
Project End
2014-08-31
Budget Start
2012-09-01
Budget End
2013-08-31
Support Year
7
Fiscal Year
2013
Total Cost
$10,000
Indirect Cost
$3,750
Name
Columbia University (N.Y.)
Department
Biology
Type
Other Domestic Higher Education
DUNS #
049179401
City
New York
State
NY
Country
United States
Zip Code
10027
Arias, Mauricio A; Ke, Shengdong; Chasin, Lawrence A (2010) Splicing by cell type. Nat Biotechnol 28:686-7
Zhang, Xiang H-F; Arias, Mauricio A; Ke, Shengdong et al. (2009) Splicing of designer exons reveals unexpected complexity in pre-mRNA splicing. RNA 15:367-76