The goal of this proposal is to comprehensively characterize the functional sequence elements encoded in the human genome that are recognized by 250 RNA binding proteins (RBPs) in two cell lines. To do this, we will generate stable HeLa-S3 and GM 12878 cell lines expressing epitope-tagged RBPs and determine the sub cellular localization pattern of each RBP. These cells will be used to perform CLIP-Seq assays to define genome-wide, and at single-nucleotide resolution, the RNA sequence elements recognized by 250 RBPs. The RNA sequence elements identified will be validated using sequence-based in vitro binding assays. Furthermore, ChlP-Seq will be performed for all nuclear localized RBPs to determine the regions of the genome and chromatin that each RBP associates with. These binding assays will be supplemented with functional assays in RBP-depleted cells that will be critical for assigning functions to the identified binding sites. These assays include RNA-Seq of total cellular RNA and RNA purified from various cellular fractions, ribosomal footprint profiling, and Gro-Seq. Together, these assays will provide functional information regarding the roles of each RBP in splicing, cleavage and polyadenylation, RNA stability, RNA editing, translation, RNA localization, and transcription. Bioinformatic analysis will be performed, largely using software generated by our group, to quantitate all assays and to associate functions to the sequence elements identified in the binding assays. Together, these experiments will provide a comprehensive and in depth measure of the functions of approximately half of the human RBPs and the functional sequence elements that they interact with. This project will fill a major gap in the catalog of functional elements encoded in the human genome that are being characterized by the ENCODE consortium. The product of this project will be a unique and valuable community resource that will push the field forward in new and exciting ways and will almost certainly create new paradigms regarding the functions of RBPs and RNA-protein networks in human biology and disease.
Many human diseases are caused by defects in RNA biology. However, our understanding of what causes such diseases is limited because we lack a fundamental knowledge of the function of most RNA binding proteins. Our comprehensive survey of the biology of 250 RNA binding proteins will provide significant new insight into RNA biology and increased understanding of the causes of many human diseases.
|Lambert, Nicole; Robertson, Alex; Jangi, Mohini et al. (2014) RNA Bind-n-Seq: quantitative assessment of the sequence and structural binding specificity of RNA binding proteins. Mol Cell 54:887-900|
|Gerstein, Mark B; Rozowsky, Joel; Yan, Koon-Kiu et al. (2014) Comparative analysis of the transcriptome across distant species. Nature 512:445-8|
|Lovci, Michael T; Ghanem, Dana; Marr, Henry et al. (2013) Rbfox proteins regulate alternative mRNA splicing through evolutionarily conserved RNA bridges. Nat Struct Mol Biol 20:1434-42|
|Bolisetty, Mohan T; Graveley, Brenton R (2013) Circuitous route to transcription regulation. Mol Cell 51:705-6|
|Plocik, Alex M; Graveley, Brenton R (2013) New insights from existing sequence data: generating breakthroughs without a pipette. Mol Cell 49:605-17|
|Braunschweig, Ulrich; Gueroussov, Serge; Plocik, Alex M et al. (2013) Dynamic integration of splicing within gene regulatory pathways. Cell 152:1252-69|