The splicing together of exons during the transcription of most eukaryotic pre-mRNA molecules is a fundamental step in the transfer of information from DNA to protein. However, how the splice sites are recognized to initiate this fast and accurate process in not at all clear, since false sequences that resemble real sites outnumber the latter by 1 or 2 orders of magnitude yet are not used. We are searching for the additional information that specifies splice site recognition during pre-mRNA splicing in mammalian cells. In previous work we have used computational analysis to discover that: 1) that exon flanks 50 nt beyond the splice site consensuses can greatly influence splicing and contain sequences that stand out from the background; and 2) there are a large number of specific 8 mers that can act either as exonic splicing enhancers or splicing silencers. We will characterize both of these types of sequences, defining their base requirements, extent, position dependence, and specificity. With regard to the last, we will develop a novel exhaustive survey method that uses massively parallel solid state sequencing to identify every possible 8-mer that can be functional in a particular context, and we will compare this profile in different intronic, exonic and cellular contexts for constitutive and alternatively spliced exons. The flanking sequences fall into 2 distinct classes according to their G+C contents; we will test the idea that the rules governing exon definition differ for the two classes. To simplify the discovery of rules that govern exon definition, we will create designer exons made up of synthetic 8-mer modules we have identified as enhancers, silencers or neutral sequences. The numbers and spacing of these modules will reveal relationships that are hidden in the complexity of natural sequences. Finally, we will continue to use computation, including new machine learning algorithms, to better define intronic elements and to discover interactions between two or more features of a local sequence that define a splice site. We will also develop a Web-based exon-finding program that integrates all the information we have accumulated. Pre-mRNA splicing goes awry in a large proportion or human genetic disease cases. An understanding of the rules that govern the recognition of splice sites should help in the design of therapeutic intervention strategies to reverse such ill effects. ? ? ? ? ? ? ? ?

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM072740-02
Application #
7105514
Study Section
Molecular Genetics C Study Section (MGC)
Program Officer
Rhoades, Marcus M
Project Start
2005-08-01
Project End
2009-07-31
Budget Start
2006-08-01
Budget End
2007-07-31
Support Year
2
Fiscal Year
2006
Total Cost
$298,611
Indirect Cost
Name
Columbia University (N.Y.)
Department
Biology
Type
Other Domestic Higher Education
DUNS #
049179401
City
New York
State
NY
Country
United States
Zip Code
10027
Ke, Shengdong; Anquetil, Vincent; Zamalloa, Jorge Rojas et al. (2018) Saturation mutagenesis reveals manifold determinants of exon definition. Genome Res 28:11-24
Arias, Mauricio A; Lubkin, Ashira; Chasin, Lawrence A (2015) Splicing of designer exons informs a biophysical model for exon definition. RNA 21:213-29
Ke, Shengdong; Shang, Shulian; Kalachikov, Sergey M et al. (2011) Quantitative evaluation of all hexamers as exonic splicing elements. Genome Res 21:1360-74
Ke, Shengdong; Chasin, Lawrence A (2011) Context-dependent splicing regulation: exon definition, co-occurring motif pairs and tissue specificity. RNA Biol 8:384-8
Ke, Shengdong; Chasin, Lawrence A (2010) Intronic motif pairs cooperate across exons to promote pre-mRNA splicing. Genome Biol 11:R84
Arias, Mauricio A; Ke, Shengdong; Chasin, Lawrence A (2010) Splicing by cell type. Nat Biotechnol 28:686-7
Zhang, Xiang H-F; Arias, Mauricio A; Ke, Shengdong et al. (2009) Splicing of designer exons reveals unexpected complexity in pre-mRNA splicing. RNA 15:367-76