Recently, the RNA-seq technology is increasingly replacing microarray for expression profiling. In this proposal, we will timely address the emerging challenges and opportunities brought by the rapidly accumulating RNA-seq data. We will design novel methods to perform integrative analysis of many RNA-seq datasets to study the functions and regulations of alternative splicing. In particular, we have the following specifi aims: (1) We will develop a novel graph- based pattern mining method to reconstruct an atlas of splicing modules and identify the associated experimental conditions in human, mouse, fly, and yeast. (2) We will study the coupling between transcription and splicing, the two important regulatory processes, by exploiting both expression and splicing information provided by RNA-seq data. We will design a novel multi-layer network mining approach to systematically identify coupled transcription- splicing modules. (3) We will predict the functions of alternatively spliced transcripts to establish a high-resolution function annotation of human genome. The predicted functions will be incorporated into the GeneOntology and the Ensembl databases to benefit the biological community. (4) We will perform experimental validation on a subset of computational predictions made in Aims 1, 2, 3. (5) We will develop web databases and software to directly benefit the scientific community. Our methods and software will significantly facilitate the re-use of the vast amount of existing RNA-seq data, reduce the necessity to generate new data, and improve our understanding of gene regulations under a variety of perturbations.

Public Health Relevance

This project will create a set of computational methods to facilitate the re-use of the rapidly accumulating public RNA-seq repositories. We will generate an atlas of splicing modules specific to diverse diseases, and will predict specific functions of splicing isoforms. We will experimentally validate a subset of predictions related to cancer. Finally, we will develop software and web servers to directly benefit the biomedical community.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
4R01GM105431-04
Application #
8991510
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Bender, Michael T
Project Start
2013-04-01
Project End
2016-06-30
Budget Start
2016-01-01
Budget End
2016-06-30
Support Year
4
Fiscal Year
2016
Total Cost
Indirect Cost
Name
University of Southern California
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
072933393
City
Los Angeles
State
CA
Country
United States
Zip Code
90032
Katrib, Amal; Hsu, William; Bui, Alex et al. (2016) ""RADIOTRANSCRIPTOMICS"": A synergy of imaging and transcriptomics in clinical assessment. Quant Biol 4:1-12
Lin, Lan; Jiang, Peng; Park, Juw Won et al. (2016) The contribution of Alu exons to the human proteome. Genome Biol 17:15
Yang, Xinping; Coulombe-Huntington, Jasmin; Kang, Shuli et al. (2016) Widespread Expansion of Protein Interaction Capabilities by Alternative Splicing. Cell 164:805-17
Damianov, Andrey; Ying, Yi; Lin, Chia-Ho et al. (2016) Rbfox Proteins Regulate Splicing as Part of a Large Multiprotein Complex LASR. Cell 165:606-19
Shen, Shihao; Wang, Yuanyuan; Wang, Chengyang et al. (2016) SURVIV for survival analysis of mRNA isoform variation. Nat Commun 7:11548
Ji, Xinjun; Park, Juw Won; Bahrami-Samani, Emad et al. (2016) ?CP binding to a cytosine-rich subset of polypyrimidine tracts drives a novel pathway of cassette exon splicing in the mammalian transcriptome. Nucleic Acids Res 44:2283-97
Li, Wenyuan; Liu, Chun-Chi; Kang, Shuli et al. (2016) Pushing the annotation of cellular activities to a higher resolution: Predicting functions at the isoform level. Methods 93:110-8
Stein, Shayna; Lu, Zhi-Xiang; Bahrami-Samani, Emad et al. (2015) Discover hidden splicing variations by mapping personal transcriptomes to personal genomes. Nucleic Acids Res 43:10612-22
Tseng, Yu-Ting; Li, Wenyuan; Chen, Ching-Hsien et al. (2015) IIIDB: a database for isoform-isoform interactions and isoform network modules. BMC Genomics 16 Suppl 2:S10
Bebee, Thomas W; Park, Juw Won; Sheridan, Katherine I et al. (2015) The splicing regulators Esrp1 and Esrp2 direct an epithelial splicing program essential for mammalian development. Elife 4:

Showing the most recent 10 out of 15 publications