Recently, the RNA-seq technology is increasingly replacing microarray for expression profiling. In this proposal, we will timely address the emerging challenges and opportunities brought by the rapidly accumulating RNA-seq data. We will design novel methods to perform integrative analysis of many RNA-seq datasets to study the functions and regulations of alternative splicing. In particular, we have the following specifi aims: (1) We will develop a novel graph- based pattern mining method to reconstruct an atlas of splicing modules and identify the associated experimental conditions in human, mouse, fly, and yeast. (2) We will study the coupling between transcription and splicing, the two important regulatory processes, by exploiting both expression and splicing information provided by RNA-seq data. We will design a novel multi-layer network mining approach to systematically identify coupled transcription- splicing modules. (3) We will predict the functions of alternatively spliced transcripts to establish a high-resolution function annotation of human genome. The predicted functions will be incorporated into the GeneOntology and the Ensembl databases to benefit the biological community. (4) We will perform experimental validation on a subset of computational predictions made in Aims 1, 2, 3. (5) We will develop web databases and software to directly benefit the scientific community. Our methods and software will significantly facilitate the re-use of the vast amount of existing RNA-seq data, reduce the necessity to generate new data, and improve our understanding of gene regulations under a variety of perturbations.
This project will create a set of computational methods to facilitate the re-use of the rapidly accumulating public RNA-seq repositories. We will generate an atlas of splicing modules specific to diverse diseases, and will predict specific functions of splicing isoforms. We will experimentally validate a subset of predictions related to cancer. Finally, we will develop software and web servers to directly benefit the biomedical community.
|Ji, Xinjun; Park, Juw Won; Bahrami-Samani, Emad et al. (2016) Î±CP binding to a cytosine-rich subset of polypyrimidine tracts drives a novel pathway of cassette exon splicing in the mammalian transcriptome. Nucleic Acids Res 44:2283-97|
|Li, Wenyuan; Liu, Chun-Chi; Kang, Shuli et al. (2016) Pushing the annotation of cellular activities to a higher resolution: Predicting functions at the isoform level. Methods 93:110-8|
|Shen, Shihao; Wang, Yuanyuan; Wang, Chengyang et al. (2016) SURVIV for survival analysis of mRNA isoform variation. Nat Commun 7:11548|
|Damianov, Andrey; Ying, Yi; Lin, Chia-Ho et al. (2016) Rbfox Proteins Regulate Splicing as Part of a Large Multiprotein Complex LASR. Cell 165:606-19|
|Lin, Lan; Jiang, Peng; Park, Juw Won et al. (2016) The contribution of Alu exons to the human proteome. Genome Biol 17:15|
|Yang, Xinping; Coulombe-Huntington, Jasmin; Kang, Shuli et al. (2016) Widespread Expansion of Protein Interaction Capabilities by Alternative Splicing. Cell 164:805-17|
|Cieply, Benjamin; Park, Juw Won; Nakauka-Ddamba, Angela et al. (2016) Multiphasic and Dynamic Changes in Alternative Splicing during Induction of Pluripotency Are Coordinated by Numerous RNA-Binding Proteins. Cell Rep 15:247-55|
|Lu, Zhi-xiang; Huang, Qin; Park, Juw Won et al. (2015) Transcriptome-wide landscape of pre-mRNA alternative splicing associated with metastatic colonization. Mol Cancer Res 13:305-18|
|Stein, Shayna; Lu, Zhi-Xiang; Bahrami-Samani, Emad et al. (2015) Discover hidden splicing variations by mapping personal transcriptomes to personal genomes. Nucleic Acids Res 43:10612-22|
|Bebee, Thomas W; Park, Juw Won; Sheridan, Katherine I et al. (2015) The splicing regulators Esrp1 and Esrp2 direct an epithelial splicing program essential for mammalian development. Elife 4:|
Showing the most recent 10 out of 15 publications