With each successive discovery in genetics, the true dynamic complexity of the human genome has become increasingly apparent, requiring relatively consistent updates to the technical definition of the word ?gene?. It is now understood that the notion of ?one gene makes one protein that functions in one signaling pathway? in human cells is overly simplistic, because majority of the human genes produce multiple functional products (transcript variants and protein isoforms), through alternative transcription and/or alternative splicing. Therefore, our central hypothesis is that the isoform-level gene products ? ?transcript variants? and ?protein isoforms? are the basic functional units in a mammalian cell, and accordingly, the informatics platforms for managing and analyzing gene regulation data both in normal and disease cells should adopt ?gene isoform centric? rather than ?gene centric? approaches. Towards the goal of broadly impacting gene regulation and functional studies at gene isoform-level, we have been developing novel algorithms for analyses of genome- wide transcriptome (RNA-seq and exon-array) and protein-DNA binding (ChIP-seq) data, and for extending the gene-level orthology mapping to exon- and transcript-level mapping between the orthologous human and mouse genes. By applying these novel algorithms on public datasets, we have observed significant expression differences between different sample groups (e.g., developmental stages, cancer subtypes, normal vs cancer) for numerous genes at the isoform-level but not at the overall gene-level, and experimentally validated the `significant' isoforms using RT-qPCR in independent bio-specimens. While the application of these algorithms has led to the development of new methods for diagnosis of glioblastoma or a sub-type thereof, the isoform- level transcriptome analyses results also led to some challenging questions ? for example ? How are the alternative promoters of a gene show switch-like opposing patterns of activity (while one promoter is up- the other is down-regulated in one condition vs the other), and how are different splice-variants of a gene show opposing expression patterns in cancer versus normal tissue samples? We currently lack informatics methods to address these challenging questions. Therefore, we propose to develop novel statistical methods (1) for integrative cluster analysis of isoform-level gene expression information from exon-array and RNA-seq platforms, (2) for identification of differential transcript/isoform usage in heterogeneous cancer samples, and (3) for identification of alternative transcription/splicing quantitative trait locus (sQTL) in tumor adjusted by somatic genetic and epigenetic changes. And, (4) the novel predictions from these algorithms will be experimentally validated by performing Chromatin immunoprecipitation (ChIP), dual-luciferase reporter assay and CRISPR/Cas9 genome editing in U87 and A172 cells. The novel bioinformatics methods developed by this project will help in silico discovery and research for accelerating the linkage of phenotypic and genomic information, at gene-isoform level.

Public Health Relevance

The disruption of numerous human genes and their isoforms driven by alternative splicing and alternative transcription is implicated in cancer and several neuropsychiatric disorders, including Parkinson's disease, schizophrenia, bipolar disorder and autism. The development of bioinformatics methods and user-friendly software in this study will provide useful tools to better understand gene regulatory mechanisms in mammalian cells, and more importantly, how dis-regulation of these mechanisms leads to a variety of diseases.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
7R01LM011297-09
Application #
10273985
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Ye, Jane
Project Start
2020-11-01
Project End
2021-04-30
Budget Start
2020-11-01
Budget End
2021-04-30
Support Year
9
Fiscal Year
2020
Total Cost
Indirect Cost
Name
State University New York Stony Brook
Department
Other Clinical Sciences
Type
Schools of Medicine
DUNS #
804878247
City
Stony Brook
State
NY
Country
United States
Zip Code
11794
Van Roosbroeck, Katrien; Fanini, Francesca; Setoyama, Tetsuro et al. (2017) Combining Anti-Mir-155 with Chemotherapy for the Treatment of Lung Cancers. Clin Cancer Res 23:2891-2904
Calvert, Andrea E; Chalastanis, Alexandra; Wu, Yongfei et al. (2017) Cancer-Associated IDH1 Promotes Growth and Resistance to Targeted Therapies in the Absence of Mutation. Cell Rep 19:1858-1873
Liu, Xianpeng; Zhao, Bo; Sun, Limin et al. (2017) Orthogonal ubiquitin transfer identifies ubiquitination substrates under differential control by the two ubiquitin activating enzymes. Nat Commun 8:14286
Shilpi, Arunima; Bi, Yingtao; Jung, Segun et al. (2017) Identification of Genetic and Epigenetic Variants Associated with Breast Cancer Prognosis by Integrative Bioinformatics Analysis. Cancer Inform 16:1-13
Dapas, Matthew; Kandpal, Manoj; Bi, Yingtao et al. (2017) Comparative evaluation of isoform-level gene expression estimation algorithms for RNA-seq and exon-array platforms. Brief Bioinform 18:260-269
Vannini, Ivan; Wise, Petra M; Challagundla, Kishore B et al. (2017) Transcribed ultraconserved region 339 promotes carcinogenesis by modulating tumor suppressor microRNAs. Nat Commun 8:1801
Malchenko, Sergey; Sredni, Simone Treiger; Bi, Yingtao et al. (2017) Stabilization of HIF-1? and HIF-2?, up-regulation of MYCC and accumulation of stabilized p53 constitute hallmarks of CNS-PNET animal model. PLoS One 12:e0173106
Macyszyn, Luke; Akbari, Hamed; Pisapia, Jared M et al. (2016) Imaging patterns predict patient survival and molecular subtype in glioblastoma via machine learning techniques. Neuro Oncol 18:417-25
Bell, Jonathan B; Eckerdt, Frank D; Alley, Kristen et al. (2016) MNK Inhibition Disrupts Mesenchymal Glioma Stem Cells and Prolongs Survival in a Mouse Model of Glioblastoma. Mol Cancer Res 14:984-993
Jin, Hong-Jian; Jung, Segun; DebRoy, Auditi R et al. (2016) Identification and validation of regulatory SNPs that modulate transcription factor chromatin binding and gene expression in prostate cancer. Oncotarget 7:54616-54626

Showing the most recent 10 out of 20 publications