The development of molecularly targeted drugs, specifically those which modulate the activities of one or several proteins involved in the pathogenesis of a cancer, is the most exciting field for cancer treatment because targeted anticancer drugs have the potential to provide dramatic clinical benefits with little toxicity. In order to develop new molecularly targeted drugs for lung cancer, the leading cause of cancer in the world, we have collected a large amount of data, including genetic/epigenetic (mutations, copy number variation, and methylation), mRNA expression, protein expression and genome-wide RNAi functional screening data on 108 non-small cell lung cancer (NSCLC) cell lines. Integrating these large-scale and complementary datasets from different sources will provide great opportunities to discover new molecular mechanisms of lung cancer.
In Aim 1 of this study, we will develop a powerful computational model to integrate multiple genomic, proteomic and functional datasets to identify new lung cancer driver genes. Only a small subset of tumor driver genes is traditionally "druggable" targets.
In Aim 2 of this study, we will use a data-drive and unbiased approach to discover and evaluate potential new therapeutic targets in lung cancer. A novel reverse engineering approach will be proposed to construct a lung-cancer-specific gene network.
In Aim 3 of this study, we will develop a publicly available comprehensive lung cancer database with a user-friendly interface and powerful analysis engine. This database will include all genomic, proteomic and functional data together with the de-identified clinical data used in this study. By using the state-of-the-art information technolog, we will integrate these datasets with analytic algorithms and a user-friendly interface in a publicly available database so that researchers worldwide can utilize and test the data and computational tools generated from this study.
Lung cancer is the leading cause of death from cancer for both men and women in the United States with a 5- year survival rate of approximately 15%. The overall goal of this study is to develop novel analytical models and systems biology approaches to identify new potential therapeutic targets of lung cancer.
|Xiao, Guanghua; Ma, Shuangge; Minna, John et al. (2014) Adaptive prediction model in prospective molecular signature-based clinical studies. Clin Cancer Res 20:531-9|
|Yun, Jonghyun; Wang, Tao; Xiao, Guanghua (2014) Bayesian hidden Markov models to identify RNA-protein interaction sites in PAR-CLIP. Biometrics 70:430-40|
|Wang, Tao; Xie, Yang; Xiao, Guanghua (2014) dCLIP: a computational approach for comparative CLIP-seq analyses. Genome Biol 15:R11|
|Zhong, Rui; Allen, Jeffrey D; Xiao, Guanghua et al. (2014) Ensemble-based network aggregation improves the accuracy of gene network reconstruction. PLoS One 9:e106319|
|Wang, Tao; Chen, Beibei; Kim, MinSoo et al. (2014) A model-based approach to identify binding sites in CLIP-Seq data. PLoS One 9:e93248|
|An, Zhenyi; Tassa, Amina; Thomas, Collin et al. (2014) Autophagy is required for G?/G? quiescence in response to nitrogen starvation in Saccharomyces cerevisiae. Autophagy 10:1702-11|
|Yang, Jichen; Wang, Xinlei; Kim, Minsoo et al. (2014) Detection of candidate tumor driver genes using a fully integrated Bayesian approach. Stat Med 33:1784-800|
|Zhong, Rui; Kim, Jimi; Kim, Hyun Seok et al. (2014) Computational detection and suppression of sequence-specific off-target phenotypes from whole genome RNAi screens. Nucleic Acids Res 42:8214-22|