The development of molecularly targeted drugs, specifically those which modulate the activities of one or several proteins involved in the pathogenesis of a cancer, is the most exciting field for cancer treatment because targeted anticancer drugs have the potential to provide dramatic clinical benefits with little toxicity. In order to develop new molecularly targeted drugs for lung cancer, the leading cause of cancer in the world, we have collected a large amount of data, including genetic/epigenetic (mutations, copy number variation, and methylation), mRNA expression, protein expression and genome-wide RNAi functional screening data on 108 non-small cell lung cancer (NSCLC) cell lines. Integrating these large-scale and complementary datasets from different sources will provide great opportunities to discover new molecular mechanisms of lung cancer.
In Aim 1 of this study, we will develop a powerful computational model to integrate multiple genomic, proteomic and functional datasets to identify new lung cancer driver genes. Only a small subset of tumor driver genes is traditionally "druggable" targets.
In Aim 2 of this study, we will use a data-driven and unbiased approach to discover and evaluate potential new therapeutic targets in lung cancer. A novel reverse engineering approach will be proposed to construct a lung-cancer-specific gene network.
In Aim 3 of this study, we will develop a publicly available comprehensive lung cancer database with a user-friendly interface and powerful analysis engine. This database will include all genomic, proteomic and functional data together with the de-identified clinical data used in this study. By using the state-of-the-art information technology, we will integrate these datasets with analytic algorithms and a user-friendly interface in a publicly available database so that researchers worldwide can utilize and test the data and computational tools generated from this study.

Public Health Relevance

Lung cancer is the leading cause of death from cancer for both men and women in the United States with a 5- year survival rate of approximately 15%. The overall goal of this study is to develop novel analytical models and systems biology approaches to identify new potential therapeutic targets of lung cancer.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Research Project (R01)
Project #
Application #
Study Section
Drug Discovery and Molecular Pharmacology Study Section (DMP)
Program Officer
Li, Jerry
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Texas Sw Medical Center Dallas
Other Clinical Sciences
Schools of Medicine
United States
Zip Code
Xiao, Guanghua; Ma, Shuangge; Minna, John et al. (2014) Adaptive prediction model in prospective molecular signature-based clinical studies. Clin Cancer Res 20:531-9
Yun, Jonghyun; Wang, Tao; Xiao, Guanghua (2014) Bayesian hidden Markov models to identify RNA-protein interaction sites in PAR-CLIP. Biometrics 70:430-40
Wang, Tao; Xie, Yang; Xiao, Guanghua (2014) dCLIP: a computational approach for comparative CLIP-seq analyses. Genome Biol 15:R11
Zhong, Rui; Allen, Jeffrey D; Xiao, Guanghua et al. (2014) Ensemble-based network aggregation improves the accuracy of gene network reconstruction. PLoS One 9:e106319
Wang, Tao; Chen, Beibei; Kim, MinSoo et al. (2014) A model-based approach to identify binding sites in CLIP-Seq data. PLoS One 9:e93248
An, Zhenyi; Tassa, Amina; Thomas, Collin et al. (2014) Autophagy is required for G?/G? quiescence in response to nitrogen starvation in Saccharomyces cerevisiae. Autophagy 10:1702-11
Yang, Jichen; Wang, Xinlei; Kim, Minsoo et al. (2014) Detection of candidate tumor driver genes using a fully integrated Bayesian approach. Stat Med 33:1784-800
Zhong, Rui; Kim, Jimi; Kim, Hyun Seok et al. (2014) Computational detection and suppression of sequence-specific off-target phenotypes from whole genome RNAi screens. Nucleic Acids Res 42:8214-22