The proposed Genome Data Analysis Center B (GDAC B) will work cooperatively with other GDACs funded by The Cancer Genome Atlas (TCGA) project to (i) develop an innovative, integrative pipeline for systems- level analysis of TCGA's molecular profiling data on many different types of human tumors and (ii) apply that pipeline and its component modules to TCGA data to address important biological and clinical questions. An overarching goal is to 'personalize'the management of patients'cancers on the basis of new tumor biomarkers and biosignatures. For the first time, it is easier to generate millions of data points on tumors than to analyze or interpret those data, hence the bioinformatic challenge is formidable. The pipeline will be constructed using the Agile software development paradigm and semantic web query architecture. It will be based on novel algorithms and modules developed by participants in the GDAC. Included will be modules for data integration, data visualization, pathway analysis, and systems biological interpretation, all designed to be user-friendly for the bench researcher and clinician. Those modules will be interfaced with additional ones developed by other GDACs, All development will adhere to standards of TCGA and the Cancer Biomedical Informatics Grid (caBIG) and will provide controlled access to ensure confidentiality of personally identifiable data. The proposed GDAC team brings to this project expertise in bioinformatics, biostatistics, software engineering, high-throughput molecular profiling technologies, systems-oriented biology, biomarker studies, pathology, and clinical research. The three co-PIs (for bioinformatics, systems biology, and clinical research) have each participated actively in TCGA since its inception, as have other members of the team, including the lead software engineer. A major strength is the University of Texas M. D. Anderson Cancer Center (MDACC) as an institution. MDACC has been, and presumably will continue to be, the largest source of tumor specimens for TCGA. As one of the country's foremost cancer centers, with by far the largest cancer clinical research program, MDACC has unparalleled expertise for follow up on medically important leads that result from the development and application of the pipeline to TCGA data.

Public Health Relevance

The Cancer Genome Atlas project will generate multi-faceted molecular profiles on 25 different human cancer types. The result will be a treasure trove of information that can be used to personalize cancer diagnosis and treatment. Analysis of the data is a bottleneck, which the proposed Genome Data Analysis Center will alleviate by building an innovative, advanced bioinformatic analysis pipeline.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Resource-Related Research Projects--Cooperative Agreements (U24)
Project #
3U24CA143883-05S1
Application #
8925446
Study Section
Special Emphasis Panel (ZCA1-SRLB-U (O1))
Program Officer
Yang, Liming
Project Start
2009-09-29
Project End
2015-07-31
Budget Start
2013-08-09
Budget End
2015-07-31
Support Year
5
Fiscal Year
2014
Total Cost
$595,560
Indirect Cost
$165,945
Name
University of Texas MD Anderson Cancer Center
Department
Biostatistics & Other Math Sci
Type
Other Domestic Higher Education
DUNS #
800772139
City
Houston
State
TX
Country
United States
Zip Code
77030
Ng, Patrick Kwok-Shing; Li, Jun; Jeong, Kang Jin et al. (2018) Systematic Functional Annotation of Somatic Mutations in Cancer. Cancer Cell 33:450-462.e10
Radovich, Milan; Pickering, Curtis R; Felau, Ina et al. (2018) The Integrated Genomic Landscape of Thymic Epithelial Tumors. Cancer Cell 33:244-258.e10
Shen, Hui; Shih, Juliann; Hollern, Daniel P et al. (2018) Integrated Molecular Characterization of Testicular Germ Cell Tumors. Cell Rep 23:3392-3406
Berger, Ashton C; Korkut, Anil; Kanchi, Rupa S et al. (2018) A Comprehensive Pan-Cancer Molecular Study of Gynecologic and Breast Cancers. Cancer Cell 33:690-705.e9
Gomez, Daniel Richard; Byers, Lauren Averett; Nilsson, Monique et al. (2018) Integrative proteomic and transcriptomic analysis provides evidence for TrkB (NTRK2) as a therapeutic target in combination with tyrosine kinase inhibitors for non-small cell lung cancer. Oncotarget 9:14268-14284
Hoadley, Katherine A; Yau, Christina; Hinoue, Toshinori et al. (2018) Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer. Cell 173:291-304.e6
Chen, Jian; Zaidi, Sobia; Rao, Shuyun et al. (2018) Analysis of Genomes and Transcriptomes of Hepatocellular Carcinomas Identifies Mutations and Gene Expression Changes in the Transforming Growth Factor-? Pathway. Gastroenterology 154:195-210
Schaub, Franz X; Dhankani, Varsha; Berger, Ashton C et al. (2018) Pan-cancer Alterations of the MYC Oncogene and Its Proximal Network across the Cancer Genome Atlas. Cell Syst 6:282-300.e2
Peng, Bo; Wang, Gao; Ma, Jun et al. (2018) SoS Notebook: an interactive multi-language data analysis environment. Bioinformatics 34:3768-3770
Liu, Jianfang; Lichtenberg, Tara; Hoadley, Katherine A et al. (2018) An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics. Cell 173:400-416.e11

Showing the most recent 10 out of 163 publications