The overarching goal of TCGA is to change the practice of cancer medicine and improve patient survival through cancer genomics. A key deliverable is to enable access to, and use of, complex multi-dimensional genomic data for downstream studies. We propose to operate a GDAC-A center with the leadership, expertise and infrastructure required to develop an analysis pipeline that will generate pre-defined integrative analyses and interpretations tailored for hypothesis testing by basic, translational and clinical investigators. Our team consists of experts in cancer biology, genomics and bioinformatics with a track record of leadership in TCGA. The analytical tools and pipeline structure are based on our extensive TCGA experience and are designed to achieve TCGA's goals. The pipeline will be built using the GenePattern bioinformatic workflow environment, a flexible and modular architecture that is caBIG and caGRID compliant. It is maintained in the well-established, robust and secure IT infrastructure at the Broad Institute and can be operated 24/7 as a production pipeline. Leveraging this resource, we will pursue the following specific aims.
Aim 1. We will define caBIG-compliant data formats for all input and output files. To further enhance standardization, we propose two additions to the standard data structure defined in the Pilot Project (Levels 1-4). Level 0 will define the specific versions of all reference databases used in the analyses, and Level 5 will capture disease-level findings that incorporate prior knowledge.
Aim 2. We will design analysis modules to consolidate data from all components of TCGA and to perform integrative analyses. Results will be submitted to the DCC in caBIG-compliant output files accompanied by human-readable reports containing text summaries, tables and figures in a format understandable to scientists of diverse disciplines, similar to the Results section of a publication. In addition, we are committed to continuous technical and analytical improvement of the pipeline, particularly in supporting the transition to next-generation sequencing platforms.
Aim 3. We will implement this high-throughput analysis pipeline in an industrial-level production mode with rigorous quality control, leveraging the Broad's infrastructural support and extensive experience in running and maintaining high-throughput computational pipelines.
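
The extended data structure proposed in Aim 1 can be illustrated with a minimal sketch. It assumes only what the aim states: Levels 1-4 come from the Pilot Project, Level 0 pins reference database versions, and Level 5 holds disease-level findings. All class names, field names and database names below are hypothetical and do not reflect the actual pipeline or any caBIG schema.

```python
from dataclasses import dataclass
from enum import IntEnum
from typing import Dict


class DataLevel(IntEnum):
    """Levels 1-4 follow the Pilot Project; 0 and 5 are the proposed additions."""
    REFERENCE_VERSIONS = 0  # proposed: versions of reference databases used
    LEVEL_1 = 1
    LEVEL_2 = 2
    LEVEL_3 = 3
    LEVEL_4 = 4
    DISEASE_FINDINGS = 5    # proposed: disease-level findings with prior knowledge


@dataclass(frozen=True)
class ReferenceManifest:
    """Hypothetical Level 0 record: database name -> version used in a run."""
    versions: Dict[str, str]

    def describe(self) -> str:
        # Sort by name so the description is stable across runs.
        return ", ".join(f"{name}={ver}" for name, ver in sorted(self.versions.items()))


@dataclass(frozen=True)
class AnalysisOutput:
    """Hypothetical output record that carries its Level 0 manifest with it."""
    level: DataLevel
    payload: str
    references: ReferenceManifest


def same_references(a: AnalysisOutput, b: AnalysisOutput) -> bool:
    """Two outputs are directly comparable only if they used the same reference versions."""
    return a.references.versions == b.references.versions
```

Attaching the Level 0 manifest to every output is one way to make results traceable to the reference versions they were computed against; the abstract does not specify how the pipeline records this.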

Agency: National Institutes of Health (NIH)
Institute: National Cancer Institute (NCI)
Type: Resource-Related Research Projects--Cooperative Agreements (U24)
Project #: 1U24CA143845-01
Application #: 7788542
Study Section: Special Emphasis Panel (ZCA1-SRLB-U (O1))
Program Officer: Lee, Jerry S
Project Start: 2009-09-29
Project End: 2014-07-31
Budget Start: 2009-09-29
Budget End: 2010-07-31
Support Year: 1
Fiscal Year: 2009
Total Cost: $2,664,889
Indirect Cost:
Name: Broad Institute, Inc.
Department:
Type:
DUNS #: 623544785
City: Cambridge
State: MA
Country: United States
Zip Code: 02142
Shen, Hui; Shih, Juliann; Hollern, Daniel P et al. (2018) Integrated Molecular Characterization of Testicular Germ Cell Tumors. Cell Rep 23:3392-3406
Berger, Ashton C; Korkut, Anil; Kanchi, Rupa S et al. (2018) A Comprehensive Pan-Cancer Molecular Study of Gynecologic and Breast Cancers. Cancer Cell 33:690-705.e9
Haradhvala, N J; Kim, J; Maruvka, Y E et al. (2018) Distinct mutational signatures characterize concurrent loss of polymerase proofreading and mismatch repair. Nat Commun 9:1746
Hoadley, Katherine A; Yau, Christina; Hinoue, Toshinori et al. (2018) Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer. Cell 173:291-304.e6
Schaub, Franz X; Dhankani, Varsha; Berger, Ashton C et al. (2018) Pan-cancer Alterations of the MYC Oncogene and Its Proximal Network across the Cancer Genome Atlas. Cell Syst 6:282-300.e2
Liu, Jianfang; Lichtenberg, Tara; Hoadley, Katherine A et al. (2018) An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics. Cell 173:400-416.e11
Bailey, Matthew H; Tokheim, Collin; Porta-Pardo, Eduard et al. (2018) Comprehensive Characterization of Cancer Driver Genes and Mutations. Cell 173:371-385.e18
Hmeljak, Julija; Sanchez-Vega, Francisco; Hoadley, Katherine A et al. (2018) Integrative Molecular Characterization of Malignant Pleural Mesothelioma. Cancer Discov 8:1548-1565
Sanchez-Vega, Francisco; Mina, Marco; Armenia, Joshua et al. (2018) Oncogenic Signaling Pathways in The Cancer Genome Atlas. Cell 173:321-337.e10
Way, Gregory P; Sanchez-Vega, Francisco; La, Konnor et al. (2018) Machine Learning Detects Pan-cancer Ras Pathway Activation in The Cancer Genome Atlas. Cell Rep 23:172-180.e3

Showing the most recent 10 out of 87 publications