Integrated analysis of protein expression data from the Reverse Phase Protein Array (RPPA) platform

Akbani, Rehan; Mills, Gordon; Weinstein, John

Abstract

The National Cancer Institute has initiated, or will initiate, a number of large-scale cancer genomics programs under the aegis of the Center for Cancer Genomics (CCG). The overall goal of those programs is to help elucidate the mechanisms of cancer initiation, evolution, and resistance to therapy through detailed molecular characterization of tumor samples across multiple technological platforms. As most therapy targets are proteins, and protein phosphorylation is functionally important, accurate analysis of proteins is critical to the effort. Consequently, MD Anderson was awarded the Genome Characterization Center contract to develop and apply a high-throughput reverse-phase protein array (RPPA) pipeline (Contact PD/PI Gordon Mills; PD/PI Rehan Akbani). The present proposal is for the establishment of a Specialized Genome Data Analysis Center (GDAC) at MD Anderson under the same auspices. As its first objective, the GDAC will directly support CCG projects by analyzing RPPA data for the Analysis Working Groups (AWGs). The GDAC will participate in discussions, solicit feedback from the AWGs, and suggest future directions for research. A second objective of the GDAC will be to enhance its current bioinformatic tools to improve the analysis and interpretation of RPPA data, whether developed under the aegis of the CCG or through other community approaches. Specifically, the aims of the GDAC are to (i) extract high-quality, analysis-ready protein expression measures from the RPPA data; (ii) cluster RPPA data and conduct integrated analysis by correlating RPPA data with clinical and other molecular data; (iii) perform knowledge-based and independent pathway analysis of RPPA data to identify proteomic pathways that have been substantially altered in the set of cases in each CCG project; and (iv) continue to develop innovative bioinformatic and computational tools and methodologies to improve the RPPA data analysis pipeline.
The pipeline will be shared publicly for the benefit of other researchers. The GDAC will perform the stated tasks by continuing to develop a fully or semi-automated software pipeline using the Galaxy software infrastructure and its own software modules. A preliminary version of the pipeline, together with the necessary expertise for systems biological interpretation of the results, is already in place and will be available at the beginning of the performance period. Further enhancements of the pipeline will be implemented as the GDAC progresses. The pipeline will input raw or pre-processed RPPA data from a central data repository that is specified by the CCG; perform quality control; remove any batch effects; analyze the data using both novel and traditional algorithms; correlate the data with other molecular/clinical features; visualize the outcome; and then deposit the results back in the repository for use by the AWG. The GDAC will interact and collaborate with other components of the CCG consortium to discover biologically and clinically relevant findings that will shed light on the underlying mechanisms of cancer and offer potential avenues for novel therapeutic approaches.
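
As a rough illustration of two of the pipeline stages named above (batch-effect removal and correlation with a clinical feature), the following minimal Python sketch may help. It is not the GDAC's software: the per-batch median centering is a deliberately simple stand-in for whatever batch correction the pipeline actually uses, and the function names, batch labels, and toy values are all hypothetical.

```python
from math import sqrt
from statistics import mean, median


def median_center_by_batch(values, batches):
    """Subtract each batch's median from its samples.

    Assumption: a simple stand-in for batch correction, not the GDAC method.
    """
    by_batch = {}
    for value, batch in zip(values, batches):
        by_batch.setdefault(batch, []).append(value)
    centers = {batch: median(vals) for batch, vals in by_batch.items()}
    return [value - centers[batch] for value, batch in zip(values, batches)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return float("nan")  # correlation is undefined for a constant input
    return cov / (sx * sy)


# Hypothetical toy data: one protein measured in six samples across two batches,
# plus one continuous clinical feature for the same samples.
protein = [1.0, 1.2, 0.8, 3.1, 2.9, 3.3]
batches = ["A", "A", "A", "B", "B", "B"]
clinical = [0.5, 0.7, 0.3, 0.6, 0.4, 0.8]

raw_r = pearson(protein, clinical)
corrected = median_center_by_batch(protein, batches)
r = pearson(corrected, clinical)
print(f"correlation before correction: {raw_r:.3f}")
print(f"correlation after correction:  {r:.3f}")
```

In this toy example the shift between batches masks the within-batch relationship, so the correlation with the clinical feature is noticeably stronger after centering. Real RPPA data would require proper quality control and a validated batch-correction method.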

Public Health Relevance

We propose to establish a Genome Data Analysis Center (GDAC) for the analysis of data obtained by the National Cancer Institute on proteins in cancers. The objectives of the GDAC are (i) to analyze cancer protein data and support the scientific community in its investigations of the data; and (ii) to develop new or enhanced software tools that meet the first objective more fully, and to provide that software to the broader research community. We expect that the work of the GDAC will lead to biologically and clinically relevant findings that will shed light on the underlying mechanisms of cancer and offer potential avenues for the treatment of cancer patients.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Cancer Institute (NCI)
Type: Resource-Related Research Projects--Cooperative Agreements (U24)
Project #: 5U24CA210950-05
Application #: 10005168
Study Section: Special Emphasis Panel (ZCA1)
Program Officer: Yang, Liming

Project Start: 2016-09-13
Project End: 2021-08-31
Budget Start: 2020-09-01
Budget End: 2021-08-31
Support Year: 5
Fiscal Year: 2020
Total Cost
Indirect Cost

Institution

Name: University of Texas MD Anderson Cancer Center
Department: Biostatistics & Other Math Sci
Type: Hospitals
DUNS #: 800772139

City: Houston
State: TX
Country: United States
Zip Code: 77030

Related projects


NIH 2020 U24 CA	Integrated analysis of protein expression data from the Reverse Phase Protein Array (RPPA) platform Akbani, Rehan; Mills, Gordon B.; Weinstein, John N. / University of Texas MD Anderson Cancer Center
NIH 2019 U24 CA	Integrated analysis of protein expression data from the Reverse Phase Protein Array (RPPA) platform Akbani, Rehan; Mills, Gordon B.; Weinstein, John N. / University of Texas MD Anderson Cancer Center
NIH 2018 U24 CA	Integrated analysis of protein expression data from the Reverse Phase Protein Array (RPPA) platform Akbani, Rehan; Mills, Gordon B.; Weinstein, John N. / University of Texas MD Anderson Cancer Center
NIH 2017 U24 CA	Integrated analysis of protein expression data from the Reverse Phase Protein Array (RPPA) platform Akbani, Rehan; Mills, Gordon B.; Weinstein, John N. / University of Texas MD Anderson Cancer Center
NIH 2016 U24 CA	Integrated analysis of protein expression data from the Reverse Phase Protein Array (RPPA) platform Akbani, Rehan; Mills, Gordon B.; Weinstein, John N. / University of Texas MD Anderson Cancer Center	$420,664

Publications

Radovich, Milan; Pickering, Curtis R; Felau, Ina et al. (2018) The Integrated Genomic Landscape of Thymic Epithelial Tumors. Cancer Cell 33:244-258.e10

Berger, Ashton C; Korkut, Anil; Kanchi, Rupa S et al. (2018) A Comprehensive Pan-Cancer Molecular Study of Gynecologic and Breast Cancers. Cancer Cell 33:690-705.e9

Corces, M Ryan; Granja, Jeffrey M; Shams, Shadi et al. (2018) The chromatin accessibility landscape of primary human cancers. Science 362:

Hoadley, Katherine A; Yau, Christina; Hinoue, Toshinori et al. (2018) Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer. Cell 173:291-304.e6

Schaub, Franz X; Dhankani, Varsha; Berger, Ashton C et al. (2018) Pan-cancer Alterations of the MYC Oncogene and Its Proximal Network across the Cancer Genome Atlas. Cell Syst 6:282-300.e2

Liu, Jianfang; Lichtenberg, Tara; Hoadley, Katherine A et al. (2018) An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics. Cell 173:400-416.e11

Bailey, Matthew H; Tokheim, Collin; Porta-Pardo, Eduard et al. (2018) Comprehensive Characterization of Cancer Driver Genes and Mutations. Cell 173:371-385.e18

Sun, Chaoyang; Yin, Jun; Fang, Yong et al. (2018) BRD4 Inhibition Is Synthetic Lethal with PARP Inhibitors through the Induction of Homologous Recombination Deficiency. Cancer Cell 33:401-416.e8

Sanchez-Vega, Francisco; Mina, Marco; Armenia, Joshua et al. (2018) Oncogenic Signaling Pathways in The Cancer Genome Atlas. Cell 173:321-337.e10

Ge, Zhongqi; Leighton, Jake S; Wang, Yumeng et al. (2018) Integrated Genomic Analysis of the Ubiquitin Pathway across Cancer Types. Cell Rep 23:213-226.e3

Showing the most recent 10 out of 44 publications
