Lung cancer is the greatest cause of cancer-related deaths of men and women in the United States. Recent appreciation of the specific types of lung cancer has led to more effective, tailored therapies. One of the major types is lung squamous cell carcinoma (LSCC), which is a heterogeneous disease with a wide range of clinical outcomes. Several recent studies identified LSCC subtypes using gene expression microarrays, but there has been no attempt to validate these results, which is a prerequisite if LSCC subtypes are to be used in clinical decisions or future research. One of the principal drivers of the wildly altered tumor gene expression is genomic copy number or the gain and loss of chromosomal segments. Our core hypothesis is LSCC has reproducible molecular subtypes defined by alterations in gene expression and copy number, which represent distinct clinical diseases and biological processes. In preliminary work, our laboratory analyzed published raw microarray data from multiple independent cohorts and demonstrated that four LSCC subtypes can be reliably recognized. To pursue our core hypothesis, we have assembled a new independent LSCC cohort (UNC). We hypothesize the four a priori LSCC subtypes are robust biological diseases and as such will exist in the independent UNC cohort. The UNC cohort will be assayed by gene expression microarrays. UNC cohort subtype will be predicted by historical cohorts and validated with internal UNC predictions. Our preliminary analysis of inferring copy number differences by cytoband differential gene expression demonstrated several subtype-specific regions: 3q22-29, 8q24, and 18q12. We hypothesize that these and additional regions will differentiate the LSCC subtypes. To test this hypothesis, we will computationally detect UNC cohort genomic copy number, measured by SNP microarrays. To translate our results into a clinical application, we will develop an immunohistochemical assay that can predict LSCC subtype of clinical tissue specimens. We will then use our assay to predict LSCC subtype in a large independent cohort and evaluate whether LSCC subtypes have distinct clinical courses, such as survival, metastasis patterns, and response to chemotherapy.

Public Health Relevance

A new molecular LSCC classification may help explain the wide variety of clinical outcomes in this disease and provide a basis for new specialized therapies. The assay developed through this proposal will allow LSCC subtype identification in clinical specimens and is a step towards a routine clinical diagnostic. Subtype-specific genomic copy number aberrations may contribute to etiological models of LSCC.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Postdoctoral Individual National Research Service Award (F32)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-F09-B (20))
Program Officer
Jakowlew, Sonia B
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of North Carolina Chapel Hill
Internal Medicine/Medicine
Schools of Medicine
Chapel Hill
United States
Zip Code
Wilkerson, Matthew D; Cabanski, Christopher R; Sun, Wei et al. (2014) Integrated RNA and DNA sequencing improves mutation detection in low purity tumors. Nucleic Acids Res 42:e107
Kimes, Patrick K; Cabanski, Christopher R; Wilkerson, Matthew D et al. (2014) SigFuge: single gene clustering of RNA-seq reveals differential isoform usage among cancer samples. Nucleic Acids Res 42:e113
Wilkerson, Matthew D; Schallheim, Jason M; Hayes, D Neil et al. (2013) Prediction of lung cancer histological types by RT-qPCR gene expression in FFPE specimens. J Mol Diagn 15:485-97
Cabanski, Christopher R; Wilkerson, Matthew D; Soloway, Matthew et al. (2013) BlackOPs: increasing confidence in variant detection through mappability filtering. Nucleic Acids Res 41:e178
Liao, Rachel G; Jung, Joonil; Tchaicha, Jeremy et al. (2013) Inhibitor-sensitive FGFR2 and FGFR3 mutations in lung squamous cell carcinoma. Cancer Res 73:5195-205
Wilkerson, Matthew D; Yin, Xiaoying; Walter, Vonn et al. (2012) Differential pathogenesis of lung adenocarcinoma subtypes involving sequence mutations, copy number, chromosomal instability, and methylation. PLoS One 7:e36530
Cabanski, Christopher R; Cavin, Keary; Bizon, Chris et al. (2012) ReQON: a Bioconductor package for recalibrating quality scores from next-generation sequencing data. BMC Bioinformatics 13:221
Wilkerson, Matthew D; Yin, Xiaoying; Hoadley, Katherine A et al. (2010) Lung squamous cell carcinoma mRNA expression subtypes are reproducible, clinically important, and correspond to normal cell types. Clin Cancer Res 16:4864-75
Cabanski, Christopher R; Qi, Yuan; Yin, Xiaoying et al. (2010) SWISS MADE: Standardized WithIn Class Sum of Squares to evaluate methodologies and dataset elements. PLoS One 5:e9905
Wilkerson, Matthew D; Hayes, D Neil (2010) ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking. Bioinformatics 26:1572-3