Genome-wide association studies (GWAS) have analyzed patterns of genomic variation in thousands of subjects and identified a large number of genes and chromosomal regions that are associated with various lung diseases. Since many of these genes remain poorly characterized, a major challenge facing pulmonary research is to systematically investigate their biological functions, the pathways in which they are involved, and the effects of genetic mutations on both. Fortunately, there is a large and growing body of genomic, transcriptomic, and epigenomic data that can be used to help shed light on gene function. Here we propose to leverage these data, using advanced computational methods to systematically characterize the functions of genes identified through GWAS studies for lung diseases with an emphasis on Chronic Obstructive Pulmonary Disease (COPD). Several research groups, including investigators involved in this proposed project, have conducted detailed genetic, genomic, and epigenomic studies on COPD and identified multiple genetic loci that are associated with disease. On the other hand, the multiple data-types generated from these studies have provided a uniquely exciting opportunity to systematically formulate testable hypotheses by using data-integration computational methods. To this end, we have assembled an interdisciplinary team including experts in GWAS, medicine, laboratory biology, and bioinformatics. We will apply recently developed systems biology-based approaches to integrate multiple data-types and construct gene regulatory networks (GRN) centered on GWAS candidates and from these, use the local network to predict functions of the uncharacterized genes. These new functional assignments will then be experimentally validated and, if necessary, further refined through additional rounds of network inference and experimental assessment. Finally, we will create a publicly accessible website to make the functional predictions and network models accessible to the pulmonary community.

Public Health Relevance

Numerous studies have probed the genetics of lung disease, analyzing variation in the genome across thousands of subjects. These studies have identified many variants within genes and other genomic regions that are strongly associated with the disease state, but often these are of unknown functional significance. Here, using COPD as a model, we propose to use systems biology methods to map these genes to pathways and to use these pathway-based associations to assign putative functions to these genes.

National Institute of Health (NIH)
Research Project (R01)
Project #
Application #
Study Section
Infectious Diseases, Reproductive Health, Asthma and Pulmonary Conditions Study Section (IRAP)
Program Officer
Punturieri, Antonello
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Dana-Farber Cancer Institute
United States
Zip Code
Quackenbush, John (2014) Perspective: Learning to share. Nature 509:S68
Wu, Gengze; Cai, Jin; Han, Yu et al. (2014) LincRNA-p21 regulates neointima formation, vascular smooth muscle cell proliferation, apoptosis, and atherosclerosis by enhancing p53 activity. Circulation 130:1452-65
Hatzis, Christos; Bedard, Philippe L; Birkbak, Nicolai J et al. (2014) Enhancing reproducibility in cancer drug screening: how do we move forward? Cancer Res 74:4016-23
Hobbs, Brian D; Hersh, Craig P (2014) Integrative genomics of chronic obstructive pulmonary disease. Biochem Biophys Res Commun 452:276-86
Chu, Jen-hwa; Hersh, Craig P; Castaldi, Peter J et al. (2014) Analyzing networks of phenotypes in complex diseases: methodology and applications in COPD. BMC Syst Biol 8:78
Glass, Kimberly; Huttenhower, Curtis; Quackenbush, John et al. (2013) Passing messages between biological networks to refine predicted interactions. PLoS One 8:e64832