A major goal in cancer biology is to understand the protein interactions involved in the molecular pathways that lead to initiation and progression of the disease. Advances in sequencing technologies are generating an overwhelming amount of data towards that goal, however these data lack specific functional context for the molecules involved in cancer. This missing functional content could be provided by a bioinformatics approach that is based on domain analysis. Protein domains are evolutionary conserved regions of the proteins and are considered to be the functional and structural units of the protein. Functional and structural genomic studies have provided extensive information about protein domains and their functional roles. How- ever, current methods to analyze the molecular bases of cancer do not systematically integrate protein domain studies into their methodologies, and that are mostly based on the study of individual genes or proteins. Functional studies at the gene level are unreliable due to gene multi-functionality. Instead, studies at the domain level account for modularity of proteins and discriminate between protein regions with different functionality, which is more accurate and informative. Here, we propose the first genome-wide study of the molecular disruptions associated with cancer that relies on protein domains, rather than on proteins, as the molecular units of the study. Our hypothesis is that the disruptions within a domain that is shared by similar proteins will have related functional effects in cancer. Thus, relevant functional information can be transferred among proteins with similar functional characteristics. First, we will develop domain-based methodologies for the functional classification and of molecular disruptions related to cancer, as well as for the prediction of relevance to disease. Second, we will develop a novel approach to support pharmacogenomic studies of cancer drugs. Finally, we will integrate these domain-based analyses into a molecular database annotated with cancer phenotype information, which we will make publicly available. We expect this research to lead to new hypotheses for the biological mechanisms involved in the disease and to provide new molecular bases for the classification of cancer. This proposal will provide the foundational research for the development of new therapeutic approaches, new drug targets, and efficient gene therapies to fight cancer.

Public Health Relevance

A major goal in cancer biology is to understand how the interaction between different proteins produced by the body can cause the disease. New technology is offering a very detailed look into the DNA sequences involved with cancer, but these advances also provide an overwhelming amount of data that does not necessarily give a clear picture as to how specific molecules function in the progression of the disease. One way to clarify this picture is to analyze protein domains. A protein domain is a certain region of a protein that has remained structurally identical over the course of evolution. These domains are considered to be what actually determines a protein's structure and function. Currently, research on the molecular bases of cancer focuses on individual genes or proteins. However, study on the gene level is unreliable, because each gene has many different functions, which confuses the data. Additionally, by analyzing domains, we can focus on the particular regions of the protein with functions we are interested in, providing more accurate and informative results. Here, we propose the first human genome-wide study of the molecular bases of cancer that relies on protein domains as the core of the study. Our hypothesis is that, by observing how the structure of protein domains changes in cancerous cells, we can predict the effect of cancer on other proteins with similar protein domains. First, we will develop domain-based methods for classifying the molecular indicators of cancer and examine how these relate to the disease. Second, we will develop a novel approach to support studies of cancer drugs. Finally, we will use our domain-based analyses as a basis for a molecular database containing information on the physical characteristics of cancer, which we will make publicly available. We expect this research to lead to new hypotheses for the biological effects of the disease and proved new molecular bases for classifying cancer. This proposal will provide the foundational research for the development of new therapeutic approaches, drug targets, and efficient gene therapies to fight cancer.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Career Transition Award (K22)
Project #
5K22CA143148-02
Application #
7942985
Study Section
Subcommittee G - Education (NCI)
Program Officer
Wali, Anil
Project Start
2009-09-29
Project End
2012-08-31
Budget Start
2010-09-01
Budget End
2011-08-31
Support Year
2
Fiscal Year
2010
Total Cost
$163,140
Indirect Cost
Name
University of Maryland Balt CO Campus
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
061364808
City
Baltimore
State
MD
Country
United States
Zip Code
21250
Cirincione, Ann G; Clark, Kaylyn L; Kann, Maricel G (2018) Pathway networks generated from human disease phenome. BMC Med Genomics 11:75
Zhou, Hua; Kann, Maricel G; Mallory, Emily K et al. (2017) Recruitment of Tiam1 to Semaphorin 4D Activates Rac and Enhances Proliferation, Invasion, and Metastasis in Oral Squamous Cell Carcinoma. Neoplasia 19:65-74
Peterson, Thomas A; Mort, Matthew; Cooper, David N et al. (2016) Regulatory Single-Nucleotide Variant Predictor Increases Predictive Performance of Functional Regulatory Variants. Hum Mutat 37:1137-1143
Cao, Chunzhang; Zhao, Juanjuan; Doughty, Emily K et al. (2015) Mac-1 Regulates IL-13 Activity in Macrophages by Directly Interacting with IL-13R?1. J Biol Chem 290:21642-51
Burger, John D; Doughty, Emily; Khare, Ritu et al. (2014) Hybrid curation of gene-mutation relations combining automated extraction and crowdsourcing. Database (Oxford) 2014:
Peterson, Thomas A; Doughty, Emily; Kann, Maricel G (2013) Towards precision medicine: advances in computational approaches for the analysis of human variants. J Mol Biol 425:4047-63
Peterson, Thomas A; Park, DoHwan; Kann, Maricel G (2013) A protein domain-centric approach for the comparative analysis of human and yeast phenotypically relevant mutations. BMC Genomics 14 Suppl 3:S5
Batchelor, Eric; Kann, Maricel G; Przytycka, Teresa M et al. (2013) Modeling cell heterogeneity: from single-cell variations to mixed cells. Pac Symp Biocomput :445-50
Rance, Bastien; Doughty, Emily; Demner-Fushman, Dina et al. (2012) A mutation-centric approach to identifying pharmacogenomic relations in text. J Biomed Inform 45:835-41
Gonzalez, Mileidy W; Kann, Maricel G (2012) Chapter 4: Protein interactions and disease. PLoS Comput Biol 8:e1002819

Showing the most recent 10 out of 19 publications