Genome Geography A tool for connecting genes and other genomic features

Vaughan, Laura

Abstract

As we accumulate knowledge, our understanding of the genome continues to evolve. We now realize that the 99% of the genome that does not code for proteins, what was once thought of as 'junk DNA', has important functional roles. This understanding has important implications in the analysis and interpretation of high dimensional genomics analysis. It is rare that a study is lucky enough to find significantly associated variant that lie within an exon of a protein coding gene that is biologically related to the phenotype. It s more common to identify a region of interest that is intergenic, or even in a gene desert. The implication of these associations is not directly obvious and often requires extensive bioinformatics analysis to even begin to understand the possible underlying biological mechanisms. To effectively capture our greater understanding of the relationship between coding and non-coding variants with complex disease, we must be able to accurately connect those variants with their biological annotations. This application proposes to build on my current K01 research of mapping SNPs to protein coding genes to capture these other features by accomplishing the following specific aims:
Aim 1 - Capturing non-protein coding 'genes'. In this aim we will identify non-protein coding genes and define their boundaries.
Aim 2 - Map variants in gene associated regions to the corresponding genes. Phenotypes are not controlled by genic sequences alone. In this aim we will identify and map the non-genic portions of the chromosome which can influence the expression of coding genes.
Aim 3 - Expanding beyond physical boundaries. Expanding feature boundaries to account for LD regions will allow for researchers to capture genomic features that would not be identified otherwise. It is vitaly important that any bioinformatics workflow follows the principles of reproducible research, particularly when utilizing database driven resources.
These aims will be accomplished by exploiting various database repositories and presenting the compiled information in a user friendly interface.

Public Health Relevance

Currently the interpretation of whole genome analysis tends to focus on protein coding genes. As our knowledge expands, we are beginning to understand that the rest of the genome has important functional roles. This proposal will provide a tool that will enable researchers to easily access, organize, and use information located in numerous ever expanding databases.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)
Type: Small Research Grants (R03)
Project #: 5R03DK096071-02
Application #: 8523850
Study Section: Diabetes, Endocrinology and Metabolic Diseases B Subcommittee (DDK)
Program Officer: Podskalny, Judith M,

Project Start: 2012-08-15
Project End: 2014-07-31
Budget Start: 2013-08-01
Budget End: 2014-07-31
Support Year: 2
Fiscal Year: 2013
Total Cost: $70,686
Indirect Cost: $22,436

Institution

Name: University of Alabama Birmingham
Department: Biostatistics & Other Math Sci
Type: Schools of Public Health
DUNS #: 063690705

City: Birmingham
State: AL
Country: United States
Zip Code: 35294

Related projects


NIH 2013 R03 DK	Genome Geography A tool for connecting genes and other genomic features Vaughan, Laura Kelly / University of Alabama Birmingham	$70,686
NIH 2012 R03 DK	Genome Geography A tool for connecting genes and other genomic features Vaughan, Laura Kelly / University of Alabama Birmingham	$73,250

Publications

Vaughan, Laura K; Srinivasasainagendra, Vinodh (2013) Where in the genome are we? A cautionary tale of database use in genomics research. Front Genet 4:38

Comments

Be the first to comment on Laura Vaughan's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: