Efficient Analysis of SNPs & Haplotypes with Applications in Gene Mapping

Li, Jing

Abstract

The overall objective is to develop efficient algorithms and software tools for the analysis of genetic variation in human populations and its association with phenotypic variation. Correlating variations in DNA sequences with phenotypic differences has been one of the grand challenges in biomedical research. With the completion of the Human Genome Project, substantial effort has been made to identify all common genetic variations such as single nucleotide polymorphisms (SNPs). While millions of SNPs have been identified, there is a great need for models and tools to characterize genetic variation in humans, and to facilitate the localization of genes underlying complex diseases/traits. To meet this need, we propose to develop novel algorithmic approaches and software tools to address some of the fundamental issues in the analysis of SNPs and haplotypes with applications in gene association mapping. More specifically, we propose to devise efficient and robust algorithms to infer haplotypes from genotypes on a pedigree, impute missing SNPs, discover the haplotype structure, and select informative (tag) SNPs. We will also develop computational models that can utilize haplotypes in the identification of disease genes. The focus of this proposal is the development of novel combinatorial algorithms, data mining approaches, and statistical techniques, as well as robust and user-friendly tools. The emphasis is on the efficiency of algorithms because existing methods cannot handle data sets at the whole genome level. The algorithms will be applied to public databases (e.g. the HapMap project), as well as other human data generated in our collaborators' ongoing projects, including data sets concerning various complications of pregnancy, modifier genes of cystic fibrosis, and the genetic effects of late-onset Alzheimer disease.
We anticipate that this project will result in a full spectrum of efficient and effective algorithms and software tools that will be useful to the broad biomedical research community and will greatly facilitate the study of human genetic variation and its association with complex diseases/traits. The proposed project fits well within two of the three themes that NIH has identified in its Roadmap initiatives: research in bioinformatics and computational biology under the theme of New Pathways to Discovery and interdisciplinary research under the theme of Research Teams of the Future.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Library of Medicine (NLM)
Type: Research Project (R01)
Project #: 5R01LM008991-02
Application #: 7209009
Study Section: Biomedical Library and Informatics Review Committee (BLR)
Program Officer: Ye, Jane

Project Start: 2006-03-15
Project End: 2009-02-28
Budget Start: 2007-03-01
Budget End: 2008-02-29
Support Year: 2
Fiscal Year: 2007
Total Cost: $393,862
Indirect Cost

Institution

Name: Case Western Reserve University
Department: Engineering (All Types)
Type: Schools of Engineering
DUNS #: 077758407

City: Cleveland
State: OH
Country: United States
Zip Code: 44106

Related projects


NIH 2010 R01 LM	Multi-point and multi-locus analysis of genomic association data Li, Jing / Case Western Reserve University	$930,276
NIH 2009 R01 LM	Multi-point and multi-locus analysis of genomic association data Li, Jing / Case Western Reserve University	$951,009
NIH 2008 R01 LM	Efficient Analysis of SNPs & Haplotypes with Applications in Gene Mapping Li, Jing / Case Western Reserve University	$409,445
NIH 2007 R01 LM	Efficient Analysis of SNPs & Haplotypes with Applications in Gene Mapping Li, Jing / Case Western Reserve University	$393,862
NIH 2006 R01 LM	Efficient Analysis of SNPs & Haplotypes with Applications in Gene Mapping Li, Jing / Case Western Reserve University	$422,754

Publications

Wang, Wenhui; Yang, Sen; Zhang, Xiang et al. (2014) Drug repositioning by integrating target information through a heterogeneous network model. Bioinformatics 30:2923-30

Wang, Wei-Bung; Jiang, Tao; Gardner, Shea (2013) Detection of homologous recombination events in bacterial genomes. PLoS One 8:e75230

Wang, Wenhui; Yin, Xiaolin; Soo Pyon, Yoon et al. (2013) Rare variant discovery and calling by sequencing pooled samples with overlaps. Bioinformatics 29:29-38

Wang, Wenhui; Yang, Sen; Li, Jing (2013) Drug target predictions based on heterogeneous graph inference. Pac Symp Biocomput :53-64

Hayes, Matthew; Li, Jing (2013) Bellerophon: a hybrid method for detecting interchromosomal rearrangements at base pair resolution using next-generation sequencing data. BMC Bioinformatics 14 Suppl 5:S6

Azad, Rajeev K; Li, Jing (2013) Interpreting genomic data via entropic dissection. Nucleic Acids Res 41:e23

Pirola, Yuri; Bonizzoni, Paola; Jiang, Tao (2012) An efficient algorithm for haplotype inference on pedigrees with recombinations and mutations. IEEE/ACM Trans Comput Biol Bioinform 9:12-25

Li, Xin; Li, Jing (2012) Haplotype inference. Methods Mol Biol 850:411-21

Xie, Minzhu; Li, Jing; Jiang, Tao (2012) Detecting genome-wide epistases based on the clustering of relatively frequent items. Bioinformatics 28:5-12

Hayes, Matthew; Pyon, Yoon Soo; Li, Jing (2012) A model-based clustering method for genomic structural variant prediction and genotyping using paired-end sequencing data. PLoS One 7:e52881

Showing the most recent 10 out of 54 publications
