? ? The overall objective is to develop efficient algorithms and software tools for the analysis of genetic variation in human populations and its association with phenotypic variation. Correlating variations in DMA sequences with phenotypic differences has been one of the grand challenges in biomedical research. With the completion of the human genome project, substantial effort has been made to identify all common genetic variations such as single nucleotide polymorphisms (SNPs). While millions of SNPs have been identified, there is a great need for models and tools to characterize genetic variation in humans, and to facilitate the localization of genes underling complex diseases/traits. To meet the need, we propose to develop novel algorithmic approaches and software tools to address some of the fundamental issues in the analysis of SNPs and haplotypes with applications in gene association mapping. More specifically, we propose to devise efficient and robust algorithms to infer haplotypes from genotypes on a pedigree, impute missing SNPs, discover the haplotype structure, and select informative (tag) SNPs. We will also develop computational models that could utilize haplotypes in the identification of disease genes. The focus of this proposal is the development of novel combinatorial algorithms, datamining approaches, statistical techniques, as well as robust and user friendly tools. The emphasis is on the efficiency of algorithms because existing methods could not handle data sets at the whole genome level. The algorithms will be performed on the public databases (e.g. the HapMap project), as well as other human data generated in our collaborators' ongoing ? projects, including data sets concerning various complications of pregnancy, modifier genes of the cystic fibrosis disease and the genetic effects of late-onset Alzheimer disease. We anticipate that this project will result in a full spectrum of efficient and effective algorithms and software tools that will be useful to the broad biomedical research community and will greatly facilitate the study of human genetic variation and its association with complex diseases/traits. The proposed project fits well in two of the three themes that NIH has identified in its Roadmap initiatives: research in bioinformatics and computational biology under the theme of New Pathways to Discovery and interdisciplinary research under the theme of Research Teams of the Future. ? ?

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
5R01LM008991-02
Application #
7209009
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Ye, Jane
Project Start
2006-03-15
Project End
2009-02-28
Budget Start
2007-03-01
Budget End
2008-02-29
Support Year
2
Fiscal Year
2007
Total Cost
$393,862
Indirect Cost
Name
Case Western Reserve University
Department
Engineering (All Types)
Type
Schools of Engineering
DUNS #
077758407
City
Cleveland
State
OH
Country
United States
Zip Code
44106
Wang, Wenhui; Yang, Sen; Zhang, Xiang et al. (2014) Drug repositioning by integrating target information through a heterogeneous network model. Bioinformatics 30:2923-30
Wang, Wei-Bung; Jiang, Tao; Gardner, Shea (2013) Detection of homologous recombination events in bacterial genomes. PLoS One 8:e75230
Wang, Wenhui; Yin, Xiaolin; Soo Pyon, Yoon et al. (2013) Rare variant discovery and calling by sequencing pooled samples with overlaps. Bioinformatics 29:29-38
Wang, Wenhui; Yang, Sen; Li, Jing (2013) Drug target predictions based on heterogeneous graph inference. Pac Symp Biocomput :53-64
Hayes, Matthew; Li, Jing (2013) Bellerophon: a hybrid method for detecting interchromosomal rearrangements at base pair resolution using next-generation sequencing data. BMC Bioinformatics 14 Suppl 5:S6
Azad, Rajeev K; Li, Jing (2013) Interpreting genomic data via entropic dissection. Nucleic Acids Res 41:e23
Pirola, Yuri; Bonizzoni, Paola; Jiang, Tao (2012) An efficient algorithm for haplotype inference on pedigrees with recombinations and mutations. IEEE/ACM Trans Comput Biol Bioinform 9:12-25
Li, Xin; Li, Jing (2012) Haplotype inference. Methods Mol Biol 850:411-21
Xie, Minzhu; Li, Jing; Jiang, Tao (2012) Detecting genome-wide epistases based on the clustering of relatively frequent items. Bioinformatics 28:5-12
Hayes, Matthew; Pyon, Yoon Soo; Li, Jing (2012) A model-based clustering method for genomic structural variant prediction and genotyping using paired-end sequencing data. PLoS One 7:e52881

Showing the most recent 10 out of 54 publications