Characterization of genetic variation and dissection of complex genetic architectures of complex traits or diseases have been a great challenge. Family-based gene-mapping approaches have shown tremendous success for rare monogenic diseases, but are underpowered to detect genes underling complex diseases. Furthermore, with advance in high throughput technologies, many previous methods cannot adequately handle new data because of lack of efficiency or violation of assumptions. In this project, we seek to develop efficient algorithms to tackle the inheritance inference problem in large pedigrees. The algorithms relies on a framework developed previously by our group that can effectively handle pair-wise identical by decent (IBD) inference from large pedigrees with many untyped members. We will also apply the new algorithms on a real dataset consisting of 20 large complex pedigrees and perform analysis on recombination breakpoint identification, haplotype inference, family-based linkage and association studies based on haplotype segments separated by recombination breakpoints, joint analysis of SNP data and sequence data in families. All our developments will focus on practically important issues, including large pedigrees with many untyped members, genotyping errors, scalability issues on high-throughput datasets. Software tools will be developed and will be made available to the public.

Public Health Relevance

Title: Inheritance inference in large pedigrees using the spectral clustering algorithm and applications Project Narrative: In this project, we propose to develop efficient algorithms for the analysis of large scale pedigree data. The algorithms can be utilized by many applications, including recombination breakpoints identification, haplotype inference, family-based linkage and association studies based on haplotype segments separated by recombination breakpoints, joint analysis of SNP data and sequence data. We will also apply our approaches on a real dataset consisting of 20 large scale complex pedigrees.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Small Research Grants (R03)
Project #
1R03HG008632-01A1
Application #
9112362
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Brooks, Lisa
Project Start
2016-06-14
Project End
2018-04-30
Budget Start
2016-06-14
Budget End
2017-04-30
Support Year
1
Fiscal Year
2016
Total Cost
$71,579
Indirect Cost
$21,579
Name
Case Western Reserve University
Department
Engineering (All Types)
Type
Schools of Engineering
DUNS #
077758407
City
Cleveland
State
OH
Country
United States
Zip Code
44106
Hu, Ke; Li, Jing (2018) Detection and analysis of CpG sites with multimodal DNA methylation level distributions and their relationships with SNPs. BMC Proc 12:36
Song, Sunah; Li, Xin; Li, Jing (2017) Haplotype Inference. Methods Mol Biol 1666:469-484