This subproject is one of many research subprojects utilizing the resources provided by a Center grant funded by NIH/NCRR. The subproject and investigator (PI) may have received primary funding from another NIH source, and thus could be represented in other CRISP entries. The institution listed is for the Center, which is not necessarily the institution for the investigator. Array comparative genomic hybridization (aCGH) allows identification of copy number alterations across genomes. The key computational challenge in analyzing copy number variations (CNVs) using aCGH data or other similar data generated by a variety of array technologies is the detection of segment boundaries of copy number changes and inference of the copy number state for each segment. In this subproject, we have developed a novel statistical model based on the framework of conditional random fields (CRFs) that can effectively combine data smoothing, segmentation and copy number state decoding into one unified framework. Our approach (termed CRF-CNV) provides great flexibilities in defining meaningful feature functions. Therefore, it can effectively integrate local spatial information of arbitrary sizes into the model. For model parameter estimations, we have adopted the conjugate gradient (CG) method for likelihood optimization and developed efficient forward/backward algorithms within the CG framework. The method is evaluated using real data with known copy numbers as well as simulated data with realistic assumptions, and compared with two popular publicly available programs. Experimental results have demonstrated that CRF-CNV outperforms a Bayesian Hidden Markov Model-based approach on both datasets in terms of copy number assignments. Comparing to a non-parametric approach, CRF-CNV has achieved much greater precision while maintaining the same level of recall on the real data, and their performance on the simulated data is comparable.

Agency
National Institute of Health (NIH)
Institute
National Center for Research Resources (NCRR)
Type
Biotechnology Resource Grants (P41)
Project #
5P41RR003655-25
Application #
8171739
Study Section
Special Emphasis Panel (ZRG1-GGG-J (40))
Project Start
2010-08-01
Project End
2011-07-31
Budget Start
2010-08-01
Budget End
2011-07-31
Support Year
25
Fiscal Year
2010
Total Cost
$9,884
Indirect Cost
Name
Case Western Reserve University
Department
Internal Medicine/Medicine
Type
Schools of Medicine
DUNS #
077758407
City
Cleveland
State
OH
Country
United States
Zip Code
44106
Elston, Robert C; Satagopan, Jaya; Sun, Shuying (2017) Statistical Genetic Terminology. Methods Mol Biol 1666:1-9
Thota, Prashanthi N; Zackria, Shamiq; Sanaka, Madhusudhan R et al. (2017) Racial Disparity in the Sex Distribution, the Prevalence, and the Incidence of Dysplasia in Barrett's Esophagus. J Clin Gastroenterol 51:402-406
Liang, Jingjing; Cade, Brian E; Wang, Heming et al. (2016) Comparison of Heritability Estimation and Linkage Analysis for Multiple Traits Using Principal Component Analyses. Genet Epidemiol 40:222-32
Wang, Chuchu; Wu, Manman; Qian, Jin et al. (2016) Identification of rare variants in TNNI3 with atrial fibrillation in a Chinese GeneID population. Mol Genet Genomics 291:79-92
Lemas, Dominick J; Klimentidis, Yann C; Aslibekyan, Stella et al. (2016) Polymorphisms in stearoyl coa desaturase and sterol regulatory element binding protein interact with N-3 polyunsaturated fatty acid intake to modify associations with anthropometric variables and metabolic phenotypes in Yup'ik people. Mol Nutr Food Res 60:2642-2653
Day, Kenneth; Waite, Lindsay L; Alonso, Arnald et al. (2016) Heritable DNA Methylation in CD4+ Cells among Complex Families Displays Genetic and Non-Genetic Effects. PLoS One 11:e0165488
Justice, Cristina M; Bishop, Kevin; Carrington, Blake et al. (2016) Evaluation of IRX Genes and Conserved Noncoding Elements in a Region on 5p13.3 Linked to Families with Familial Idiopathic Scoliosis and Kyphosis. G3 (Bethesda) 6:1707-12
Petrovic, Dusan; Pivin, Edward; Ponte, Belen et al. (2016) Sociodemographic, behavioral and genetic determinants of allostatic load in a Swiss population-based study. Psychoneuroendocrinology 67:76-85
Castiblanco, John; Sarmiento-Monroy, Juan Camilo; Mantilla, Ruben Dario et al. (2015) Familial Aggregation and Segregation Analysis in Families Presenting Autoimmunity, Polyautoimmunity, and Multiple Autoimmune Syndrome. J Immunol Res 2015:572353
Shetty, Priya B; Tang, Hua; Feng, Tao et al. (2015) Variants for HDL-C, LDL-C, and triglycerides identified from admixture mapping and fine-mapping analysis in African American families. Circ Cardiovasc Genet 8:106-13

Showing the most recent 10 out of 922 publications