A fundamental challenge in life sciences is the characterization of genetic factors that underlie phenotypic differences. Thanks to the advanced sequencing technologies, an enormous amount of genetic variants have been identified and cataloged. Such data hold great potential to understand how genes affect phenotypes and contribute to the susceptibility to environmental stimulus. However, the existing computational methods for analyzing and interpreting the high?throughput genetic data are still in their infancy. We propose to systematically investigate the computational and statistical principles in modeling and discovering genetic basis of complex phenotypes. The proposed research provides answers to the following fundamental questions in genetic association study: (1) How to effectively and efficiently assess statistical significance of the findings? (2) How to account for the relatedness between samples in genetic association study? (3) How to accurately capture possible interactions between multiple genetic factors and their joint contribution to phenotypic variation? In particular, we will develop data structures and efficient algorithms for accurate and robust significance assessment that account for local population structure and joint effect of multiple genetic factors. The proposed computational tools will be integrated into software packages under common application framework adopted by the broad scientific community.
A fundamental challenge in life sciences is the characterization of genetic factors that underlie phenotypic differences. Existing methods are not able to adequately address the complexity of high throughput data. Innovative computational models and methods developed in this project will enable scientists more effectively analyze the research data, thus further understanding of human diseases and speed the development diagnostic tools, cures, and therapies.
Ni, Jingchao; Cheng, Wei; Fan, Wei et al. (2018) ComClus: A Self-Grouping Framework for Multi-Network Clustering. IEEE Trans Knowl Data Eng 30:435-448 |
Goldfarb, Dennis; Lafferty, Michael J; Herring, Laura E et al. (2018) Approximating Isotope Distributions of Biomolecule Fragments. ACS Omega 3:11383-11391 |
Ju, Chelsea J-T; Zhao, Zhuangtian; Wang, Wei (2017) Efficient Approach to Correct Read Alignment for Pseudogene Abundance Estimates. IEEE/ACM Trans Comput Biol Bioinform 14:522-533 |
Yu, Wenchao; Aggarwal, Charu C; Wang, Wei (2017) Temporally Factorized Network Modeling for Evolutionary Network Analysis. Proc Int Conf Web Search Data Min 2017:455-464 |
Ni, Jingchao; Cheng, Wei; Fan, Wei et al. (2016) Self-Grouping Multi-Network Clustering. Proc IEEE Int Conf Data Min 2016:1119-1124 |
Cheng, Wei; Guo, Zhishan; Zhang, Xiang et al. (2016) CGC: A Flexible and Robust Approach to Integrating Co-Regularized Multi-Domain Graph for Clustering. ACM Trans Knowl Discov Data 10: |
Ni, Jingchao; Koyuturk, Mehmet; Tong, Hanghang et al. (2016) Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model. BMC Bioinformatics 17:453 |
Cheng, Wei; Shi, Yu; Zhang, Xiang et al. (2016) Sparse regression models for unraveling group and individual associations in eQTL mapping. BMC Bioinformatics 17:136 |
Cheng, Wei; Zhang, Kai; Chen, Haifeng et al. (2016) Ranking Causal Anomalies via Temporal and Dynamical Analysis on Vanishing Correlations. KDD 2016:805-814 |
Cheng, Wei; Shi, Yu; Zhang, Xiang et al. (2015) Fast and robust group-wise eQTL mapping using sparse graphical models. BMC Bioinformatics 16:2 |
Showing the most recent 10 out of 11 publications