This project involves the development of new statistical methodologies and computational tools for network-based integrative analysis of epigenetic risk factors of cardiovascular diseases (CVD). While the advent of omics data from new technologies has facilitated the study of epigenetic factors, existing methodologies often do not account for complexities of biological data such as correlations due to interactions of genes/proteins as part of biological pathways and fail to e?ciently integrate diverse omics data sets for instance genetic variation, DNA methylation and gene expression. The methodologies proposed in this project, and the software tools that will be developed to implement them, address these shortcomings, and facilitate further research by the biomedical community to gain a better understanding of the underlying biology of CVD, and to develop new diagnostic biomarkers and potential targets for therapies. The proposed methodologies are motivated by the study of epigenetic data from the Multi-Ethnic Study of Atherosclerosis (MESA), and include (i) a network-based pathway enrichment analysis method that incorporates available knowledge of interactions among genes and proteins while complementing and re?ning such information (Aim 1A), as well as its extension for analysis of multiple types of omics data (Aim 1B), and (ii) an integrative analysis framework to identify associations among gene expression levels and DNA methylation (Aim 2A) and identify common epigenetic factors of multiple CVD phenotypes through integrated analysis of DNA methylation and mRNA expression data (Aim 2B). We will develop e?cient and user-friendly software tools for the proposed methods (Aim 3), which will be made freely available to the public after extensive tests using both simulated data, as well as real data from MESA.
The proposed research addresses the need for development of new statistical machine learning methods for analysis of diverse epigenetic data from multiple cardiovascular disease outcomes. The proposed training activities are designed to enhance the applicant's knowledge of epigenetics and physiology of cardiovascular diseases (CVD), and to further his career as an independent investigator in the area of computational epigenetics for CVD.
Showing the most recent 10 out of 14 publications