Being able to correctly infer the perturbed pathways interactions that cause the disease from a list of differentially expressed (DE) genes or proteins may be the key to transforming the now abundant high- throughput expression data into biological knowledge. However, the current methods that aim to bridge this gap by using the DE genes to identify significantly impacted pathways are rather unsophisticated. Many if not all such methods often treat the pathways as simple sets of genes, and either ignore or under-utilize the very essence of such pathways: the graphs that describe the complex ways in which genes interact with each other. Our preliminary results show that the existing pathway analysis methods often provide incorrect results. In addition, the p-values they provide are inappropriately influenced by common pathway genes through a pathway coupling phenomenon. The goal of this proposal is to address the problems above by developing methods that implement a systems biology approach for the analysis of gene signaling pathways. Given a disease characterized using a high throughput gene expression approach, we propose an impact analysis technique able to: i) identify the significantly impacted pathways, and ii) propose specific gene signaling cascades that could potentially be targeted by drugs. This technique takes into consideration biologically important factors currently neglected by the existing pathway analysis tools including: i) the gene interactions as described by the pathway graph, ii) the gene type and position in the given pathways, and iii) the efficiency with which perturbations propagate from one gene to another across the pathway. Furthermore, we propose to study the pathway coupling and develop appropriate correction methods for the hypergeometric, GSEA and pathway impact analysis methods. This analysis will be applied to diabetes and obesity research. The novel approach developed here will be applied to microarray data from white fat of mice treated with low dose CL 316,243 (CL), which has been shown to have the potential to transform white fat into brown fat (which burns energy rather than store it). We will also apply this approach on data collected during the differentiation of 3T3-L1 pre-adipocytes after induction of adipogenesis. The goal here is three-fold: i) to validate the novel approach;ii) to assess the efficiency with which gene perturbations propagate on each KEGG pathway during adipogenesis and fat tissue remodeling, and construct a custom set of pathways relevant to obesity and diabetes;and iii) to identify pathways and signaling cascades that are important in adipogenesis and fat tissue remodeling. The methods developed will be made available as a Bioconductor package, as well as a free Java web application. Our team has excellent qualifications and track record in developing novel algorithms for the analysis of high-throughput data, multiple hypothesis testing, as well as obesity and diabetes.

Public Health Relevance

In molecular biology and genetics, our data gathering capabilities have greatly surpassed the available data analysis techniques. Even though high-throughput data is relatively easy to be obtained, understanding the underlying phenomena is as challenging as ever, if not more so. There is a large gap between our ability to collect data and our ability to interpret it. We are proposing an effective way to analyze the vast amount of data that has been and will continue to be collected. The proposed approach will reliable identify the most impacted gene signaling pathways in a given condition. This can greatly facilitate pinpointing the causes of the observed phenomena and therefore has the potential to have a great impact in many public health areas by facilitating the identification of putative molecular causes of disease, as well as the identification of potential therapeutic interventions and their potential side effects. The main focus of this proposal is on obesity and diabetes. Achieving of the goals described here can lead to new potential therapeutic interventions to help millions of people suffering from these conditions. However, due to the generality of the methods proposed, the benefits of the proposed research are expected to impact a larger number of research areas spanning from cancer, to development, to aging as well as any other life science area in which high-throughput methods (e.g. DNA microarrays, protein microarrays, metabolomics, etc.) are used.

Agency
National Institute of Health (NIH)
Institute
National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)
Type
Research Project (R01)
Project #
1R01DK089167-01
Application #
7949001
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Sechi, Salvatore
Project Start
2010-07-15
Project End
2014-06-30
Budget Start
2010-07-15
Budget End
2011-06-30
Support Year
1
Fiscal Year
2010
Total Cost
$334,013
Indirect Cost
Name
Wayne State University
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
001962224
City
Detroit
State
MI
Country
United States
Zip Code
48202
Peyvandipour, Azam; Saberian, Nafiseh; Shafi, Adib et al. (2018) A novel computational approach for drug repurposing using systems biology. Bioinformatics 34:2817-2825
Nguyen, Tin; Mitrea, Cristina; Draghici, Sorin (2018) Network-Based Approaches for Pathway Level Analysis. Curr Protoc Bioinformatics 61:8.25.1-8.25.24
Shafi, Adib; Mitrea, Cristina; Nguyen, Tin et al. (2018) A survey of the approaches for identifying differential methylation using bisulfite sequencing data. Brief Bioinform 19:737-753
Talwar, Harvinder; Hanoudi, Samer Najeeb; Draghici, Sorin et al. (2018) Novel T7 Phage Display Library Detects Classifiers for Active Mycobacterium Tuberculosis Infection. Viruses 10:
Ansari, Sahar; Voichita, Calin; Donato, Michele et al. (2017) A novel pathway analysis approach based on the unexplained disregulation of genes. Proc IEEE Inst Electr Electron Eng 105:482-495
Nguyen, Tin; Mitrea, Cristina; Tagett, Rebecca et al. (2017) DANUBE: Data-driven meta-ANalysis using UnBiased Empirical distributions-applied to biological pathway analysis. Proc IEEE Inst Electr Electron Eng 105:496-515
Ahsan, Sidra; Dr?ghici, Sorin (2017) Identifying Significantly Impacted Pathways and Putative Mechanisms with iPathwayGuide. Curr Protoc Bioinformatics 57:7.15.1-7.15.30
Diaz, Diana; Donato, Michele; Nguyen, Tin et al. (2017) MICRORNA-AUGMENTED PATHWAYS (mirAP) AND THEIR APPLICATIONS TO PATHWAY ANALYSIS AND DISEASE SUBTYPING. Pac Symp Biocomput 22:390-401
Talwar, Harvinder; Hanoudi, Samer Najeeb; Geamanu, Andreea et al. (2017) Detection of Cystic Fibrosis Serological Biomarkers Using a T7 Phage Display Library. Sci Rep 7:17745
Nguyen, Tin; Tagett, Rebecca; Diaz, Diana et al. (2017) A novel approach for data integration and disease subtyping. Genome Res 27:2025-2039

Showing the most recent 10 out of 26 publications