Causal Discovery Algorithms for Translational Research with High-Throughput DataThe long-term goal of this project is to provide to the biomedical community next-generation causalalgorithms to facilitate discovery of disease molecular pathways and causative as well as predictivebiomarkers and molecular signatures from high-throughput data. Such knowledge and methods arenecessary toward earlier and more accurate diagnosis and prognosis, personalized medicine, andrational drug design.If successful, the proposed research will have significant and wide methodological and practicalimplications spanning several areas of biomedicine with a primary focus and immediate benefits inhigh-throughput diagnostics and personalized medicine. It will provide significantly improvedcomputational methods and deeper theoretical understanding related to producing molecularsignatures and understanding mechanisms of disease and concomitant leads for new drugs. It willprovide evidence about applicability of novel causal methods in other types of data. It will generateinsights in specific pathways of lung cancer in humans. It will deepen our understanding and solutionsto the Rashomon effect in omics data. The proposed research will also shed light on the operationalvalue of the stability heuristic. Finally the research will engage the international research community toaddress open computational causal discovery problems relevant to high-throughput and otherbiomedical data.
Aim 1. Evaluate and characterize several novel causal algorithms for biomarkerselection, molecular signature creation and reverse network engineering using real, simulated,resimulated, and experimental datasets. Study generality of the methods by means ofapplicability to non- omics datasets.
Aim 2. Evaluate and characterize, novel and state of the art causal algorithms againststate-of-the-art non-causal and quasi-causal algorithms.
Aim 3. Systematically investigate the Rashomon effect as it applies to biomarker andsignature multiplicity.
Aim 4. Systematically investigate the utility of applying the stability heuristic forcausal discovery.
Aim 5. Derive novel biomarkers, pathways and hypotheses for lung cancer.
Aim 6. Induce novel solutions through an international causal discovery competition.
Aim 7. Disseminate findings.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
High Priority, Short Term Project Award (R56)
Project #
7R56LM007948-05
Application #
7869031
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Ye, Jane
Project Start
2003-08-01
Project End
2011-11-30
Budget Start
2009-12-01
Budget End
2011-11-30
Support Year
5
Fiscal Year
2008
Total Cost
$344,512
Indirect Cost
Name
New York University
Department
Pathology
Type
Schools of Medicine
DUNS #
121911077
City
New York
State
NY
Country
United States
Zip Code
10016
Statnikov, Alexander; Lytkin, Nikita I; Lemeire, Jan et al. (2013) Algorithms for Discovery of Multiple Markov Boundaries. J Mach Learn Res 14:499-566
Feig, Jonathan E; Vengrenyuk, Yuliya; Reiser, Vladimir et al. (2012) Regression of atherosclerosis is characterized by broad changes in the plaque macrophage transcriptome. PLoS One 7:e39790
Narendra, Varun; Lytkin, Nikita I; Aliferis, Constantin F et al. (2011) A comprehensive assessment of methods for de-novo reverse-engineering of genome-scale regulatory networks. Genomics 97:7-18
Alekseyenko, Alexander V; Lytkin, Nikita I; Ai, Jizhou et al. (2011) Causal graph-based analysis of genome-wide association data in rheumatoid arthritis. Biol Direct 6:25
Fu, Lawrence D; Aphinyanaphongs, Yindalon; Wang, Lily et al. (2011) A comparison of evaluation metrics for biomedical journals, articles, and websites in terms of sensitivity to topic. J Biomed Inform 44:587-94
Lytkin, Nikita I; McVoy, Lauren; Weitkamp, Jörn-Hendrik et al. (2011) Expanding the understanding of biases in development of clinical-grade molecular signatures: a case study in acute respiratory viral infections. PLoS One 6:e20662
Espinosa, Lluis; Cathelin, Severine; D'Altri, Teresa et al. (2010) The Notch/Hes1 pathway sustains NF-?B activation through CYLD repression in T cell leukemia. Cancer Cell 18:268-81
Statnikov, Alexander; McVoy, Lauren; Lytkin, Nikita et al. (2010) Improving development of the molecular signature for diagnosis of acute respiratory viral infections. Cell Host Microbe 7:100-1; author reply 102
Statnikov, Alexander; Aliferis, Constantin F (2010) Analysis and computational dissection of molecular signature multiplicity. PLoS Comput Biol 6:e1000790
Aliferis, Constantin F; Statnikov, Alexander; Tsamardinos, Ioannis et al. (2009) Factors influencing the statistical power of complex data analysis protocols for molecular signature development from microarray data. PLoS One 4:e4922

Showing the most recent 10 out of 11 publications