In this collaborative R01, "Networks from multidimensional data for schizophrenia and related disorders," submitted in response to RFA-MH-12-020, we propose to develop methods for integrating a broad range of genomic, imaging, and clinical data, hosting all data, methods, and results on a novel, flexible and extensible computing platform. Subsequently, these data and methods will be used to establish workflows available to the research community to integrate and mine the data for discovery. As proof-of-concept, multiple datasets for schizophrenia (SCZ) will be used and then extended to additional mental disorders. Specifically, in AIM 1 we will adapt the Synapse platform at Sage Bionetworks to host, QC, normalize, and transform data into an analysis-ready format. Synapse will also enable computation, storage, sharing, and integration of SCZ-specific data with pre-existing public data. The Sage platform will be hosted by the data center in the Institute of Genomics and Multiscale Biology at the Mount Sinai School of Medicine and will consist of a data warehouse (organized file systems and databases), a web service tier, and an applications tier adapted to facilitate network reconstruction and, more generally, model building with SCZ data.
In AIM 2, we will develop a pipeline of analytic methods that includes new and existing tools for the primary processing of multiple types of data. Using direct experimental findings, we will generate primary analysis datasets (e.g., expression QTLs, imaging QTLs, GWAS with SNP/CNV genotypes, RNA-seq signatures, and DNA methylation and RNA-seq associations), construct interaction networks with population-based expression and imaging datasets (e.g., gene expression, functional MRI, and structural MRI), transform all data and results into analysis-ready formats, and construct a standard set of queries to facilitate SCZ gene discovery.
In AIM 3, following platform development, generation of primary analysis datasets, and basic network construction, we will develop and apply methods to construct integrated, higher-order molecular networks and more generalized models to enhance our understanding of the genetic loci and gene networks underlying schizophrenia. Using a Bayesian framework, we will develop methods that identify network modules and the underlying genetic variance component (including epistatic interactions), incorporate prior disease information and extensive prior biological knowledge to construct more detailed probabilistic causal models, and identify causal regulators of networks associated with SCZ.
In AIM 4, we will assess the extent to which the models validate in independent SCZ data and in bipolar disorder and autism. This proposal should have a major impact on the field because it offers a solution, in the form of new platforms and analytic methods, to the bottleneck in gene discovery that results from our limited ability to fully analyze the data currently available on large samples of individuals suffering from mental illness. This proposal will make possible the efficient use of this wealth of multi-dimensional data.

Public Health Relevance

In the United States, over a million people have schizophrenia. The costs are staggering in human and financial terms. We propose to develop methods for integrating a broad range of genomic data into a novel, flexible and extensible computing platform. Subsequently, these data will be used to develop a pipeline of algorithms for integrating and mining the data. We will use as a proof-of-concept multiple datasets for schizophrenia, and then extend this to additional mental disorders.

Agency
National Institutes of Health (NIH)
Institute
National Institute of Mental Health (NIMH)
Type
Research Project (R01)
Project #
5R01MH097276-02
Application #
8501690
Study Section
Special Emphasis Panel (ZMH1-ERB-C (02))
Program Officer
Senthil, Geetha
Project Start
2012-07-01
Project End
2015-06-30
Budget Start
2013-07-01
Budget End
2014-06-30
Support Year
2
Fiscal Year
2013
Total Cost
$750,279
Indirect Cost
$209,749
Name
Icahn School of Medicine at Mount Sinai
Department
Psychiatry
Type
Schools of Medicine
DUNS #
078861598
City
New York
State
NY
Country
United States
Zip Code
10029
Agrawal, A; Chou, Y-L; Carey, C E et al. (2017) Genome-wide association study identifies a novel locus for cannabis dependence. Mol Psychiatry :
Carcamo-Orive, Ivan; Hoffman, Gabriel E; Cundiff, Paige et al. (2017) Analysis of Transcriptional Variability in a Large Human iPSC Library Reveals Genetic and Non-genetic Determinants of Heterogeneity. Cell Stem Cell 20:518-532.e9
Mancuso, Nicholas; Shi, Huwenbo; Goddard, Pagé et al. (2017) Integrating Gene Expression with Summary Association Statistics to Identify Genes Associated with 30 Complex Traits. Am J Hum Genet 100:473-487
Zhu, Lingxue; Lei, Jing; Devlin, Bernie et al. (2017) TESTING HIGH-DIMENSIONAL COVARIANCE MATRICES, WITH APPLICATION TO DETECTING SCHIZOPHRENIA RISK GENES. Ann Appl Stat 11:1810-1831
Jasinska, Anna J; Zelaya, Ivette; Service, Susan K et al. (2017) Genetic variation and gene expression across multiple tissues and developmental stages in a nonhuman primate. Nat Genet 49:1714-1721
Li, Ming; Jaffe, Andrew E; Straub, Richard E et al. (2016) A human-specific AS3MT isoform and BORCS7 are molecular risk factors in the 10q24.32 schizophrenia-associated locus. Nat Med 22:649-56
Topol, Aaron; Zhu, Shijia; Hartley, Brigham J et al. (2016) Dysregulation of miRNA-9 in a Subset of Schizophrenia Patient-Derived Neural Progenitor Cells. Cell Rep 15:1024-1036
Sanderson, Saskia C; Linderman, Michael D; Suckiel, Sabrina A et al. (2016) Motivations, concerns and preferences of personal genome sequencing research participants: Baseline findings from the HealthSeq project. Eur J Hum Genet 24:14-20
Fromer, Menachem; Roussos, Panos; Sieberts, Solveig K et al. (2016) Gene expression elucidates functional impact of polygenic risk for schizophrenia. Nat Neurosci 19:1442-1453
Chang, Rui; Karr, Jonathan R; Schadt, Eric E (2015) Causal inference in biology networks with integrated belief propagation. Pac Symp Biocomput :359-70

Showing the most recent 10 out of 16 publications