COINSTAC: Decentralized, Scalable Analysis of Loosely Coupled Data

Calhoun, Vince; Hutchison, Kent

Abstract

The brain imaging community is greatly benefiting from extensive data sharing efforts currently underway. However, there is a significant gap in existing strategies which focus on anonymized, post-hoc sharing of either 1) full raw or preprocessed data [in the case of open studies] or 2) manually computed summary measures [such as hippocampal volume, in the case of closed (or not yet shared) studies] which we propose to address. Current approaches to data sharing often include significant logistical hurdles both for the investigator sharing the dat as well as for the individual requesting the data (e.g. often times multiple data sharing agreements and approvals are required from US and international institutions). This needs to change, so that the scientific community becomes a venue where data can be collected, managed, widely shared and analyzed while also opening up access to the (many) data sets which are not currently available (see recent overview on this from our group). The large amount of existing data requires an approach that can analyze data in a distributed way while also leaving control of the source data with the individual investigator; this motivates dynamic, decentralized way of approaching large scale analyses. We are proposing a peer-to-peer system called the Collaborative Informatics and Neuroimaging Suite Toolkit for Anonymous Computation (COINSTAC). The system will provide an independent, open, no-strings-attached tool that performs analysis on datasets distributed across different locations. Thus, the step of actually aggregating data can be avoided, while the strength of large-scale analyses can be retained. To achieve this, in Aim 1, the uniform data interfaces that we propose will make it easy to share and cooperate. Robust and novel quality assurance and replicability tools will also be incorporated. Collaboration and data sharing will be done through forming temporary (need and project-based) virtual clusters of studies performing automatically generated local computation on their respective data and aggregating statistics in global inference procedures. The communal organization will provide a continuous stream of large scale projects that can be formed and completed without the need of creating new rigid organizations or project-oriented storage vaults.
In Aim 2, we develop, evaluate, and incorporate privacy-preserving algorithms to ensure that the data used are not re-identifiable even with multiple re-uses. We also will develop advanced distributed and privacy preserving approaches for several key multivariate families of algorithms (general linear model, matrix factorization [e.g. independent component analysis], classification) to estimate intrinsic networks and perform data fusion. Finally, in Aim 3, we will demonstrate the utility of this approach in a proof of concept study through distributed analyses of substance abuse datasets across national and international venues with multiple imaging modalities.

Public Health Relevance

Hundreds of millions of dollars have been spent to collect human neuroimaging data for clinical and research purposes, many of which don't have data sharing agreements or collect sensitive data which are not easily shared, such as genetics. Opportunities for large scale aggregated analyses to infer health-relevant facts create new challenges in protecting the privacy of individuals' data. Open sharing of raw data, though desirable from the research perspective, and growing rapidly, is not a good solution for a large number of datasets which have additional privacy risks or IRB concerns. The COINSTAC solution we are proposing will capture this 'missing data' and allow for pooling of both open and 'closed' repositories by developing privacy preserving versions of widely-used algorithms and incorporating within an easy-to-use platform which enables distributed computation. In addition, COINSTAC will accelerate research on both open and closed data by offering a distributed computational solution for a large toolkit of widely used algorithms.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute on Drug Abuse (NIDA)
Type: Research Project (R01)
Project #: 7R01DA040487-05
Application #: 9938885
Study Section: Neuroscience and Ophthalmic Imaging Technologies Study Section (NOIT)
Program Officer: Pariyadath, Vani

Project Start: 2019-06-01
Project End: 2021-04-30
Budget Start: 2019-06-01
Budget End: 2021-04-30
Support Year: 5
Fiscal Year: 2019
Total Cost
Indirect Cost

Institution

Name: Georgia State University
Department: Miscellaneous
Type: Organized Research Units
DUNS #: 837322494

City: Atlanta
State: GA
Country: United States
Zip Code: 30302

Related projects


NIH 2019 R01 DA	COINSTAC: Decentralized, Scalable Analysis of Loosely Coupled Data Calhoun, Vince D.; Hutchison, Kent E. / Georgia State University
NIH 2018 R01 DA	COINSTAC: decentralized, scalable analysis of loosely coupled data Calhoun, Vince D.; Hutchison, Kent E. / The Mind Research Network
NIH 2018 R01 DA	COINSTAC: decentralized, scalable analysis of loosely coupled data Calhoun, Vince D.; Hutchison, Kent E. / The Mind Research Network
NIH 2017 R01 DA	COINSTAC: decentralized, scalable analysis of loosely coupled data Calhoun, Vince D.; Hutchison, Kent E. / The Mind Research Network
NIH 2016 R01 DA	COINSTAC: decentralized, scalable analysis of loosely coupled data Calhoun, Vince D.; Hutchison, Kent E. / The Mind Research Network
NIH 2015 R01 DA	COINSTAC: decentralized, scalable analysis of loosely coupled data Calhoun, Vince D.; Hutchison, Kent E. / The Mind Research Network

Publications

Yaesoubi, Maziar; Adal?, Tülay; Calhoun, Vince D (2018) A window-less approach for capturing time-varying connectivity in fMRI data reveals the presence of states with variable rates of change. Hum Brain Mapp 39:1626-1636

Faghiri, Ashkan; Stephen, Julia M; Wang, Yu-Ping et al. (2018) Changing brain connectivity dynamics: From early childhood to adulthood. Hum Brain Mapp 39:1108-1117

Vergara, Victor M; Weiland, Barbara J; Hutchison, Kent E et al. (2018) The Impact of Combinations of Alcohol, Nicotine, and Cannabis on Dynamic Brain Connectivity. Neuropsychopharmacology 43:877-890

Miller, Robyn L; Abrol, Anees; Adali, Tulay et al. (2018) Resting-State fMRI Dynamics and Null Models: Perspectives, Sampling Variability, and Simulations. Front Neurosci 12:551

Wilcox, Claire E; Claus, Eric D; Calhoun, Vince D et al. (2018) Default mode network deactivation to smoking cue relative to food cue predicts treatment outcome in nicotine use disorder. Addict Biol 23:412-424

Yu, Qingbao; Du, Yuhui; Chen, Jiayu et al. (2018) Application of Graph Theory to Assess Static and Dynamic Brain Connectivity: Approaches for Building Brain Graphs. Proc IEEE Inst Electr Electron Eng 106:886-906

Steele, Vaughn R; Maurer, J Michael; Arbabshirani, Mohammad R et al. (2018) Machine Learning of Functional Magnetic Resonance Imaging Network Connectivity Predicts Substance Abuse Treatment Completion. Biol Psychiatry Cogn Neurosci Neuroimaging 3:141-149

Ming, Jing; Verner, Eric; Sarwate, Anand et al. (2017) COINSTAC: Decentralizing the future of brain imaging analysis. F1000Res 6:1512

Vergara, Victor M; Liu, Jingyu; Claus, Eric D et al. (2017) Alterations of resting state functional network connectivity in the brain of nicotine and alcohol users. Neuroimage 151:45-54

Abrol, Anees; Damaraju, Eswar; Miller, Robyn L et al. (2017) Replicability of time-varying connectivity patterns in large resting state fMRI samples. Neuroimage 163:160-176

Showing the most recent 10 out of 28 publications

Comments

Be the first to comment on Vince Calhoun's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: