BIGDATA: Small DCM: ESCA DA Computational infrastructure for massive neurosci

Mitra, Partha

Abstract

Ideally, as neuroscientists collect terabytes of image stacks, the data are automatically processed for open access and analysis. Yet, while several labs around the world are collecting data at unprecedented rates- up to terabytes per day-the computational technologies that facilitate streaming data-intensive computing remain absent. Also deploying data-intensive compute clusters is beyond the means and abilities of most experimental labs. This project will extend, develop, and deploy such technologies. To demonstrate these tools, we will utilize them in support of the ongoing mouse brain architecture (MBA) project, which already has amassed over 0.5 petabytes (PBs) of image data. The main computational challenges posed by these datasets are ones of scale. The tasks that follow remain relatively stereotyped across acquisition modalities. Until now, labs collecting data on this scale have been almost entirely isolated, left to """"""""reinvent the wheel"""""""" for each of these problems. Moreover, the extant solutions are insufficient for a number of reasons: they often include numerous excel spreadsheets that rely on manual data entry, they lack scalable scientific database backends, and they run on ad hoc clusters not specifically designed for the computational tasks at hand.
We aim to augment the current state of the art by implementing the following technological advancements into the MBA project pipeline: (1) Data Management will consist of a unified system that automatically captures metadata, launches processing pipelines, and provides quality control feedback in minutes instead of hours. (2) Data Processing tasks will run algorithms """"""""out-of-core"""""""", appropriate for their computational requirements, including registration, alignment, and semantic segmentation of cell bodies and processes. (3) Data Storage will automatically build databases for storing multimodal image data and extracted annotations learned from the machine vision algorithms. These databases will be spatially co-registered and stored on an optimized heterogeneous compute cluster. (4) Data Access will be automatically available to everyone-including all the image data and data derived products-via Web-services, including 3D viewing, downloading, and further processing. (5) Data Analytics will extend random graph models suitable for multiscale circuit graphs.

Public Health Relevance

Nervous system disorders are responsible for approximately 30% of the total burden of illness in the United States. Whole brain neuroanatomy-available from massive neuroscientific image stacks-is widely believed to be a key missing link in our ability to prevent and treat such illnesses. Thus, this project aims to close this gap via the development and application of BIGDATA tools for management, storage, access, and analytics.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute on Drug Abuse (NIDA)
Type: Research Project (R01)
Project #: 5R01DA036400-02
Application #: 8631080
Study Section: Special Emphasis Panel (ZRG1)
Program Officer: Pollock, Jonathan D

Project Start: 2013-03-15
Project End: 2016-01-31
Budget Start: 2014-02-01
Budget End: 2015-01-31
Support Year: 2
Fiscal Year: 2014
Total Cost
Indirect Cost

Institution

Name: Cold Spring Harbor Laboratory
Department
Type
DUNS #

City: Cold Spring Harbor
State: NY
Country: United States
Zip Code: 11724

Related projects


NIH 2015 R01 DA	BIGDATA: Small DCM: ESCA DA Computational infrastructure for massive neurosci Mitra, Partha Pratim / Cold Spring Harbor Laboratory	$248,295
NIH 2014 R01 DA	BIGDATA: Small DCM: ESCA DA Computational infrastructure for massive neurosci Mitra, Partha Pratim / Cold Spring Harbor Laboratory
NIH 2013 R01 DA	BIGDATA: Small DCM: ESCA DA Computational infrastructure for massive neurosci Mitra, Partha Pratim / Cold Spring Harbor Laboratory	$249,999

Publications

Majka, Piotr; Chaplin, Tristan A; Yu, Hsin-Hao et al. (2016) Towards a comprehensive atlas of cortical connections in a primate brain: Mapping tracer injection studies of the common marmoset into a reference digital template. J Comp Neurol 524:2161-81

Mitra, Partha P (2014) The circuit architecture of whole brains at the mesoscopic scale. Neuron 83:1273-83

Comments

Be the first to comment on Partha Mitra's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: