(Data Management and Analysis Core: Aikseng Ooi and Nirav Merchant) The University of Arizona Superfund Research Program (UA SRP) will generate volumes and types of data that are not manageable in traditional laboratory settings. The Data Management and Analysis Core (DMAC) will function as the primary service for UA SRP into large biological, geophysical, and chemical datasets, including but not limited to RNA sequencing, chromatin immunoprecipitation sequencing, exome sequencing, metabolomics, metagenomics, microbiome amplicon sequencing, geospatial positioning, analytical chemistry, and imaging. DMAC enables investigators by performing three core functions: (1) DMAC will lead the housing of all data in an easy-to-access data repository system: CyVerse. Cyverse is a computational infrastructure consisting of hardware, software, and personnel that are designed to handle huge datasets and complex analyses, and is maintained at the University of Arizona. DMAC will utilize a reference implementation (RI) that divides data into five different levels for easy data sharing, processing, and analyzing. Lowest levels (level 1) will be raw data, while higher levels (level 5) will be file formats utilizable in graphics visualizations. DMAC will support these processes with help from on-staff statisticians and bioinformaticians who can devise analysis strategies for individual investigators. In addition to data storage, DMAC will orchestrate sample management using Fulcrum software. Fulcrum allows barcoding, global positioning, and annotation of biological samples in an easy-to-use application available on both traditional workstations and mobile platforms. Fulcrum is critical for point-of-generation sample tracking due to its mobility. (2) Beyond data and sample management, DMAC will perform both standard and custom computational analyses of the data. This will include DMAC-lead investigations into ?feature signatures?, which address the predictability of data across UA SRP projects; for example, can the gene expression changes associated with a particular arsenic treatment predict metagenomics changes in a similarly treated sample? In conjunction with UA SRP investigators, DMAC will apply traditional algorithms, or develop novel algorithms as needed, to identify signatures for the different data types collected. (3) The storage and analytical capabilities of DMAC will be integrated into a user-friendly web application that allows individual investigators to retrieve, manipulate, and visualize UA SRP data. The web application will be implemented using an in-house maintained server in conjunction with the R statistical environment. DMAC is thus an integral component of the UA SRP proposal that utilizes state-of-the-art technologies to enable the discovery of novel insights into arsenic exposure and its role in health and disease.

Public Health Relevance

(Data Management and Analysis Core: Aikseng Ooi and Nirav Merchant) Understanding the roles of arsenic in complex diseases like diabetes requires an integrated approach that combines vast quantities of information from multiple fields, including biomedical science, geology, and environmental science. The Data Management and Analysis Core serves as the primary storage and analytical service for the data generated from various experiments investigating arsenic toxicity. DMAC builds upon the CyVerse infrastructure and utilizes state-of-the-art hardware, software, and analytics to merge the findings of various scientific disciplines into new multifaceted insights.

Agency
National Institute of Health (NIH)
Institute
National Institute of Environmental Health Sciences (NIEHS)
Type
Hazardous Substances Basic Research Grants Program (NIEHS) (P42)
Project #
2P42ES004940-31
Application #
9841036
Study Section
Special Emphasis Panel (ZES1)
Project Start
Project End
2025-01-31
Budget Start
2020-04-01
Budget End
2021-03-31
Support Year
31
Fiscal Year
2020
Total Cost
Indirect Cost
Name
University of Arizona
Department
Type
DUNS #
806345617
City
Tucson
State
AZ
Country
United States
Zip Code
85721
Khan, Muhammad Amjad; Ding, Xiaodong; Khan, Sardar et al. (2018) The influence of various organic amendments on the bioavailability and plant uptake of cadmium present in mine-degraded soil. Sci Total Environ 636:810-817
Yellowhair, Monica; Romanotto, Michelle R; Stearns, Diane M et al. (2018) Uranyl acetate induced DNA single strand breaks and AP sites in Chinese hamster ovary cells. Toxicol Appl Pharmacol 349:29-38
Fu, Xiaori; Dionysiou, Dionysios D; Brusseau, Mark L et al. (2018) Enhanced effect of EDDS and hydroxylamine on Fe(II)-catalyzed SPC system for trichloroethylene degradation. Environ Sci Pollut Res Int 25:15733-15742
Duncan, Candice M; Brusseau, Mark L (2018) An assessment of correlations between chlorinated VOC concentrations in tree tissue and groundwater for phytoscreening applications. Sci Total Environ 616-617:875-880
Virgone, K M; Ramirez-Andreotta, M; Mainhagu, J et al. (2018) Effective integrated frameworks for assessing mining sustainability. Environ Geochem Health 40:2635-2655
Namdari, Soodabeh; Karimi, Neamat; Sorooshian, Armin et al. (2018) Impacts of climate and synoptic fluctuations on dust storm activity over the Middle East. Atmos Environ (1994) 173:265-276
Hossein Mardi, Ali; Khaghani, Ali; MacDonald, Alexander B et al. (2018) The Lake Urmia environmental disaster in Iran: A look at aerosol pollution. Sci Total Environ 633:42-49
Dehghani, Mansooreh; Fazlzadeh, Mehdi; Sorooshian, Armin et al. (2018) Characteristics and health effects of BTEX in a hot spot for urban pollution. Ecotoxicol Environ Saf 155:133-143
Pu, Mengjie; Guan, Zeyu; Ma, Yongwen et al. (2018) Synthesis of iron-based metal-organic framework MIL-53 as an efficient catalyst to activate persulfate for the degradation of Orange G in aqueous solution. Appl Catal A Gen 549:82-92
Brusseau, Mark L; Guo, Zhilin (2018) The integrated contaminant elution and tracer test toolkit, ICET3, for improved characterization of mass transfer, attenuation, and mass removal. J Contam Hydrol 208:17-26

Showing the most recent 10 out of 497 publications