The Data Management and Analysis Core (DMAC) plays a critical role in achieving the Center objectives by serving as a central repository of Center data and providing for cross-indexing and linkage of the diverse data sets produced by the environmental and biomedical projects and cores in the Center. The current PROTECT Database System holds nearly 7 million cleaned and secure data entities. The DMAC is responsible for the reliability of the data, including cleaning, replication and backup, as well as the protection of the data, including de-identification of human subjects, and secure and authenticated access. The DMAC allows data generated by the projects to be cross-indexed by all projects based on a global PROTECT Data Dictionary that includes common index fields (subject ID, GIS coordinates) to foster sharing and integration. DMAC also provides a rich set of modeling and statistical analysis toolsets and expertise to support Project-level objectives. The combined collection of data and tools allows PROTECT to work seamlessly across project domains and effectively ties environmental factors to human subject outcomes. To support Center goals and ensure its long-term impact, we will continue to build upon the rich infrastructure developed in the first eight years of this Center. We will continue to partner with EarthSoft, a major provider of environmental data management software, to provide enhanced database capabilities appropriate for all Center projects and cores. We will continue to support cleaning, indexing, documenting, and security of all Center-based data through a secure, online, database system, as well as provide a common suite of advanced statistical/analysis tools integrated into the backend of the database system. As part of the renewal, we will expand our analytics support by adding Jennifer Dy, Justin Manjourides and Bhramar Mukherjee to the DMAC, supporting machine learning and statistical analysis of mixtures that include phthalates, chlorinated volatile organic compounds (CVOCs), polycyclic aromatic hydrocarbons (PAHs), metals and pesticides across all projects. We will expand our use of mapping with a Geographic Information System (GIS), integrating analytics and mapping into a common framework, making our data easily understood by a wide range of communities. To achieve our Center-level aims that tie environmental factors to health-related outcomes, the DMAC will continue to develop a common suite of analysis and visualization tools based on GIS, SAS, R and Python, providing analysis tailored for each project, while also leveraging state-of-the-art software and frameworks. The specific statistical tools developed for mixtures analysis will use RStudio?s data cleaning, visualization and archiving functions, and will be disseminated through GitHub. The DMAC already has developed a suite of Data Mining tools that provide regression and clustering analysis in an integrated online visualization framework. Finally, we will work closely with the Community Engagement Core and the Training cores to provide education on data analysis, and to support data reporting and communication of results.

Public Health Relevance

The Data Management and Analysis Core (DMAC) plays a critical role in the efficient and secure transmission, storage, cleaning, harmonization, management, sharing, analysis and dissemination of biomedical and environmental data collected and analyzed across the PROTECT center. The DMAC provides software- engineered user-friendly analytic tools and automated pipelines to address the needs of the projects and other cores in PROTECT, and enables cross-project collaboration through data integration and harmonization, and effectively accommodates the growing volume and velocity of data collection. Effective study design, data management and analysis are key components to efficiently understanding the impact of environmental contamination on adverse pregnancy outcomes across the spectrum of research, training and community engagement activities in PROTECT.

Agency
National Institute of Health (NIH)
Institute
National Institute of Environmental Health Sciences (NIEHS)
Type
Hazardous Substances Basic Research Grants Program (NIEHS) (P42)
Project #
2P42ES017198-10
Application #
9839909
Study Section
Special Emphasis Panel (ZES1)
Project Start
Project End
2025-01-31
Budget Start
2019-12-01
Budget End
2020-11-30
Support Year
10
Fiscal Year
2020
Total Cost
Indirect Cost
Name
Northeastern University
Department
Type
DUNS #
001423631
City
Boston
State
MA
Country
United States
Zip Code
02115
Bedrosian, Leah D; Ferguson, Kelly K; Cantonwine, David E et al. (2018) Urinary phthalate metabolite concentrations in relation to levels of circulating matrix metalloproteinases in pregnant women. Sci Total Environ 613-614:1349-1352
Nazari, Roya; Raji?, Ljiljana; Xue, Yunfei et al. (2018) Degradation of 4-Chlorophenol in Aqueous Solution by Sono-Electro-Fenton Process. Int J Electrochem Sci 13:9214-9230
Zhou, Wei; Meng, Xiaoxiao; Rajic, Ljiljana et al. (2018) ""Floating"" cathode for efficient H2O2 electrogeneration applied to degradation of ibuprofen as a model pollutant. Electrochem commun 96:37-41
Ashrap, Pahriya; Watkins, Deborah J; Calafat, Antonia M et al. (2018) Elevated concentrations of urinary triclocarban, phenol and paraben among pregnant women in Northern Puerto Rico: Predictors and trends. Environ Int 121:990-1002
Ferguson, Kelly K; Meeker, John D; Cantonwine, David E et al. (2018) Environmental phenol associations with ultrasound and delivery measures of fetal growth. Environ Int 112:243-250
Cathey, Amber; Ferguson, Kelly K; McElrath, Thomas F et al. (2018) Distribution and predictors of urinary polycyclic aromatic hydrocarbon metabolites in two pregnancy cohort studies. Environ Pollut 232:556-562
Lan, Jiaqi; Rahman, Sheikh Mokhlesur; Gou, Na et al. (2018) Genotoxicity Assessment of Drinking Water Disinfection Byproducts by DNA Damage and Repair Pathway Profiling Analysis. Environ Sci Technol 52:6565-6575
Wang, Poguang; Giese, Roger W (2018) Interpretation of Mass Spectral Data for the Cisplatin 1,2 Intrastrand Guanine-Guanine Adduct. Chem Res Toxicol 31:1106-1107
Hojabri, Shirin; Rajic, Ljiljana; Alshawabkeh, Akram N (2018) Transient reactive transport model for physico-chemical transformation by electrochemical reactive barriers. J Hazard Mater 358:171-177
Ferguson, Kelly K; Kamai, Elizabeth M; Cantonwine, David E et al. (2018) Associations between repeated ultrasound measures of fetal growth and biomarkers of maternal oxidative stress and inflammation in pregnancy. Am J Reprod Immunol 80:e13017

Showing the most recent 10 out of 163 publications