The Data Management and Modeling Core (Core D) plays a critical role in achieving the Center objectives by serving as a central repository of Center data and provides for cross-indexing of the diverse data sets produced by the environmental and biomedical projects and cores in the Center. An additional role of Core D is to ensure the reliability of the data, including cleaning, replication and backup, as well as the protection of the data, including de-identification of human subjects, and secure and authenticated access. Core D allows data generated by the projects to be cross-indexed by all projects based on a global PROTECT Data Dictionary that includes common index fields (subject ID, GIS coordinates) to foster sharing and integration. Core D also provides a rich set of modeling and statistical analysis toolsets to support Project-level objectives. The combined collection of data and tools allows PROTECT to work seamlessly across project domains and effectively ties environmental factors to human subject outcomes. Finally, Core D is leveraging state-of-the-art in search-based technologies that have been developed at Google to support search and Data Mining. To support Center goals and ensure its long-term impact, we will continue to build upon the rich infrastructure developed in the first three years of this Center. We have partnered with EarthSoft, a major developer of environmental data management software, to provide enhanced database capabilities appropriate for all Center projects. We will continue to support cleaning, indexing, documenting, and security of all Center-based data through a secure, online, database system. To allow data to be analyzed by each project, we will provide advanced statistical/analysis tools integrated into the backend of the database system. Specifically, we will work with Project 1 to incorporate appropriate biostatistics tools, with Project 2 to integrate toxicological data, with Project 3 to assist in nontargeted detection, with Project 4 to utilize appropriate environmental assessment tools and Geographic Information System (GIS), and with Project 5 to integrate and model remediation field data. To allow the data in the center to be easily understood by a range of communities, we are integrating mapping and data visualization capabilities and enabling effective dissemination of data to a broad audience. To achieve our Center-level objectives that tie environmental factors to health-related outcomes, Core D provides Data Mining, modeling and analysis for all projects by leveraging state-of-the-art computing capabilities. Core D provides a suite of Data Mining tools that go well beyond standard statistical techniques. Finally, we will work closely with the Community Engagement Core, Training and Research Translation cores to generate mapping and statistical information required by our partner cores to foster awareness, research translation, education, and high quality publications.

Public Health Relevance

The Data Management and Modeling Core (Core D) supports the management of all Project-level data in the Center, and provides sophisticated data analysis and modeling to answer Center-level questions for PROTECT. Core D has developed a rich cache of information that insures the long-term impact of the work in this Center and its ability to effectively report back to the affected community.

Agency
National Institute of Health (NIH)
Institute
National Institute of Environmental Health Sciences (NIEHS)
Type
Hazardous Substances Basic Research Grants Program (NIEHS) (P42)
Project #
2P42ES017198-05
Application #
8649394
Study Section
Special Emphasis Panel (ZES1-LKB-K (S))
Project Start
Project End
Budget Start
2014-04-01
Budget End
2015-03-31
Support Year
5
Fiscal Year
2014
Total Cost
$322,795
Indirect Cost
$112,809
Name
Northeastern University
Department
Type
DUNS #
001423631
City
Boston
State
MA
Country
United States
Zip Code
02115
Ferguson, Kelly K; Yu, Youfei; Cantonwine, David E et al. (2018) Foetal ultrasound measurement imputations based on growth curves versus multiple imputation chained equation (MICE). Paediatr Perinat Epidemiol 32:469-473
Boss, Jonathan; Zhai, Jingyi; Aung, Max T et al. (2018) Associations between mixtures of urinary phthalate metabolites with gestational age at delivery: a time to event analysis using summative phthalate risk scores. Environ Health 17:56
Aker, Amira M; Ferguson, Kelly K; Rosario, Zaira Y et al. (2018) The associations between prenatal exposure to triclocarban, phenols and parabens with gestational age and birth weight in northern Puerto Rico. Environ Res 169:41-51
Johns, Lauren E; Ferguson, Kelly K; Cantonwine, David E et al. (2018) Subclinical Changes in Maternal Thyroid Function Parameters in Pregnancy and Fetal Growth. J Clin Endocrinol Metab 103:1349-1358
Bedrosian, Leah D; Ferguson, Kelly K; Cantonwine, David E et al. (2018) Urinary phthalate metabolite concentrations in relation to levels of circulating matrix metalloproteinases in pregnant women. Sci Total Environ 613-614:1349-1352
Nazari, Roya; Raji?, Ljiljana; Xue, Yunfei et al. (2018) Degradation of 4-Chlorophenol in Aqueous Solution by Sono-Electro-Fenton Process. Int J Electrochem Sci 13:9214-9230
Zhou, Wei; Meng, Xiaoxiao; Rajic, Ljiljana et al. (2018) ""Floating"" cathode for efficient H2O2 electrogeneration applied to degradation of ibuprofen as a model pollutant. Electrochem commun 96:37-41
Ashrap, Pahriya; Watkins, Deborah J; Calafat, Antonia M et al. (2018) Elevated concentrations of urinary triclocarban, phenol and paraben among pregnant women in Northern Puerto Rico: Predictors and trends. Environ Int 121:990-1002
Ferguson, Kelly K; Meeker, John D; Cantonwine, David E et al. (2018) Environmental phenol associations with ultrasound and delivery measures of fetal growth. Environ Int 112:243-250
Cathey, Amber; Ferguson, Kelly K; McElrath, Thomas F et al. (2018) Distribution and predictors of urinary polycyclic aromatic hydrocarbon metabolites in two pregnancy cohort studies. Environ Pollut 232:556-562

Showing the most recent 10 out of 163 publications