Data Science Core ABSTRACT The objective of the Texas A&M Superfund Research Center is to explore and develop descriptive models and tools that can predict the possible hazardous outcomes of chemical exposure during environmental emergencies and to produce powerful solutions which can mitigate the negative effects on human health. The ultimate goal of the Center is to contribute to decision-making capabilities for planning and control in emergency environmental contamination events. The Data Science Core is one of the essential components of the Center that will contribute to achieving the goals of the Center by supporting the work of four challenging Research Projects. The projects will produce high-dimensional data that requires comprehensive analysis and expertise in state-of-the-art data science methodologies in order to translate raw experimental data into actionable insights and predictive models. Directed by Dr. Christodoulos A. Floudas and in collaboration with Co-investigator Dr. Fred A. Wright, the Data Science Core will provide numerous methods and services to the Center researchers under three specific aims: (i) by sharing expertise and providing support via advanced methodologies in data science and statistics; (ii) by developing high-performance, novel methods for simultaneous regression or classification with dimensionality reduction and data integration; and (iii) by constructing and maintaining a computational platform that will enable collaboration across the Center and facilitate dissemination of knowledge to the wider community and key stakeholders. Research Project 1 will characterize exposure pathways of contaminated sediments that are vulnerable to movement and re- deposition due to storm activity; the Data Science Core will provide services for experimental design, hypothesis testing, and regression for contaminated sediment binding experiments. Project 2 will study the mitigation of adverse health effects of chemicals through broad-acting sorption materials; the Data Science Core will utilize predictive modeling of sorption activity via advanced regression and simultaneous dimensionality reduction with nonlinear kernels to guide experimental design and material property identification. Project 3 will investigate the inter-tissue and inter-individual variability in response to complex environmental mixtures; the Data Science Core will apply composite classification and clustering strategies for characterization of chemical mixtures. Project 4 will develop single-cell, high-throughput platforms to quantify the endocrine disruptor potential of environmental contaminants and mixtures; the Data Science Core will aid in predicting the activity of multiple endocrine receptors through model construction and reduction of predictive models. Furthermore, the Data Science Core will maximize productivity within the Center by establishing an ideal environment for data sharing and collaboration via a computational platform service. The platform will also disseminate the results of the Center, including access to the final high-performance predictive models and tools, by providing interactive interfaces amenable for use by the scientific community.

Public Health Relevance

Data Science Core PROJECT NARRATIVE The Data Science Core of the Texas A&M Superfund Research Center serves as basis for translating the raw experimental data produced by the Research Projects into useful knowledge to the community via data collection, integration, quality control, analysis, and model generation. The Core will utilize state-of-the-art methods in data science, optimization and machine learning, develop and apply novel dimensionality reduction techniques, and establish a computational platform for collaboration within the Center and data dissemination to the Center stakeholders.

Agency
National Institute of Health (NIH)
Institute
National Institute of Environmental Health Sciences (NIEHS)
Type
Hazardous Substances Basic Research Grants Program (NIEHS) (P42)
Project #
1P42ES027704-01
Application #
9257876
Study Section
Special Emphasis Panel (ZES1)
Project Start
Project End
Budget Start
2017-09-01
Budget End
2018-03-31
Support Year
1
Fiscal Year
2017
Total Cost
Indirect Cost
Name
Texas A&M University
Department
Type
DUNS #
020271826
City
College Station
State
TX
Country
United States
Zip Code
77845
Shapiro, Andrew J; Antoni, Sébastien; Guyton, Kathryn Z et al. (2018) Software Tools to Facilitate Systematic Review Used for Cancer Hazard Identification. Environ Health Perspect 126:104501
Kim, Jun-Hyun; Li, Wei; Newman, Galen et al. (2018) The Influence of Urban Landscape Spatial Patterns on Single-Family Property Values. Environ Plan B Urban Anal City Sci 45:26-43
Klaren, William D; Rusyn, Ivan (2018) High-Content Assay Multiplexing for Muscle Toxicity Screening in Human-Induced Pluripotent Stem Cell-Derived Skeletal Myoblasts. Assay Drug Dev Technol 16:333-342
Nicora, Carrie D; Burnum-Johnson, Kristin E; Nakayasu, Ernesto S et al. (2018) The MPLEx Protocol for Multi-omic Analyses of Soil Samples. J Vis Exp :
Chiu, Weihsueh A; Guyton, Kathryn Z; Martin, Matthew T et al. (2018) Use of high-throughput in vitro toxicity screening data in cancer hazard evaluations by IARC Monograph Working Groups. ALTEX 35:51-64
Onel, Melis; Beykal, Burcu; Wang, Meichen et al. (2018) Optimal Chemical Grouping and Sorbent Material Design by Data Analysis, Modeling and Dimensionality Reduction Techniques. ESCAPE 43:421-426
Guyton, Kathryn Z; Rusyn, Ivan; Chiu, Weihsueh A et al. (2018) Application of the key characteristics of carcinogens in cancer hazard identification. Carcinogenesis 39:614-622
Li, Gen; Shabalin, Andrey A; Rusyn, Ivan et al. (2018) An empirical Bayes approach for multiple tissue eQTL analysis. Biostatistics 19:391-406
Zheng, Xueyun; Dupuis, Kevin T; Aly, Noor A et al. (2018) Utilizing ion mobility spectrometry and mass spectrometry for the analysis of polycyclic aromatic hydrocarbons, polychlorinated biphenyls, polybrominated diphenyl ethers and their metabolites. Anal Chim Acta 1037:265-273
MacLean, Brendan X; Pratt, Brian S; Egertson, Jarrett D et al. (2018) Using Skyline to Analyze Data-Containing Liquid Chromatography, Ion Mobility Spectrometry, and Mass Spectrometry Dimensions. J Am Soc Mass Spectrom 29:2182-2188

Showing the most recent 10 out of 56 publications