Core D: Quantitative Biology: Biostatistics, Bioinformatics, and Computation

Smith, Martyn

Abstract

The purpose of the Quantitative Biology Core is to provide investigators with consultative support in biostatistics/computational biology and bioinformatics, and to support web-based dissemination of bioinformatic solutions and database access.
Most specific aims with the projects produce high-dimensional biological and exposure data, and often involve complicated questions addressing the possible interaction of environmental exposures and high-dimensional measures of the genome, proteome, and other high throughput technologies. These high-dimensional data sets are characterized by many thousands of measurements made on each unit (e.g. person, yeast culture, soil community). Core D reflects an evolution in the field of biostatistics and bioinformatics towards developing methodologies that can both find patterns in high dimensional data sets as well as providing proper statistical inference for these patterns. A consensus among our project researches and the methodological experts has formed around a set of core principles regarding optimal estimation and inference in the context of complicated questions and high-dimensional data. Specifically, the consensus favors using (when possible): semi-parametric locally efficient estimation with robust inference and the development of optimal methods used to integrate the statistical results into existing metadata to suggest relevant biological pathways and networks. Applying this approach will enable analyses to incorporate diverse data to query similar patterns/pathways in both related toxins and possible related diseases thus substantially leveraging data generated by the Program. To implement this methodology, the Quantitative Biology Core will provide access to a computational environment that lends itself to the computationally intensive methods developed for data mining and re-sampling based inference. Because of the scale of the data collection as well as the desirability of converging to a general methodology, our Program requires a more centralized system that can both archive data for, provide sharing to this Core, guidance on the access of metadata/annotation and routines for leveraging such data to find overprinting of our results on existing hypothesized regulatory networks. The Core will also develop tools to find and compares pathway, and create and maintain a web-based system that will allow for both efficient sharing of our methodological expertise with the project researchers and ultimately serve as a tool for outreach among the general scientific community.

Public Health Relevance

Despite improvements in technology, the lack of statistical rigor among the proliferating methods used to discover disease etiology and develop effective interventions are producing large numbers of false positive claims. However, by using methods that optimally balance the complexity of models with the need to provide inferences consistent with the amount of data available, one can avoid wild goose chases engendered by false discoveries.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of Environmental Health Sciences (NIEHS)
Type: Hazardous Substances Basic Research Grants Program (NIEHS) (P42)
Project #: 5P42ES004705-28
Application #: 8838133
Study Section: Special Emphasis Panel (ZES1-SET-V)

Project Start
Project End: 2017-03-31
Budget Start: 2015-04-01
Budget End: 2016-03-31
Support Year: 28
Fiscal Year: 2015
Total Cost: $198,673
Indirect Cost: $67,875

Institution

Name: University of California Berkeley
Department
Type
DUNS #: 124726725

City: Berkeley
State: CA
Country: United States
Zip Code: 94704

Related projects

Publications

Rappaport, Stephen M (2018) Redefining environmental exposure for disease etiology. NPJ Syst Biol Appl 4:30

Tachachartvanich, Phum; Sangsuwan, Rapeepat; Ruiz, Heather S et al. (2018) Assessment of the Endocrine-Disrupting Effects of Trichloroethylene and Its Metabolites Using in Vitro and in Silico Approaches. Environ Sci Technol 52:1542-1550

Guyton, Kathryn Z; Rieswijk, Linda; Wang, Amy et al. (2018) Key Characteristics Approach to Carcinogenic Hazard Identification. Chem Res Toxicol :

Roh, Taehyun; Steinmaus, Craig; Marshall, Guillermo et al. (2018) Age at Exposure to Arsenic in Water and Mortality 30-40 Years After Exposure Cessation. Am J Epidemiol 187:2297-2305

Daniels, Sarah I; Chambers, John C; Sanchez, Sylvia S et al. (2018) Elevated Levels of Organochlorine Pesticides in South Asian Immigrants Are Associated With an Increased Risk of Diabetes. J Endocr Soc 2:832-841

Guyton, Kathryn Z; Rusyn, Ivan; Chiu, Weihsueh A et al. (2018) Application of the key characteristics of carcinogens in cancer hazard identification. Carcinogenesis 39:614-622

Grigoryan, Hasmik; Edmands, William M B; Lan, Qing et al. (2018) Adductomic signatures of benzene exposure provide insights into cancer induction. Carcinogenesis 39:661-668

Barazesh, James M; Prasse, Carsten; Wenk, Jannis et al. (2018) Trace Element Removal in Distributed Drinking Water Treatment Systems by Cathodic H2O2 Production and UV Photolysis. Environ Sci Technol 52:195-204

Counihan, Jessica L; Wiggenhorn, Amanda L; Anderson, Kimberly E et al. (2018) Chemoproteomics-Enabled Covalent Ligand Screening Reveals ALDH3A1 as a Lung Cancer Therapy Target. ACS Chem Biol 13:1970-1977

Lavy, Adi; Keren, Ray; Yu, Ke et al. (2018) A novel Chromatiales bacterium is a potential sulfide oxidizer in multiple orders of marine sponges. Environ Microbiol 20:800-814

Showing the most recent 10 out of 629 publications

Comments

Be the first to comment on Martyn Smith's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: