Thousands of chemicals in wide commercial use have not been tested for adverse effects on humans, but are present in the environment. Accordingly, there is a need to improve chemical prioritization for in vivo toxicity testing and, ultimately, to find cell-based alternatives for evaluating the large inventory of potentially harmful substances. Quantitative high throughput screening (qHTS) assays are multiple-concentration experiments with an important role in the efforts of the National Toxicology Program to meet these testing challenges and advance toxicology from a predominantly observational science to a predominantly predictive science. qHTS can simultaneously assay thousands of chemicals over a wide chemical space with reduced cost per substance. Previous approaches for making activity calls from qHTS data were based on pharmaceutical applications seeking to minimize false positives and usually relied on heuristics rather than statistical tests to make activity calls. For that reason, we developed a three-stage algorithm to classify substances from qHTS data into statistically supported activity categories relevant to toxicological evaluation, seeking to improve sensitivity while minimizing Type I error rate (Shockley, 2012). The first stage of our approach fits a four-parameter Hill equation to find active substances with a robust concentration-response profile within the tested concentration range. The robust criterion specifies that response profiles are statistically significant using both unweighted and weighted non-linear least squares (NLS and WNLS) regression. NLS weights all data points equally and, consequently, may not discriminate between a profile with data along both asymptotes and a profile supported by a single point. WNLS weights each response point based on 1/s2, where s is the sample standard deviation estimated from all response data within a defined concentration range containing the response point of interest, so that more influence is given to neighboring data points with similar response levels than neighboring data points with very different responses. The second stage finds relatively potent substances with substantial activity at the lowest tested concentration, substances not captured in the first stage. The third and final stage separates statistically significant profiles from responses that lack statistically compelling support, or inactives. This framework accommodates large volumes of qHTS data, tolerates missing data, and does not require replicate measurements. We evaluated this three-stage classification algorithm via extensive simulations (Shockley, 2012). The area under receiver operating characteristic curves (AUC) was used to assess performance. Using AUC statistics, our algorithm outperformed overall F-tests comparing the fit of the Hill equation to a horizontal line (no response) when the concentration for half maximal response (AC50) was less than 0.1micro molar. It also outperformed t-test approaches in detecting known actives when the AC50 was greater than 0.001 micro molar. The three-stage decision strategy yielded good (AUC ≥0.75) to high (AUC ≥0.9) performance for 14 point concentration-response curves when the response was in the detectable region of the simulated assay (>25% of the positive control). Our approach was able to detect relatively potent substances (e.g., AC50 = 0.001 micro molar) with as few as 4 data points when the tested response was at least 50% of the positive control response.

Project Start
Project End
Budget Start
Budget End
Support Year
3
Fiscal Year
2012
Total Cost
$323,905
Indirect Cost
City
State
Country
Zip Code
Shockley, Keith R (2016) Estimating Potency in High-Throughput Screening Experiments by Maximizing the Rate of Change in Weighted Shannon Entropy. Sci Rep 6:27897
Pei, Ying; Peng, Jun; Behl, Mamta et al. (2016) Comparative neurotoxicity screening in human iPSC-derived neural stem cells, neurons and astrocytes. Brain Res 1638:57-73
Chen, Shiuan; Hsieh, Jui-Hua; Huang, Ruili et al. (2015) Cell-Based High-Throughput Screening for Aromatase Inhibitors in the Tox21 10K Library. Toxicol Sci 147:446-57
Shockley, Keith R (2015) Quantitative high-throughput screening data analysis: challenges and recent advances. Drug Discov Today 20:296-300
Ray, Mitas; Shockley, Keith; Kissling, Grace (2014) Minimizing Systematic Errors in Quantitative High Throughput Screening Data Using Standardization, Background Subtraction, and Non-Parametric Regression. J Exp Second Sci 3:
Huang, Ruili; Sakamuru, Srilatha; Martin, Matt T et al. (2014) Profiling of the Tox21 10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep 4:5664
Shockley, Keith R (2014) Using weighted entropy to rank chemicals in quantitative high-throughput screening experiments. J Biomol Screen 19:344-53
Teng, Christina; Goodwin, Bonnie; Shockley, Keith et al. (2013) Bisphenol A affects androgen receptor function via multiple mechanisms. Chem Biol Interact 203:556-64
Shockley, Keith R (2012) A three-stage algorithm to make toxicologically relevant activity calls from quantitative high throughput screening data. Environ Health Perspect 120:1107-15