Skip Navigation
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.


The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Your Environment. Your Health.

Publication Detail

Title: Machine Learning Models for Estrogen Receptor Bioactivity and Endocrine Disruption Prediction.

Authors: Zorn, Kimberley M; Foil, Daniel H; Lane, Thomas R; Russo, Daniel P; Hillwalker, Wendy; Feifarek, David J; Jones, Frank; Klaren, William D; Brinkman, Ashley M; Ekins, Sean

Published In Environ Sci Technol, (2020 10 06)

Abstract: The U.S. Environmental Protection Agency (EPA) periodically releases in vitro data across a variety of targets, including the estrogen receptor (ER). In 2015, the EPA used these data to construct mathematical models of ER agonist and antagonist pathways to prioritize chemicals for endocrine disruption testing. However, mathematical models require in vitro data prior to predicting estrogenic activity, but machine learning methods are capable of prospective prediction from the molecular structure alone. The current study describes the generation and evaluation of Bayesian machine learning models grouped by the EPA's ER agonist pathway model using multiple data types with proprietary software, Assay Central. External predictions with three test sets of in vitro and in vivo reference chemicals with agonist activity classifications were compared to previous mathematical model publications. Training data sets were subjected to additional machine learning algorithms and compared with rank normalized scores of internal five-fold cross-validation statistics. External predictions were found to be comparable or superior to previous studies published by the EPA. When assessing six additional algorithms for the training data sets, Assay Central performed similarly at a reduced computational cost. This study demonstrates that machine learning can prioritize chemicals for future in vitro and in vivo testing of ER agonism.

PubMed ID: 32857505 Exiting the NIEHS site

MeSH Terms: Bayes Theorem; Endocrine Disruptors*/toxicity; Machine Learning; Prospective Studies; Receptors, Estrogen*

to Top