Search in:

SAR and QSAR in Environmental Research Volume 32, 2021 - Issue 2

Submit an article Journal homepage

Open access

1,835

Views

CrossRef citations to date

Altmetric

Research Article

Fish early life stage toxicity prediction from acute daphnid toxicity and quantum chemistry

S. SchmidtEnvironmental Safety, Crop Science Division, Bayer AG, Monheim, GermanyCorrespondence[email protected]

https://orcid.org/0000-0002-1364-5436

M. SchindlerEnvironmental Safety, Crop Science Division, Bayer AG, Monheim, Germany

https://orcid.org/0000-0002-5141-636X

D. FaberEnvironmental Safety, Crop Science Division, Bayer AG, Monheim, Germany

J. HagerEnvironmental Safety, Crop Science Division, Bayer AG, Monheim, Germany

Pages 151-174 | Received 06 Nov 2020, Accepted 07 Jan 2021, Published online: 02 Feb 2021

Cite this article
https://doi.org/10.1080/1062936X.2021.1874514
CrossMark

Full Article
Figures & data
References
Supplemental
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

Figures & data

Figure 1. Distribution of pNOEC values as counts and normalized distribution functions a) for the chemicals in the training, DEV, and VAL set as defined in the Model development section, and b) according to their main pesticidal indication

Table 1. Comparison of descriptor packages and their performance in training (r²) and cross-validation (q²) for a less complex model (3 latent variables (LVs) with 5 descriptors each) and a more complex model (5 LVs without regularization)

Download CSV Display Table

Figure 2. Goodness of fit (r²) and cross-validated q² of sPLS models with increasing numbers of latent variables (LV) and descriptors per LV from the packages CDK, RDKit, PaDEL, Dragon, QM, and pEC50 daphnid

Figure 2. Goodness of fit (r2) and cross-validated q2 of sPLS models with increasing numbers of latent variables (LV) and descriptors per LV from the packages CDK, RDKit, PaDEL, Dragon, QM, and pEC50 daphnid

Figure 3. Average similarity to the 5 most similar compounds from the training set, based on Tanimoto distance of Unity fingerprints. For the training (left) and DEV (right) sets, all compounds from the training set were considered, while for the cross-validation (middle) only compounds from the other CV-batches were considered

Table 2. Model performance summary along the major development steps, starting from all descriptors (CDK, RDKit, PaDEL, Dragon, QM and pEC50 daphnid) and gradually refining the model while reducing its complexity

Download CSV Display Table

Table 3. Model coefficients (c_i) and descriptors for use with Equationequation 3(3) $p N O E C = - log (N O E C [m M]) = a + \sum c_{i} d_{i}$ (3) and model M6 (for more detailed descriptions see Table S2 in the Supporting Information)

Download CSV Display Table

Figure 4. Loadings plot of the 9 descriptors and pNOEC for the two latent variables in model M6. See descriptor explanations in Table 3

Figure 5. Model M6: Predictions with confidence intervals vs. experiment, for the training set compounds (a) and test sets (b), coloured in red (DEV), blue (VAL), and grey (EXT)

Figure 6. Error distribution of pNOEC predictions by model M6 as counts and normalized distribution functions a) according to set membership and b) for pesticide classes

Figure 7. Error distribution of pNOEC predictions for pesticides by model M6, coloured according to mode of action (for an explanation of MoA acronyms see SI Table S8, and Figures S6-S8 for histograms on individual MoAs)

Figure 8. Cook’s distances for the training and test sets (n = 338) for PLS model M6

Figure 9. Williams plot with 5 influential training set compounds from Cook’s plot marked

Figure 10. Representation of the chemical space covered by the full data set with 338 molecules. We used a t-stochastic neighbour embedding (tSNE) based on Tanimoto distances to embed chemical similarities in two dimensions. Training set (orange) and test set (blue) compounds of the models by Furuhama et al. highlighted in colour

Table 4. Comparison of QSAAR model performance with models by Furuhama et al. [Citation15]

Download CSV Display Table

A. Furuhama, T.I. Hayashi, and H. Yamamoto, Development of models to predict fish early-life stage toxicity from acute Daphnia magna toxicity, SAR QSAR Environ. Res. 29 (2018), pp. 725–742. doi:10.1080/1062936X.2018.1513423.

PubMed Web of Science ®Google Scholar

Supplemental material

Supplemental Material

Download PDF (1.1 MB)

Data availability Statement

The data that support the findings of this study are openly available via figshare at http://doi.org/10.6084/m9.figshare.c.5194022. The model workflow is available in KNIME Hub at https://kni.me/w/CEyXPUo_n1i4pUiR.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Fish early life stage toxicity prediction from acute daphnid toxicity and quantum chemistry

Table 1. Comparison of descriptor packages and their performance in training (r²) and cross-validation (q²) for a less complex model (3 latent variables (LVs) with 5 descriptors each) and a more complex model (5 LVs without regularization)

Table 2. Model performance summary along the major development steps, starting from all descriptors (CDK, RDKit, PaDEL, Dragon, QM and pEC50 daphnid) and gradually refining the model while reducing its complexity

Table 3. Model coefficients (c_i) and descriptors for use with Equationequation 3(3) $p N O E C = - log (N O E C [m M]) = a + \sum c_{i} d_{i}$ (3) and model M6 (for more detailed descriptions see Table S2 in the Supporting Information)

Table 4. Comparison of QSAAR model performance with models by Furuhama et al. [Citation15]

Supplemental Material

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

Fish early life stage toxicity prediction from acute daphnid toxicity and quantum chemistry

Figures & data

Table 1. Comparison of descriptor packages and their performance in training (r2) and cross-validation (q2) for a less complex model (3 latent variables (LVs) with 5 descriptors each) and a more complex model (5 LVs without regularization)

Table 2. Model performance summary along the major development steps, starting from all descriptors (CDK, RDKit, PaDEL, Dragon, QM and pEC50 daphnid) and gradually refining the model while reducing its complexity

Table 3. Model coefficients (ci) and descriptors for use with Equationequation 3(3) pNOEC= −logNOEC mM=a+∑cidi(3) and model M6 (for more detailed descriptions see Table S2 in the Supporting Information)

Table 4. Comparison of QSAAR model performance with models by Furuhama et al. [Citation15]

Supplemental Material

Data availability Statement

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Table 1. Comparison of descriptor packages and their performance in training (r²) and cross-validation (q²) for a less complex model (3 latent variables (LVs) with 5 descriptors each) and a more complex model (5 LVs without regularization)

Table 3. Model coefficients (c_i) and descriptors for use with Equationequation 3(3) $p N O E C = - log (N O E C [m M]) = a + \sum c_{i} d_{i}$ (3) and model M6 (for more detailed descriptions see Table S2 in the Supporting Information)