147
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Optimal selection of learning data for highly accurate QSAR prediction of chemical biodegradability: a machine learning-based approach

ORCID Icon, , & ORCID Icon
Pages 729-743 | Received 13 Jun 2023, Accepted 19 Aug 2023, Published online: 07 Sep 2023
 

ABSTRACT

Prior to the manufacture of new chemicals, regulations mandate a thorough review of the chemicals under risk management. This review involves evaluating their effects on the environment and human health. To assess these effects, a review report that conforms to the OECD Test Guidelines must be submitted to the regulatory body. One of the essential components of the report is an assessment of the biodegradability of chemicals in the environment. In addition to conventional methods, quantitative structure-activity relationship (QSAR) models have been developed to predict the properties of chemicals based on their structural features. Although a greater number of chemicals in the learning set may enhance the prediction accuracy, it may also lead to a decrease in accuracy due to the mixing of different structural features and properties of the chemicals. To improve the prediction performance, it is recommended to use only the appropriate data for biodegradability prediction as a training set. In this study, we propose a novel approach for the optimal selection of training set that enables a highly accurate prediction of the biodegradability of chemicals by QSAR. Our findings indicate that the proposed method effectively reduces the root mean squared error and improves the prediction accuracy.

Acknowledgements

This study was conducted under the research project, ‘Research on the Introduction of Weight of Evidence Approach to the Evaluations of Biodegradation and Bioaccumulation of Chemicals (2022)’, funded by Ministry of Economy, Trade and Industry of Japan.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplementary material

Supplemental data for this article can be accessed at: https://doi.org/10.1080/1062936X.2023.2251889

Additional information

Funding

This work was supported by the Ministry of Economy, Trade and Industry of Japan.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.