Abstract
1. Unexpected metabolism could lead to the failure of many late-stage drug candidates or even the withdrawal of approved drugs. Thus, it is critical to predict and study the dominant routes of metabolism in the early stages of research. In this study, we describe the development and validation of a ‘WhichEnzyme’ model that accurately predicts the enzyme families most likely to be responsible for a drug-like molecule’s metabolism. Furthermore, we combine this model with our previously published regioselectivity models for Cytochromes P450, Aldehyde Oxidases, Flavin-containing Monooxygenases, UDP-glucuronosyltransferases and Sulfotransferases – the most important Phase I and Phase II drug metabolising enzymes – and a ‘WhichP450’ model that predicts the Cytochrome P450 isoform(s) responsible for a compound’s metabolism. The regioselectivity models are based on a mechanistic understanding of these enzymes’ actions, and use quantum mechanical simulations with machine learning methods to accurately predict sites of metabolism and the resulting metabolites. We train heuristic based on the outputs of the ‘WhichEnzyme’, ‘WhichP450’, and regioselectivity models to determine the most likely routes of metabolism and metabolites to be observed experimentally. Finally, we demonstrate that this combination delivers high sensitivity in identifying experimentally reported metabolites and higher precision than other methods for predicting in vivo metabolite profiles.
Disclaimer
As a service to authors and researchers we are providing this version of an accepted manuscript (AM). Copyediting, typesetting, and review of the resulting proofs will be undertaken on this manuscript before final publication of the Version of Record (VoR). During production and pre-press, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal relate to these versions also.Funding
The author(s) reported there is no funding associated with the work featured in this article.