935
Views
12
CrossRef citations to date
0
Altmetric
Research Articles (GP2A Conference)

Prediction of acute toxicity in fish by using QSAR methods and chemical modes of action

, , , , &
Pages 195-203 | Received 31 Oct 2008, Accepted 20 Feb 2009, Published online: 29 Oct 2009

Abstract

Three quantitative structure–activity relationship (QSAR) models were evaluated for their power to predict the toxicity of chemicals in two datasets: (1) EPAFHM (US Environmental Protection Agency—Fathead Minnow) and (2) derivatives having a high production volume (HPV), as compiled by the European Chemical Bureau. For all three QSAR models, the quality of the predictions was found to be highly dependent on the mode of action of the chemicals. An analysis of outliers from the three models gives some clues for improving the QSAR models. Two classification methods, Toxtree and a Bayesian approach with fingerprints as descriptors, were also analyzed. Predictions following the Toxtree classification for narcosis were good, especially for the HPV set. The learning model (Bayesian approach) produced interesting results for the EPAFHM dataset but gave lower quality predictions for the HPV set.

Introduction

Achieving a high level of environmental protection against the adverse effects of chemicals requires an effective chemical regulatory policy. Approximately 100,000 different chemical substances are registered in the European Union (EU), of which about 30,000 are manufactured or imported in quantities greater than one metric ton per yearCitation1. While experimental physicochemical properties and toxicity data are available for “new chemicals” (developed since 1981), there are no such provisions for “existing chemicals” (developed before 1981), even though they make up almost 99% of the total volume in the EU market. Therefore, adequate toxicological and ecotoxicological data are available only for a very small proportion of chemicals. To address this lack of health and safety data, the European Union recently adopted a new regulation, called REACHCitation2,Citation3 (Registration, Evaluation, Authorization and Restriction of Chemicals). In REACH, all compounds manufactured or imported into the EU in quantities greater than one metric ton per year are required to be registered in a central database. Registration includes providing information concerning the effects of the chemical on human health and the environment. Under REACH, the requirements in terms of aquatic toxicity call for data from short-term toxicity testing on invertebrates and growth inhibition studies on aquatic plants. In addition, short-term toxicity data on fish are required for chemicals manufactured/imported in quantities greater than 10 metric tons per year. Traditionally, such information has been obtained through in vivo animal testing; however, under REACH legislation, quantitative structure–activity relationship (QSAR) models and in vitro methods as alternatives to animal experiments are expected to play a significant role. (Q)SARs will be used more extensively, in the interests of time- and cost- effectiveness and animal welfare. In particular, (Q)SARs are likely to play an important role in the assessment of chemicals produced or imported in quantities between 1 and 10 metric tons per year, for which minimal animal testing is foreseenCitation3,Citation4. In aquatic ecotoxicology, QSARs allow, for example, an estimation of toxic concentrations based on physical and/or chemical properties, without the use of experiments.

A number of conditions still need to be met in order that (Q)SAR results can provide an acceptable alternative to experimental data. These are: (1) a defined endpoint; (2) an unambiguous algorithm; (3) a defined domain of applicability; (4) appropriate measures of goodness-of-fit, robustness, and predictivity; and (5) a mechanistic interpretation, if possibleCitation5. No single QSAR model yet fulfills all of these criteria, and incorrect selection of a QSAR model can result in 10- to 1000-fold errors in toxic potency estimatesCitation6. Therefore, a key factor in the use of QSARs for predicting chemical toxicity is the choice of QSAR model.

Traditionally, the selection of structural analogs or QSARs has been based on the assumption that compounds from the same ‘chemical class’ should behave in a toxicologically similar manner. However, identification of chemical classes could be problematic because chemicals often carry different chemical moieties, thereby confounding efforts to achieve a meaningful classification. Moreover, within the QSAR programs themselves, new chemicals are per definition not included in the training set, so it is difficult to assign a good QSAR model to such chemicals. As a result, another approach consists of defining a general model based on a maximum number of chemicals and/or focusing on the notion of mode of toxic actionCitation7,Citation8. Various modes of toxic action for chemicals in fish have been identified and extended to other aquatic organisms. Currently, the Verharr scheme seems to be the most widely used, recognizing modes of action associated with different structural classes: class 1—inert chemicals associated with non polar narcosis; class 2—relatively inert chemicals associated with polar narcosis; class 3—reactive chemicals associated with enhanced toxicity; and class 4—active chemicals associated with a central nervous system effectCitation7. In the context of acute toxicity to fish, three groups of QSAR models have been reported in the literature: models for narcoses, global QSARs (developed without respect to chemical class and mode of action (MOA)), and QSARs for specific chemical classes.

Considering the volume of measurements to be undertaken, the lack of experience with implementation of such QSAR test policies, and the need for making informed choices about the multiple available QSAR models (which should eventually be available to all stakeholders in the REACH process), it is necessary to evaluate the accuracy of prediction of various models when they are used as a “black box,” without clear guidance on when to use which QSAR. The present article compares the predictive quality of three QSAR models applied to a database of 617 compounds, the Environmental Protection Agency (EPA) Fathead Minnow Acute Toxicity database. This database was generated by the US Environmental Protection Agency with the goal of developing an expert system to predict acute toxicity from chemical structure based on a chemical’s MOA. The 617 industrial chemicals were expressly chosen to serve as a useful training set for the development of predictive QSARs. The three QSAR models investigated here are: (1) a model developed in our laboratory, referred to as CERMNCitation9; (2) the TOPKAT expert systemCitation10; and (3) a model developed by the European Chemicals Bureau (ECB)Citation11.

Materials and methods

Data set

A set of 617 chemicals was extracted from the EPA siteCitation12. The record for each chemical contains its name, mode of action (MOA)Citation6, molecular weight (MW), hydrophobicity (logPOW), and biological data corresponding to 96-h LC50 bioassays for the fathead minnow (Pimephales promelas). Chemicals without logPOW and/or LC50 values were discarded, leading to a set of 566 chemical derivatives. Hereafter, we refer to this set as the EPAFHM set. This dataset was the basis for the Bayesian categorizations (see “Learning molecular categories” below).

A second set was created with ECB-high production volume (HPV)Citation11 chemicals. This set was used as a test set after the classification as baseline narcosis (Bayesian categorizations). In total, 1749 substances from over 2782 structures described in the ECB-HPV dataset were downloaded from the ECB site. Of these 1749 derivatives, 225 derivatives had 96-h LC50 value(s) for the fathead minnow recorded and were considered in this study (the lowest LC50 value was taken if several values were given for one derivative).

QSAR models

For the prediction of the biological response (96-h LC50 (mol/L)) toward the fathead minnow, four QSAR models were first tested.

TOPKATCitation10 is a QSAR-based system which generates assessments of chemical toxicity solely from a chemical’s molecular structure. TOPKAT uses cross-validated models based on experimental data.

The ECB modelCitation13 is a model (Equation (1)) associated with narcosis as the MOA based on one hydrophobic descriptor (logPOW):

In addition to the ECB model, Netzeva et al.Citation14 published a general model (regardless of MOA) for toxicity in the fathead minnow (Equation (2)) using the same previous descriptor but with a lower intercept value (higher basic toxicity) and a slight decrease of the slope relative to logPOW:

Our QSAR modelCitation9, referred to below as CERMN, is also a general model (Equation (3)), incorporating logPOW and two other descriptors corresponding to steric (MW) and electronic descriptors (ELUMO):

Calculation of descriptors and regressions

Hydrophobicity or octanol–water partition coefficient (logPOW)

In EPAFHM, the values of logPOW correspond to experimental values or to calculated (ClogPCitation15) values when experimental ones are missing. For the testing set, logPOW values were calculated by KOWWINCitation16. This latter software gave either experimental logPOW values if they were recorded in KOWWIN’s internal database or estimated values in other cases. For EPAFHM, the correlation between ClogP and KOWWIN values was very good (r2 = 0.96).

Quantum descriptors

LUMO (lowest unoccupied molecular orbital energy) values were calculated for each chemical with Pipeline PilotCitation17 using VAMP, a semi-empirical molecular orbital package, and AM1 for Hamiltonian.

Pipeline Pilot fingerprints (ECFP_12)

Extended Chemical Fingerprint 12, called ECFP_12, is a class of circular substructural fingerprintCitation18. The ECFP_12 fingerprint encodes each atom and its molecular environment within a circle with a maximal diameter of 12 chemical bonds. The fingerprint was recorded into a fixed length of 1024 bits.

Regression methods

All the regressions were done in Microsoft Excel. The correlation coefficient r2 represents the quality of the regression between two variables (how much the variance of one variable is accounted for by the predictive power of the other variables). The standard deviation s characterizes the width of the distribution of the predicted values by the regression. The greater is the standard deviation, the more widely distributed are the values and the worse is the predictive power. The limit for outliers was defined from the s value (data normally distributed). The n values correspond to the number of chemicals used to calculate the correlation.

Classification tools

ToxtreeCitation19

Toxtree is an open-source application that places chemicals into categories and predicts various kinds of toxic effects by applying decision tree approaches. Verhaar’s schemeCitation7, for aquatic MOAs, classifies chemicals into five classes: (1) narcosis or baseline toxicity; (2) less inert compounds; (3) nonspecific reactivity; (4) compounds acting by a specific mechanism; and (5) unclassified compounds.

Learning molecular categoriesCitation17

This protocol was built using multiple Bayesian categorization equations within a single model, using a property that lists the categories. The EPAFHM set is divided into nine clear MOAs (see ). For each MOA, the confidence degree toward this classification is expressed from “High” to “Low.” Compounds for which the MOA confidence had “High” and “High-Moderate” probabilities for each MOA were used as the training set. AlogP, molecular weight, the number of H donors and acceptors, the number of rotatable bonds, the molecular polar surface area, and ECFP_12 fingerprint were used for descriptorsCitation17.

Table 1. Classification of EPAFHM derivatives based on mode of action (MOA). Compounds with “High” or “High-Moderate” probabilities for each MOA were used as the training set for defining the categories.

Results and discussion

Choice of ECB model

Two different QSARs with logPOW as the single descriptor could be used in principle (Equation (1) for narcosis and Equation (2) for a general model). The logLC50 values predicted by the two methods are completely correlated. However, as expected, the toxicity values predicted by the general model (Equation (2)) are in most cases higher than those predicted by Equation (1), although the observed differences are smaller than the level of accuracy. Therefore, we arbitrarily chose the narcosis model (Equation (1)) for the following studies.

Quality of the predictions

The statistical results for the three studies are presented in . The ECB model is slightly inferior to the TOPKAT and CERMN models. However, TOPKAT () could predict only 492 chemicals out of 566 derivatives (EPAFHM). Also, the graphical representations of the relationships between the real and predicted values show some discrepancies between the models. Moreover, we note that the compounds excluded by TOPKAT are generally well predicted by the two other models (black full circles, and ).

Table 2. Statistical results for the studies.

Figure 1. Predicted logLC50 vs. real logLC5050 for TOPKAT.

Figure 1.  Predicted logLC50 vs. real logLC5050 for TOPKAT.

Figure 2. Predicted logLC50 vs. real logLC50 for CERMN. Black full circles represent the chemicals not predicted with TOPKAT.

Figure 2.  Predicted logLC50 vs. real logLC50 for CERMN. Black full circles represent the chemicals not predicted with TOPKAT.

Figure 3. Predicted logLC50 vs. real logLC50 for ECB. Black full circles represent the chemicals not predicted with TOPKAT.

Figure 3.  Predicted logLC50 vs. real logLC50 for ECB. Black full circles represent the chemicals not predicted with TOPKAT.

Outliers

Several compounds with high under- and overestimated values appeared (outliers). We selected a cutoff (dashed line) at 2.56s (see for s values) to point out these outliers. For the ECB and CERMN models, only one derivative was overestimated for its toxicity ( and ); in contrast, TOPKAT () overestimated five derivatives. Concerning the underestimated outliers, TOPKAT generated the fewest outliers with eight derivatives; CERMN generated 12 outliers and ECB 15. Three of the underestimated outliers (4, 6, and 7) are common to all models (). Eleven of the underestimated outliers are common () to the CERMN and ECB models.

Table 3. Descriptions of the outliers produced by the models.

Outliers with overestimated values. LogPOW is the main descriptor in the CERMN and ECB models. An incorrect prediction of logPOW could be one explanation for the observed overestimates. This could be the case for 22. Indeed, this compound has a calculated value (ClogP) of 2.46 in EPAFHM but with KOWWIN the prediction is–4.15. The experimentally determined logLC50 value for 22 is–0.45 (logLC50 in mol/L). Starting from the KOWWIN prediction, the predicted LC50 value would be 1.62 for ECB (–3.73 initially with ClogP) and–0.51 for CERMN (–3.88 with ClogP). Thus, the prediction of the CERMN model is very good for this derivative using the KOWWIN calculation for logPOW. Experimental determination of logPOW should be done to clarify this point. At this stage, we presume that the KOWWIN value is closer to the real logPOW value for 22. The other overestimated values generated by TOPKAT correspond to nitriles (9, 11, and 12). TOPKAT must consider these derivatives to be very toxic, presumably as a result of release of the cyanide group (we note that 18 and 19 are not outliers for TOPKAT). However, in our opinion, without activation of one proton of the alpha carbon of the nitrile group, the formation of cyanide is less probable (see below for a discussion on this point). For the two other derivatives (10, 13) we have no explanation. TOPKAT could estimate a toxicity based on the action of amine oxidase on 10 as described for some derivativesCitation20. The toxicity assigned to 13 is more surprising in light of the interest of propargyl amine in neuroprotection, for instanceCitation21.

Outliers with underestimated values. Several of the outliers correspond to electrophile or proelectrophile reactants. Among them, aldehydes such as 14 (acrolein) are systematic outliers in several QSAR studies. The toxic potency of acrolein, mediated by a Michael addition toward nucleophiles, is very high and often underestimated. Compounds 4, 6, 7, 15, and 16 are also outliers, and their mechanism of toxicity could be similar to that of α,β unsaturated aldehyde. Indeed, the metabolites of 16 have been studiedCitation22, and they result from oxidation of the alcohol leading to aldehyde and carboxylic acid. Thus, the metabolites of 16 should have the same chemical properties as acrolein. The alcohol function of 15 could be oxidized leading straight to acrolein. The metabolites of 6 (1,3-dichloropropene) were described by the US EPACitation23. The formation of an alcohol followed by oxidation led to 3-chloroacrolein. We can suppose that compound 4 has the same type of metabolites, with easy substitution of one chlorine atom by a hydroxyl group followed by oxidation. Compound 7 has the chemical features of an α,β unsaturated aldehyde. Compound 8 could lead to 1,4-dibenzaldehyde. We have no information about whether the toxicity of this last derivative is related to α,β unsaturated aldehyde, with an activation of the phenyl group by the two electron-withdrawing groups. However, the graphic representation of the relationships between these derivatives and logPOW shows a good statistical relationship if we discard acrolein (because of its underestimated toxicity):

We observed that four other derivatives (see ) are close to the cutoff value. With these derivatives, the following equation was obtained:

Table 4. Description of chemicals used to create the model with Equation (5).

With this class of derivatives, we have a uniformly high toxicity (value of intercept) and the toxicity increases with their ADME (adsorption, distribution, metabolism, excretion) properties (logPOW and metabolism).

As for the other outliers, 17, a hydrazine derivative, is described as being very reactiveCitation24, although the mechanism of its toxicity is not actually known. For halonitrile, 18, and dicyanomethane, 19, their high toxicities should derive from the oxidation of a CH bond (activation with chlorine or nitrile group) followed by the release of HCNCitation25, a potent inhibitor of cytochrome c oxidase. For 21, the rapid formation of phenol must be the first step (substitution of the chlorines by hydroxyl groups). Oxidation of one of the two phenol groups could lead to a very reactive derivative. The toxicities of nitrobenzene are also higher than either the baseline or polar narcosis MOA would suggestCitation26. These two factors could explain the toxicity of 21. Compounds 23, 24, 27, and 1 correspond to acetyl cholinesterase inhibitors. Therefore, their predicted toxicities could be improved by considering a QSAR model specific for this class of derivatives. Compounds 20, 25, and 26 are flucythrinate, strychnine, and rotenone, respectively, well-known poisons used as pesticides. Their MOAs are very specific and their toxicities cannot be reliably estimated by QSAR models such as ECB or CERMN. An expert system such as TOPKAT, with a specific database, is more powerful. The toxicity of 5 was correctly predicted starting from Equation (5). We have no clear explanation for the mechanism of toxicity of diethyl chloromalonate, 2. For 2-allyl phenol, 3, used as a fungicideCitation27, its action was reported to be mediated by glutathione S-transferase interactions (GSTP1, GSTM1, GSTA1)Citation28.

Quality of the prediction in relation to mode of action

The derivatives from the EPAFHM dataset were classified into nine MOAs (). For this part of the study we focused on the following MOAs: baseline narcosis; arylate/ester narcosis; polar narcosis; electrophile or proelectrophile reactivity (reactants). Arylate/ester narcosis and polar narcosis MOAs were combined into a single group called “multiple narcosis.” The statistical results from the same predictive models (TOPKAT, ECB, and CERMN) are summarized in . The results of the TOPKAT model for baseline narcosis in relation to overestimation (see ) of the toxicity are described above for three derivatives (11, 12, and 13). The CERMN model produced a very good correlation; however, we observed a slight overestimation of toxicity of the compounds (). The quality of the correlation decreased in all three models for the MOAs multiple narcosis and reactants. TOPKAT predicted fewer derivatives without giving better predictions. From these initial data, a very high correlation was obtained for derivatives with an MOA of baseline narcosis. Therefore, we sought to determine whether we could classify a derivative as belonging to a specific MOA, and particularly to baseline narcosis, before a prediction of its toxicity was made. For this study, two different datasets were used: (1) EPAFHM, to analyze relationships between the described and predicted classifications; and (2) the HPV dataset, to analyze relationships between the predicted and real biological data starting from the supposed classification.

Table 5. Quality of the prediction models for the three principal MOAs.

Figure 4. Real vs. predicted logLC50 values (baseline narcosis for MOA) with TOPKAT.

Figure 4.  Real vs. predicted logLC50 values (baseline narcosis for MOA) with TOPKAT.

Figure 5. Real vs. predicted logLC50 values (baseline narcosis for MOA) with the CERMN model.

Figure 5.  Real vs. predicted logLC50 values (baseline narcosis for MOA) with the CERMN model.

Classification with Toxtree software

Toxtree (Verhaar scheme) classified the chemicals into five groups (). We were particularly interested in the narcosis group. The initial correlation between the real and predicted values (generated by the CERMN model) for the HPV set of 225 derivatives, without classification, was 0.37 (r2). Ninety-nine EPAFHM derivatives and 57 HPV derivatives were classified as having a narcosis MOA. From these derivatives, the correlation between real and predicted values with the CERMN model was 0.87 (r2) for EPAFHM and 0.72 for HPV. Therefore, Toxtree is able to extract and correctly predict (starting from the CERMN model) derivatives with a potential baseline narcosis for the MOA, but the statistical relationship is lower than expected ().

Table 6. Results of classification with Toxtree.

Classification with Pipeline Pilot machine learning

The learning model was applied to the training set to check the quality of the MOA prediction as a function of the MOA indicated in the EPAFHM dataset. In this dataset, 116 derivatives had no defined MOA. With this learning model, each compound was classified by function to a specific class based on the highest probability obtained. shows the number of chemicals predicted in each MOA for EPAFHM and HPV.

Table 7. Reclassification of MOA for chemicals in the initial set and prediction by the learning model.

The correlation between the real values and those predicted by the CERMN model for the 358 derivatives classified as having a narcosis MOA (see ) was high (r2 = 0.77 and s = 0.54). If we compare this result to the Toxtree result, the CERMN model handled nearly four times as many derivatives for a similar correlation. The same studies were conducted with the test set (HPV). For this dataset, 60% of the chemicals were classified as having a baseline narcosis MOA. The correlation was close to the results obtained with Toxtree (r2 = 0.64 and s = 0.75), but this classification predicted 136 derivatives compared to only 57 with Toxtree. For the EPAFHM dataset, we can strongly improve the correlation by classifying the compounds according to the probability (P) of belonging to a class (e.g. with P = 0.9, the correlation was 0.88 for r2 for 127 chemicals instead of 358), but with HPV we observed that the correlation did not substantially improve using the same cutoff.

Conclusion

The definition of a method able to classify compounds as having a narcosis MOA with a high probability is necessary to improve the prediction of toxicity for the majority of organic derivatives. Bayesian approaches with fingerprints have made important advances toward this definition, but it will be necessary to couple such methods with a knowledge base that includes definitions of structural alerts (chemical features leading to high toxicity). With a correct classification of narcosis for the MOA, the ECB and CERMN models should give the most relevant statistical results along with an explanation of the origin of the toxic effects for these derivatives. Analysis of the outliers associated with general models (whatever the models) has shown clearly the origin of the discrepancies for the predictions. Very highly toxic compounds led to important errors for the ECB and CERMN predictions. TOPKAT included structural alerts, but in this case several derivatives were overestimated for their toxicities (the nitrile feature as a structural alert, for instance). Therefore, for each structural alert, its characteristic must be clearly defined. Toward this objective, data included in the DEREK expert system should be very useful. The next steps of this study will be to improve this classification as well as to analyze the possibility of improving the predictions for derivatives classified as “multiple narcosis” and “reactants” for the MOA. Indeed, as described herein, it is possible to build a good QSAR equation for reactants with the same MOA (Equation (5)). This work is now in progress.

Declaration of interest

We thank ANR (Agence National de la Recherche, ANR-07-CP2D-09-02) for financial support.

References

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.