432
Views
7
CrossRef citations to date
0
Altmetric
Research Article

Computational neural network analysis of the affinity of N-n-alkylnicotinium salts for the α4β2* nicotinic acetylcholine receptor

, , , , &
Pages 157-168 | Received 31 Aug 2007, Accepted 14 Dec 2007, Published online: 20 Oct 2008

Abstract

Based on an 85 molecule database, linear regression with different size datasets and an artificial neural network approach have been used to build mathematical relationships to fit experimentally obtained affinity values (Ki) of a series of mono- and bis-quaternary ammonium salts from [3H]nicotine binding assays using rat striatal membrane preparations. The fitted results were then used to analyze the pattern among the experimental Ki values of a set of N-n-alkylnicotinium analogs with increasing n-alkyl chain length from 1 to 20 carbons. The affinity of these N-n-alkylnicotinium compounds was shown to parabolically vary with increasing numbers of carbon atoms in the n-alkyl chain, with a local minimum for the C4 (n-butyl) analogue. A decrease in Ki value between C12 and C13 was also observed. The statistical results for the best neural network fit of the 85 experimental Ki values are r2 = 0.84, rmsd = 0.39; rcv2 = 0.68, and loormsd = 0.56. The generated neural network model with the 85 molecule training set may also be of value for future predictions of Ki values for new virtual compounds, which can then be identified, subsequently synthesized, and tested experimentally.

Introduction

Nicotine, the major alkaloid in tobacco, is the addictive compound that maintains tobacco smoking behavior [Citation1–2]. Tobacco smoking is the number one cause of preventable mortality, and is responsible for over 4 million smoking-related deaths world-wide each year [Citation3]. Although 70% of the 50 million people that smoke tobacco have attempted to quit, only 3% of those individuals maintain cessation for a period of one year using currently available therapies. Consequently, relapse rates for tobacco smoking continue to be high, indicating that novel smoking cessation therapies are needed [Citation3].

Nicotine produces its effects on the central nervous system (CNS) by interacting with nicotinic acetylcholine receptors (nAChRs) that are essential for synaptic transmission. nAChRs consist of five protein subunits which transverse the neuronal cell membrane. The most common nAChR subtype is α4β2*, which accounts for over 90% of the high-affinity [3H]nicotine binding sites in brain [Citation4]. α4β2* nAChRs have been recognized as a major therapeutic target for mediating several CNS pathologies and diseases, including tobacco dependence [Citation3,Citation4].

A variety of α4β2* agonists and antagonists have been discovered [Citation5,Citation6]. Previous research [Citation7–13] in our laboratories has led to the discovery of a new class of nAChR antagonists composed of a series of mono- and bis-quaternary ammonium salts [Citation13]. Among these are the N-n-alkylnicotinium salts that vary in the length of their N-n-alkyl chains [Citation11]. After attachment of an n-alkyl chain to the pyridine nitrogen atom of the nicotine molecule, the properties of the resulting novel compounds are significantly altered. The pKa value of the nitrogen of the pyrrolidine ring drops from 8.5 to about 6 [Citation14], and the compound is converted from an agonist to an antagonist at nAChRs [Citation11,Citation12]. These novel compounds exhibit potent and competitive inhibition of nAChR subtypes mediating S-(-)-nicotine-evoked dopamine release from dopaminergic nerve terminals in superfused rat striatal slices [Citation11], and may have potential as smoking cessation agents, due to their selective antagonist activity at these nAChR subtypes [Citation11].

An interesting phenomenon observed in the structure-activity profile of the N-n-alkylnicotinium series of compounds is that the experimentally measured Ki values obtained from the [3H]nicotine binding assay (which probes the α4β2* nAChR subtype) afforded a greater diversity in pharmacological response than might be expected for a homologous series of compounds where the N-n-alkyl chain length varies from C1 to C20. However, these data can be mined to identify associations that can then provide insights for future drug design studies. Generally speaking, data mining (sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it into useful information. Technically, data mining is the process of discovering previously unknown relationships (correlations) or hidden patterns among a group of data in a database [Citation15]. In this study, linear regression and artificial neural network approaches were used to analyze the pattern among the experimentally determined binding affinities (Ki values) at the α4β2* nAChR subtype for a set of N-n-alkylnicotinium salts that varied in the length of their N-n-alkyl chains. The generated quantitative structure-activity relationship (QSAR) models are also valuable for predicting the affinity of hypothetical molecules as antagonists at α4β2* nAChR subtypes which can then be identified as structures of interest, and therefore seriously considered for synthesis and pharmacological evaluation.

Materials and methods

Experimental database

A database of eighty-five mono- and bis-quaternary ammonium compounds with experimentally determined Ki values from the [3H]nicotine binding assay was available. The [3H]nicotine binding assay utilized rat striatal membrane preparations to probe the interaction of the analogs with the α4β2* nAChR subtype [Citation11]. Of the 85 molecules in the database, the Ki values of 7 of the molecules were ≤ 0.1 μM, 13 molecules had Ki values in the range 0.1–1 μM, 26 molecules had Ki values in the range 1–10 μM, and 39 molecules had Ki values ≥ 10 μM. This database was utilized for the linear and non-linear regression analyses ().

Table I.  Structures, experimentally determined pKi values (with Ki value in μM) from [3H]nicotine binding assays, and pKi values (with Ki value in μM) calculated by the NN731 model and its leave-one-out validation results for 85 quaternary ammonium salts.

Generation of molecular descriptors

Molecular modeling was carried out with the aid of the Sybyl discovery software package [Citation16a]. This software was used to construct the initial molecular structures utilized in the geometry optimization (energy minimization) for all molecules involved in this study. The geometry optimization was first performed using a molecular mechanics (MM) method with the Tripos force field and the default convergence criterion, which was then followed by a semi-empirical molecular orbital (MO) energy calculation at the PM3 level [Citation16b].

A set of 530 descriptors, including OD,1D, 2D and 3D whim descriptors, were calculated by the DRAGON program [Citation17] for these optimized molecules. In addition, a variety of other molecular properties, such as polar volume and polar surface area, were produced by use of the Tripos software package [Citation16a]. Other steric descriptors were measured from the optimized 3D molecular structures, including the intra-molecular distance between the pyridine ring nitrogen and the pyrrolidine ring nitrogen in the nicotine derived molecules. The PM3 method was used to determine the LUMO, HOMO energies, dipole moments, and atomic charges of each molecule [Citation16]. Pre-filtering for constant and pair-wise correlation (>0.95) descriptors were performed, and a stepwise procedure was carried out to select variables from the remaining 143 descriptors.

Target properties

Experimentally determined Ki values of the synthesized quaternary ammonium analogs were measured according to the procedure described by Dwoskin et al. [Citation11]. The log(1/Ki) or pKi (with Ki value in μM) was used as the target property for performing linear regression and neural network analyses.

Linear regression analysis

Linear least squares regression and multiple linear regression (MLR) analyses were performed utilizing an in-house Fortran77 program. Starting from the entire set of descriptors, variable selection by a forward and reverse stepwise regression procedure was performed, in which forward selection was followed by backward elimination of variables, resulting in an equation in which only variables that significantly increased the predictability of the dependent variable were included [Citation18a].

Artificial neural network (ANN) analysis

Feed-forward, back-propagation-of-error networks were developed using a neural network C program [Citation18]. Network weights (Wji(s)) for a neuron “j” receiving output from neuron “i” in the layer “s” were initially assigned random values between − 0.5 and +0.5. The sigmoidal function was chosen as the transfer function that generates the output of a neuron from the weighted sum of inputs from the preceding layer of units. Consecutive layers were fully interconnected; there were no connections within a layer or between the input and the output. A bias unit with a constant activation of unity was connected to each unit in the hidden and output layers.

The input vector was constructed from the set of descriptors for each molecule in the database, as generated by the previous steps. All descriptors and targets were normalized to the [0,1] interval. The network was configured with one or more hidden layers. To find the optimal number of neurons in the hidden layer, neural network architectures with different inputs and hidden neurons, respectively, were trained. During the ANN learning process, each compound in the training set was iteratively presented to the network, i.e. the input vector of the chosen descriptors in normalized form for each compound was fed to the input units, and the network's output was compared with the experimentally determined “target” value. During one “epoch”, all compounds in the training set were presented, and weights in the network were then adjusted on the basis of the discrepancy between network outputs and observed pKi values by back-propagation, using the generalized delta rule [Citation18].

Quality evaluation of regression analyses

A key point in any data mining process is quality evaluation of a mathematical analysis. It serves two purposes: i.e. to help identify the mathematical model that best represents the pattern in training data, and to predict how well the final model will work in the future [Citation15]. As the total number of data points used in the current study was less than 100, to obtain a precise estimate, the models generated with linear or non-linear mathematical approaches were validated by leave-one-out (LOO or loo) cross validation. The models were assessed by the Pearson correlation coefficient r2, root mean square deviation (rmsd), and predictive rcv2, which is defined as: where SD is the sum of squared deviations of each measured pKi value from its mean, and PRESS is the predictive sum of squared differences (the sum of squared differences between actual and predicted values). The final mathematical relationship was determined as the one having the smallest rmsd error of LOO validation for various linear or non-linear regression analyses.

Results and discussion

Results from linear regression analysis

The most widely used modeling method to fit data is linear least squares regression. Least squares fitting (LSF) with a single variable of the number of carbon atoms in the N-n-alkyl chain, was first used in this study to build a simple linear relationship, i.e. (Equation 2), to fit the affinity pKi values of the 17 N-n-alkylnicotinium salts (Compounds 218 in ). The computational pKi obtained from (Equation 2) showed a linear relationship of the binding affinity (pKi values) with the variation in the length of the N-n-alkyl chain. Assessment of the least-squares model indicated that the correlation coefficient r2 of the regression from fitting was 0.46, which is less than 0.5 (/); thus, this is not a statistically significant model.

Table II.  Quality of linear regression and ANN models.

Multiple linear regression analysis was then used to model the relationship between the pKi values of the 17 N-n-alkylnicotinium salts with more explanatory variables. By searching the 143-descriptor variable space with a forward and reverse stepwise variable selection procedure, a linear model MLR1 (Equation 3), was obtained: where E1m corresponds to the 1st component accessibility directional whim index / unweighted by atomic masses (the slash stands for divided by), and Gm corresponds to G total symmetry index / weighted by atomic masses, calculated by Dragon software [Citation17]. The correlation coefficient between these two descriptors was − 0.83. The computed affinity of the N-n-alkylnicotinium salts for the α4β2* nAChR subtype varied parabolically with increasing length of the N-n-alkyl chain (). A local minimum was found when the number of carbon atoms in the n-alkyl chain was equal to 4. The statistical analysis for the regression indicated that the correlation coefficient r2 and rmsd between the observed and the fitted results was 0.51 and 0.66, respectively (). The leave-one-out validation rcv2 was 0.38 and the leave-one-out rmsd was 0.75 (). The Pearson correlation coefficients between calculated and observed pKi values of the 17 molecules from fitting and LOO validation were 0.71 and 0.62 (), respectively.

Figure 1 pKi values versus the number of carbon atoms in the N-n-alkyl chain of the N-n-alkylnicotinium salts from: a. [3H]nicotine binding assays; b. the MLR1 model with 17 molecules; c. the MLR2 model with 32 molecules; d. the MLR3 model with 85 molecules; e. the NN711 model; f. the NN721 model; g. the NN731 model; and h. the NN741 model with 85 molecules.

Figure 1 pKi values versus the number of carbon atoms in the N-n-alkyl chain of the N-n-alkylnicotinium salts from: a. [3H]nicotine binding assays; b. the MLR1 model with 17 molecules; c. the MLR2 model with 32 molecules; d. the MLR3 model with 85 molecules; e. the NN711 model; f. the NN721 model; g. the NN731 model; and h. the NN741 model with 85 molecules.

To confirm the detected patterns from the above analysis, a multi-linear regression approach with a larger dataset was used to examine the relationship between the affinity of the N-n-alkylnicotinium salts at the α4β2* nAChR subtype and increasing length of the n-alkyl chain. The dataset included 32 molecules () composed of 17 N-n-alkylnicotinium salts (Compounds 2–18 in ), 13 N-n-alkylpyridinium salts (Compounds 41–53 in ), and two N-n-alkylpicolinium salts (Compounds 65 and 66 in ). With a forward and reverse stepwise variable selection procedure, six descriptors E2u, Hy, LUMO, Me, Du and RDSUM, as defined in , were selected. A lower inter-correlation between these descriptors (non-diagonal element is larger than 0.85 in ) indicated the possibility that the chance correlation for a linear equation built from these descriptors is low. A linear model (MLR2) between the six descriptors and the pKi values of these 32 molecules was generated. Statistical analysis of the regression showed that the correlation coefficient r2 was 0.79 with an rmsd of 0.48; leave-one-out validation rcv2 was 0.71 with an rmsd value of 0.57 (). The linear equation is as follows: The calculated pKi values from (Equation 4) for the 17 N-n-alkylnicotinium compounds versus the number of carbon atoms in the N-n-alkyl chain were plotted in . The fitting and validation Pearson correlation (R) between the experimental pKi and computational pKi values for the 17 N-n-alkylnicotinium molecules was 0.73 and 0.62, respectively (), which is similar to those obtained from MLR1. The computational results consistently showed that the affinity of the N-n-alkylnicotinium salts at the α4β2* nAChR subtype parabolically varied with increasing length of the N-n-alkyl chain. A local minimum was found when the number of carbon in the N-n-alkyl chain was equal to four carbons atoms. (Equation 4) was built by best fitting the affinity of 32 mono-nicotinium salts for the α4β2* nAChR subtype, which helps reduce or eliminate the effects of experimental noise or error in analyzing the affinity of the N-n-alkylnicotinium salts for the α4β2* nAChR subtype.

Table III.  Brief description of the descriptors used the linear regression relationship (MLR2) with a 32-molecule dataset.

Table IIIb.  Pearson correlation coefficient R between the descriptors used in MLR2 model.

Table IV.  Correlation between the observed pKi values from the [3H]nicotine binding assay and the calculated pKi values for the series of 17 N-n-alkylnicotinium salts from various models.

Results from neural network regression analysis

The artificial neural network (ANN) technique has been demonstrated recently to be an effective tool for data mining and has been used in many QSAR studies [Citation19–23]. The major advantage of ANN lies in its ability to model a wide set of linear and non-linear functions, without knowing the analytic forms in advance. The ANN approach is especially suitable for mapping complex relationships that may exist between model inputs and output. To identify the best correlation between the observed and calculated pKi values for the set of 85 compounds in the database we followed the same procedures as described previously for descriptor selection and determination of the optimal ANN configuration [18a,18b,24,25]. Variable selection from the dataset of 143 descriptors for the 85 molecules was carried out by a stepwise MLR procedure based on forward-selection and backward-elimination methods and located seven descriptors for an optimal multi-linear regression (Equation 5) (MLR3). This afforded a correlation coefficient r2 of 0.72 with an rmsd of 0.52; leave-one-out validation rcv2 of 0.67 with an rmsd value of 0.57 (). The inputs for the best ANN model were composed of the seven descriptors in (Equation 5) i.e. polar volume (PV), polar surface area (PSA), lowest unoccupied molecular orbital energy (LUMO), distances between nitrogen atoms (DISNN), 2nd component size directional Whim index (L2u), reciprocal distance Wiener-type index (RDSUM), and maximal electrotopological positive variation (MAXDP), as indicated in . The correlation coefficient between the different descriptors is provided in . No non-diagonal element was larger than 0.60 in , indicating that redundant information included in the set of descriptors is low. Although the four configurations of the neural networks listed in afforded close validation statistical results, the configurations NN721 and NN731 had a lower chance to be under-trained or over-trained for the 85 molecule dataset. The statistical results for model NN731 were as follows: r2 = 0.84, rmsd = 0.39, rcv2 = 0.68, and loormsd = 0.56. The trained and LOO predicted versus observed pKi values from NN731 were plotted in .

Table Va.  Brief description of the descriptors used in the ANN models.

Table Vb.  Pearson correlation coefficient R between the descriptors used in the ANN models.

Figure 2 The calculated versus the experimentally determined pKi values from the [3H]nicotine binding assay for the trained (shown in black squares), leave-one-out cross-validation (shown in red diamonds) for the best NN731 QSAR model. The solid line represents a perfect correlation. The dotted lines represent one order difference from the perfect fitting. Most points around the dotted lines correspond to the N-n-alkylnicotinum compounds in .

Figure 2 The calculated versus the experimentally determined pKi values from the [3H]nicotine binding assay for the trained (shown in black squares), leave-one-out cross-validation (shown in red diamonds) for the best NN731 QSAR model. The solid line represents a perfect correlation. The dotted lines represent one order difference from the perfect fitting. Most points around the dotted lines correspond to the N-n-alkylnicotinum compounds in Table VI.

The experimental and calculated pKi values versus the number of carbon atoms in the N-n-alkyl chain of the 17 nicotinium salts are plotted in –h. Affinities of the N-n-alkylnicotinium salts predicted by NN711 () show that increasing the length of the N-n-alkyl chain results in a linear increase in the affinity of the compounds for α4β2* nAChRs, which is similar to (results obtained from MLR3). The lower correlation coefficient 0.71 for fitting and 0.59 for LOO validation between the computational and experimental pKi values of the set of 17 homologous compounds () indicated that the nature of the pattern among the set of data had not been fully identified. α4β2* nAChR affinities of the N-n-alkylnicotinium salts predicted by both NN721 () and NN731 () showed that increasing the length of the n-alkyl chain resulted in a parabolic variation in affinity. A possible decrease in pKi values may occur for some compounds with n-alkyl chain lengths ranging from C12 to C14. The plot () from NN741 had a very similar pattern to that from NN731. The pattern detected by NN731 afforded the best fit and validation Pearson correlation (0.86 and 0.68, respectively) with the observed data (). The pattern in [experimentally obtained affinity values (pKi) versus the number of carbon atoms in the n-alkyl chain] was somewhat random, which is an interesting observation for further investigation. Generally, it is expected that the pKi values would vary continuously, according to the basic chemical variation rule for a homologous series of compounds. The simplest interpretation for a decrease in the pKi values for compounds with n-alkyl chain lengths in the range C12 to C13 from the pattern obtained from computational approaches, is that the binding site on the protein may be limited in size, and may not be able to accommodate the larger N-substituted molecules up to this point. The subsequent increase in the pKi values observed with molecules bearing N-n-alkyl substituents greater in length than C13 would suggests that these compounds with longer n-alkyl substituents bind to the receptor protein in a different manner, or at a different binding site. However, this remains to be proven in further studies.

Descriptor contributions to computational neural network models have been proposed and/or applied by several scientists [Citation18,Citation24–27]. Utilizing the method proposed by Guha and Jurs [Citation25], the results of the descriptor sensitivity analysis for NN731 showed that the percent contribution of the seven descriptors was: PV (14.3%), PSA (14.1%), LUMO (14.2%), DISNN (14.6%), L2u (13.9%), RDSUM (14.0%), and MAXDP (14.7%). The very similar percentage values for these seven descriptors suggest that the contributions of these descriptors to the model are almost equal. It should be noted that the above descriptor sensitivity analysis is related to a specific model. The parameters (weights) and transfer functions related to a single neuron in a neural network model also contribute to the power of the model's predictivity. In addition, a predictive model can be built from different sets of descriptors. Thus the descriptors having an important role in predicting target property are not limited only to the descriptors used to build the model. The correlation existing between the number of carbon atoms in an n-alkyl chain of a drug molecule containing a variable length n-alkyl substituent and the properties of these molecules, is well-known. These compounds, defined as members of a homologous series, have similar, as well as continuously varied properties. For example, a loss of potency as one ascends a homologous series of compounds is often used to map the dimensions of binding sites on a protein target [Citation28]. Thus, whether these properties are modeled or not, the correlation is known to exist. The correlation between the number of carbon atoms in the alkyl chain of the N-n-alkylnicotinium salts and the properties of the molecule is another example of the above. With the small dataset of the N-n-alkylnicotinium salts utilized in this current study, one has only a limited appreciation of the relationship between the binding affinity and the length of the n-alkyl chain in these compounds. Neural network analysis can provide a much more comprehensive view of the structure-activity relationships because of its ability to model a wide set of linear and non-linear functions, without knowing the analytic forms in advance. However, with a small number of compounds, the full predictive power of neural network analysis cannot be utilized, due to possible overtraining problems. Generally, when a dataset includes less than 40 compounds, back propagation neural network analysis will just model patterns that are not much different from linear function modeling. In this current study, we built more predictive models to detect patterns from a larger number of 85 compounds, by utilizing the advantages of the neural network approach and, in a creative way, graphically expressed the relationship of the binding affinity of the N-n-alkylnicotinium salts with the number of carbon atom in the n-alkyl chain from the neural network modeling results. Obviously, whereas a linear relationship can only detect major features, such as the longer the alkyl chain, the larger the pKi value of a compound (). with a higher coefficient R value and smaller RMSD () show the affinity values (pKi) vary near-parabolically with increasing alkyl chain length with a local minimum at C4. From , one can also see that a higher R (Fitting) corresponds to a better R(LOO), which makes the results more reliable.

indicates that 10 nicotinium analogs from the set of 85 mono- and bis-quaternary ammonium salts had absolute error values between the experimental and calculated pKi values of larger than 0.80, either in training or leave-one-out, or in both, when comparing the observed and calculated pKi values in model NN731. The original training correlation coefficient r2 of the 85 molecule dataset from the neural network model NN731 was 0.84 with an rmsd of 0.39; leave-one-out rcv2 was 0.68 with an rmsd of 0.56. After removing the 10 compounds listed in from the trained model, the statistical analysis showed that the training correlation coefficient r2 for the remaining data was 0.92 with an rmsd of 0.25; leave-one-out rcv2 was 0.80 with an rmsd of 0.41. Apparently, the difficulty in obtaining a better model from the dataset of 85 compounds is mainly due to the error associated with the experimental data from the nicotinium series of compounds. The same conclusion holds in the analysis of the data obtained from model NN721.

Table VI.  Compounds with large errors between the experimentally determined pKi values from [3H]nicotine binding assays and calculated pKi/LOO pKi values by model NN731.

Comparsion with the previously published computational models

As an important target for drug development, several modeling approaches have been carried out in order to understand quantitative structure-activity relationships of ligands that bind to nAChR subtypes. Two recent reviews that report on ligand-based models of neuronal nAChR agonists, involving either pharmacophore development or quantitative structure-affinity relationships, have been reported by Glennon et al. and Nicolotti et al., respectively [Citation29–30] More recently, QSAR studies in this area that focus on specific types of analogs, have also been reported [Citation31–35]. With regard to quaternary ammonium nAChR antagonists, QSAR studies on a series of mono- and bis-quaternary ammonium salts have been reported by Ayers et al. [Citation36]. In the study of Ayers et al, the Self Organizing Maps (SOM) approach was used to classify the compounds according to their structures and activities. Several genetic functional approximation models were created to simulate the quantitative structure-activity relationships from three small subsets of compound datasets (i.e., sets of 31, 38 and 23 compounds).

Our current work focuses on the pattern detection for the affinity of N-n-alkyl quaternary ammonium salts in the [3H]nicotine binding assay using rat striatal membranes versus the number of carbon atoms in the N-n-alkyl substitiuent by mining knowledge from 17, 32 and 85 molecule datasets. Models generated from this larger 85-molecule dataset may provide additional power for future prediction of Ki values of new virtual compounds.

Conclusions

Linear regression and neural network approaches have been used to build mathematical relationships to compute the binding affinity of a series of mono- and bis-quaternary ammonium salts in [3H]nicotine binding assays using rat striatal membranes. These results were then used to analyze the pattern among the experimentally determined affinities (Ki values) of a set of 17 N-n-alkylnicotinium salts for the α4β2* nicotinic receptor subtype. The affinity of these N-n-alkylnicotinium compounds was shown to vary parabolically with increasing length of the n-alkyl chain, with a local minimum indicated for the C4 analogue. A decrease in pKi values for compounds with n-alkyl chain lengths of C12 and C13 was also evident. The generated neural network model with the larger 85 molecule training set may provide additional power for future prediction of Ki values of new virtual compounds, prior to their synthesis and pharmacological evaluation.

Acknowledgement

This work was supported by NIH grant U19DA017548.

References

  • M Quik. (2004). Trends Neurosci 27:561–568.
  • RC Hogg, and D Bertrand. (2004). Science 306:983–985.
  • AR Tapper, SL McKinney, R Nashmi, J Schwarz, P Deshpande, C Labarca, P Whiteaker, MJ Marks, AC Collins, and HA Lester. (2004). Science 306:1029–1032.
  • KG Paradiso, and JH Steinbach. (2003). J Physiol 553:857–871.
  • AA Jensen, B Frolund, T Liljefors, and P Krogsgaard-Larsen. (2005). J Med Chem 48 (15):4705–4745.
  • MN Romanelli, and F Gualtieri. (2003). Med Res Rev 23 (4):393–397.
  • PA Crooks, JT Ayers, R Xu, SP Sumithran, VP Grinevich, LH Wilkins, AG Deaciuc, DD Allen, and LP Dwoskin. (2004). Bioorg Med Chem Letters 14:1869–1874. (b) Grinevich VP, Crooks PA, Haubner AJ, Ghosheh OH, Ayers JH, Dwoskin LP. J Pharmacol Exp Ther 2003; 306:1011–1020. (c) Ayers JT, Dwoskin LP, Deaciuc AG, Grinevich VP, Zhu J, Crooks PA. Bioorg Med Chem Lett 2002; 12:3067–3071
  • LP Dwoskin, SP Sumithran, J Zhu, AG Deaciuc, JT Ayers, and PA Crooks. (2004). Bioorg Med Chem Letters 14:1863–1865.
  • R Xu, LP Dwoskin, V Grinevich, SP Sumithran, and PA Crooks. (2002). Drug Dev Res 55:173–186.
  • LH Wilkins, A Haubner, JT Ayers, PA Crooks, and LP Dwoskin. (2002). J Pharmacol Exp Ther 301:1088–1096.
  • LH Wilkins, VP Grinevich, JT Ayers, PA Crooks, and LP Dwoskin. (2003). J Pharmacol Exp Ther 304:400–410.
  • PA Crooks, A Ravard, LH Wilkins, LH Teng, ST Buxton, and LP Dwoskin. (1995). Drug Dev Res 36:91–102.
  • LP Dwoskin, and PA Crooks. (2001). J Pharmacol Exp Ther 298:395–402.
  • XQ Huang, F Zheng, PA Crooks, LP Dwoskin, and C-G Zhan. (2005). J Am Chem Soc 127:14401–14414. (b) Huang XQ, Zheng F, Chen X, Crooks P A, Dwoskin LP, Zhan C-G. J Med Chem 2006; 49(26):7661–7674
  • HA Edelstein. Introduction to data mining and knowledge discovery. 3rd ed. Potomac, MD: Two Crows Corp; (1999).
  • (a) Tripos discovery software package with SYBYL 6.8.1, Tripos Inc., 1699 South Hanley Rd., St. Louis, Missouri, 63144, USA. (b) Frisch MJ, Trucks GW, Schlegel HB, Scuseria GE, Robb MA, Cheeseman JR, Montgomery Jr JA, Vreven T, Kudin KN, Burant JC, Millam JM, Iyengar SS, Tomasi J, Barone V, Mennucci B, Cossi M, Scalmani G, Rega N, Petersson GA, Nakatsuji H, Hada M, Ehara M, Toyota K, Fukuda R, Hasegawa J, Ishida M, Nakajima T, Honda Y, Kitao O, Nakai H, Klene M, Li X, Knox JE, Hratchian HP, Cross JB, Adamo C, Jaramillo J, Gomperts R, Stratmann RE, Yazyev O, Austin AJ, Cammi R, Pomelli C, Ochterski JW, Ayala PY, Morokuma K, Voth GA, Salvador P, Dannenberg JJ, Zakrzewski VG, Dapprich S, Daniels AD, Strain MC, Farkas O, Malick DK, Rabuck AD, Raghavachari K, Foresman JB, Ortiz JV, Cui Q, Baboul AG, Clifford S, Cioslowski J, Stefanov BB, Liu G, Liashenko A, Piskorz P, Komaromi I, Martin RL, Fox DJ, Keith T, Al-Laham MA, Peng CY, Nanayakkara A, Challacombe M, Gill PMW, Johnson B, Chen W, Wong MW, Gonzalez C, Pople JA. Gaussian 03, Revision A.1, Gaussian, Inc., Pittsburgh, PA, 2003.
  • DRAGON software version 3.0, developed by Milano Chemometrics and QSAR Research Group, 2003 (http://www.disat.nimib.it/chm/Dragon.htm).
  • F Zheng, E Bayram, SP Sumithran, JT Ayers, C-G Zhan, JD Schmitt, LP Dwoskin, and PA Crooks. (2006). Bioorg Med Chem 14 (9):3017–3037. (b)Zheng F, Zheng G, Deaciuc AG, Zhan C-G, Dwoskin LP, Crooks PA. Bioorg Med Chem 2007; 15:2975-2992
  • R Orre, AB Lansner, and M Lindquist. (2000). Computational statistics and data analysis. 34:473–493. (b) Lancashire LJ, Mian S, Ellis IO, Rees RC, Ball GR. Current Proteomics 2005; 2(1):15-29
  • M Fernandez, and J Caballero. (2006). J Mol Graph Model 25:410–422.
  • M Fernández, MC Carreiras, JL Marco, and J Caballero. (2006). J Enz Inhib Med Chem 21:647–661.
  • J Caballero, A Tundidor-Camba, and M Fernández. (2007). QSAR Comb Sci 26:27–40.
  • MP González, J Caballero, A Tundidor-Camba, AM Helguera, and M Fernández. (2006). Bioorg Med Chem 14:200–213.
  • R Guha, and PC Jurs. (2005). J Chem Inf Model 45:800–806.
  • R Guha, DT Stanton, and PC Jurs. (2005). J Chem Inf Model 45:1109–1121.
  • J Caballero, L Fernandez, JI Abreu, and M Fernandez. (2006). J Chem Inf Model 46:1255–1268.
  • D Cherqaoui, and D Villemin. (1994). J Chem Soc Faraday Trans 90:97–102.
  • RG Eckenhoff, JW Tanner, and JS Johansson. (1999). Mol Pharmacol 56:414–418.
  • RA Glennon, M Dukat, and L Liao. (2004). Curr Top Med Chem 4 (6):631–644.
  • O Nicolotti, C Altomare, M Pellegrini-Calace, and A Carotti. (2004). Curr Top Med Chem 4 (3):335–360.
  • H Zhang, H Li, and C Liu. (2005). J Chem Inf Model 45 (2):440–448.
  • H Zhang, H Li, and Q Ma. (2007). J Mol Graph Model 26 (1):226–235.
  • K Audouze, EØ Nielsen, GM Olsen, P Ahring, TD Jørgensen, D Peters, T Liljefors, and T Balle. (2006). J Med Chem 49 (11):3159–3171.
  • J Ji, MR Schrimpf, KB Sippy, WH Bunnelle, T Li, DJ Anderson, C Faltynek, CS Surowy, T Dyhring, PK Ahring, and MD Meyer. (2007). J Med Chem 50 (22):5493–5508.
  • A Basu, S Gayen, S Samanta, P Panda, K Srikanth, and T Jha. (2006). Canadian J Chem 84 (3):458–463.
  • JT Ayers, A Clauser, JD Schmitt, LP Dwoskin, and PA Crooks. (2005). AAPS Journal: AAPS-NIDA Symposium. Frontiers in Science: Drug Addiction: From Basic Research to Therapeutics 7 (3):E678–E685.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.