455
Views
11
CrossRef citations to date
0
Altmetric
Short Communications

Prediction of HIV-1 protease inhibitory activity of 4-hydroxy-5,6-dihydropyran-2-ones: QSAR study

, &
Pages 288-294 | Received 08 May 2010, Accepted 24 May 2010, Published online: 24 Aug 2010

Abstract

Inhibition of human immunodeficiency virus 1 (HIV-1) protease is an important strategy for the treatment of HIV and acquired immune deficiency syndrome (AIDS). Therefore, HIV-1 protease inhibitory activity of dihydropyranone derivatives has been analyzed with different physico-chemical parameters. In the present work, QSAR studies were performed on a series of 4-hydroxy-5,6-dihydropyran-2-ones to explore the physico-chemical parameters responsible for their HIV-1 protease inhibitory activity. Physico-chemical parameters were calculated using WIN CAChe 6.1. Stepwise multiple linear regression analysis was performed to derive QSAR models which were further evaluated for statistical significance and predictive power by internal and external validation. The selected best QSAR model was having correlation coefficient (R) = 0.875 and cross-validated squared correlation coefficient (Q2) = 0.707. The developed significant QSAR model indicates that hydrophobicity of whole molecule and the substituent present at sixth position of dihydropyranones play an important role in the HIV-1 protease inhibitory activities of 4-hydroxy-5,6-dihydropyran-2-ones.

Introduction

Acquired immune deficiency syndrome (AIDS) is a formidable pandemic that is still wreaking havoc world wide. The catastrophic potential of this virally caused disease may not have been fully realized. The causative moiety of the disease is human immunodeficiency virus (HIV), which is a retrovirus of the lentivirus familyCitation1. The three viral enzymes, reverse transcriptase, protease, and integrase, encoded by the gag and gag-pol genes of HIV play an important role in the virus replication cycle. Among them, viral protease catalyzes the formation of viral functional enzymes and proteins necessary for its survival. The viral particles at this stage are called virions. The virus particles after the protease action have all the necessary constituents of mature virus and are capable of invading other T4 cells and repeating the life cycle of proviral DNA from viral RNA, the key stage in viral replication. Its central role in virus maturation makes protease is a prime target for anti-HIV-therapyCitation2.

QSAR analyses of HIV-1 reverse transcriptase inhibitorsCitation3, HIV-1 protease inhibitorsCitation4–7, and HIV-1 integrase inhibitorsCitation8,Citation9 were reported. The present group of authors has developed a few quantitative structure-activity relationship models to predict biological activity of different group of compoundsCitation10–16. In continuation of such efforts, in this article, we have performed QSAR analysis for HIV-1 protease inhibitory activity of 4-hydroxy-5,6-dihydropyran-2-one derivativesCitation17 using modelling software WIN CAChe 6.1 (molecular modelling software) and statistical software STATISTICA 6.

The purpose of the present study is to investigate the physico-chemical parameters responsible for the HIV-1 protease inhibitory activity of 4-hydroxy-5,6-dihydropyran-2-ones, explore the correlation between them, and expected to get more information for designing novel dihydropyranone derivatives with potent HIV-1 protease inhibitory activity.

There is high structural diversity and a sufficient range of the biological activity in the selected series of 4-hydroxy-5,6-dihydropyran-2-one derivatives. It insists as to select these series of compounds for our QSAR studies. We carried out QSAR analysis and established a QSAR model to guide further structural optimization and predict the potency and physiochemical properties of clinical drug candidates.

Materials and methods

Materials

All of the molecular modelling studies reported herein were performed using WIN CAChe 6.1 (product of Fujitsu Private Limited, Nagano, Japan, http://www.cachesoftware.com /contacts/japan.shtml) and Chem Draw Ultra version 8 (Cambridge Soft, Swann Road, CB5 8LA, UK.) modelling software and the QSAR models were executed with STATISTICA version 6 (Softstat, Inc., Tulsa, OK) software.

Biological data

In the present work we have taken 59 4-hydroxy-5,6-dihydropyran-2-one compounds () and their HIV-1 protease inhibitory activity from the reported workCitation17. All the HIV-1 protease inhibitory activities used in the present study were expressed as pIC50 = −log10 IC50; where IC50 is the micro molar concentration of the compounds producing 50% reduction in the HIV-1 protease activity is stated as the means of at least two experiments. The compounds which did not show confirmed HIV-1 protease inhibitory activity and the compounds having particular functional groups present only once at a particular position in the above cited literature have not been taken for our study.

Table 1.   Structure, selected parameters, and their HIV-1 protease inhibitory activity of 4-hydroxy-5,6-dihydropyran-2-one analogues.

Optimization of molecules structure

All the 59 compounds (50 compounds in training set and 9 in test set—test and training set compounds were chosen manually such that low, moderate, and high activity compounds were present approximately equal proportions in both setsCitation18) were built on workspace of molecular modelling software WIN CAChe 6.1. The energy minimization was done by geometry optimization of molecules using molecular mechanics followed by semi-empirical PM3 method available in molecular orbital package module until the root mean square gradient value becomes smaller than 0.001 kcal/mol Å. Most stable structure for each compound was generated and used for calculating various physico-chemical descriptors like thermodynamic, steric, and electronic values of descriptors.

Descriptors calculation

The physico-chemical properties were calculated on project leader file of the modelling software WIN CAChe 6.1. In the present study, the calculated descriptors were conformational minimum energies, zero-order connectivity index, first-order connectivity index, second-order connectivity index, dipole moment, total energy at its current geometry after optimization of structure, heat of formation at its current geometry after optimization of structure, highly occupied molecular orbital, low unoccupied molecular orbital, octanol-water partition coefficient (logP), squared partition coefficient (logP2), molar refractivity, shape index order 1, shape index order 2, zero-order valance connectivity index, first-order valance connectivity index, second-order valance connectivity index, and solvent accessible surface area (physico-chemical parameters data will be provided on request). We have also considered some indicator variables for QSAR model development. Those are IR (if the substituent at R is –(CH2)2C6H5 then the value of IR is 1 otherwise 0) and IR′ (if any substituent is present at R′ then the value of IR′ is 1 otherwise 0).

QSAR models development and validation

The above properties were fed manually into statistical software, STATISTICA version 6, and a correlation matrix was made to select the parameters having very less intercorrelation and maximum correlation with activity. This was followed by a stepwise multiple linear regression analysis to achieve best model. Statistical measures used were number of compounds in regression (n), correlation coefficient (R), squared correlation coefficient (R2), F-test (Fischer’s value) for statistical significance, standard error of estimation (SEE), cross validated correlation coefficient (Q2), and correlation matrix to show intercorrelation among the parameters.

Internal validation was carried out by leave one out (LOO) method using statistical software STATISTICA. Q2 was calculated using the following equation:

where yi is the activity for training set compounds, ym is the mean observed value, corresponding to the mean of the values for each cross-validation group, and ypred,i is the predicted activity for yi. The predictive ability of the selected model was also confirmed by external R2 and R2CVext.

where yi is the averaged value for the dependent variable for the training set.

Another term to check the external predictability of the selected model is r2m, which was proposed by Roy and Roy (2007)Citation19 and it was calculated by the following formula:

where r2 is squared correlation coefficient between observed and predicted values and r02 is squared correlation coefficient between observed and predicted values with intercept value set to zero. A value of r2m is greater than 0.5 may be taken as an indicator of good external predictability.

The robustness of a QSAR model was checked by Y–randomization test. In this technique, new QSAR models were developed by shuffling the dependent variable vector randomly and keeping the original independent variable as such. The new QSAR models are expected to have low R2 and Q2 values. If the opposite happens, then an acceptable QSAR model cannot be obtained for the specific modelling method and data.

Results

The QSAR studies of 4-hydroxy-5,6-dihydropyran-2-one series resulted in several QSAR equations. Equation 1 is the best equation when one parameter was considered.

1

n = 50, R = 0.750, R2 = 0.562, R2adj = 0.553, SEE = 0.466, F = 18.95, p < 0.001, Q2 = 0.482.

The above equation is statistically not a significant one. The R2 and internal predictivity of the model is poor. Equation 2 is the best equation when two parameters were considered.

2

n = 50, R = 0.762, R2 = 0.582, R2adj = 0.564, SEE = 0.460, F = 32.69, p < 0.001, Q2 = 0.483.

The above equation is also statistically not a significant one. The R2 and internal predictivity of the model is poor. Equation 3 is the best equation when three parameters were considered.

3

n = 50, R = 0.790, R2 = 0.624, R2adj = 0.599, SEE = 0.441, F = 25.42, p < 0.001, Q2 = 0.521.

The above equation is statistically significant. The R2 and internal predictivity of the model is good. Equation 4 is the best equation when four parameters were considered.

4

n = 50, R = 0.812, R2 = 0.660, R2adj = 0.629, SEE = 0.424, F = 21.84, p < 0.001, Q2 = 0.599.

When we considered five parameters for developing model, there was no significant improvement in R2 and Q2. Equations 3 and 4 were selected as the best, significant model on the basis of high Q2 values and R2 values. The values given in the parentheses are 95% confidence intervals of the regression coefficients. The residual of observed and calculated activity of compound number 50 and 57 was found two times larger than the standard deviation of Equations 3 and 4. Hence they were removed as outliers; we rebuilt the models with 48 compounds and got the following equations (Equations 5 and 6):

5

n = 48, R = 0.875, R2 = 0.765, R2adj = 0.749, SEE = 0.309, F = 47.86, p < 0.001, Q2 = 0.707, Standard deviation of Predicted residual sum of square (SPRESS) = 0.337, standard deviation of error of predictions (SDEP) = 0.335, r2m = 0.245, optimum logP (logP0) = 4.723.

Equation 5 could explain 76.5% and predict 74.9% of the variance of the HIV-1 protease inhibitory activity data. The calculated HIV-1 protease inhibitory activity values by Equation 5 are given in . There was no intercorrelation between the descriptors (). This model showed good correlation coefficient (R) of 0.875 between descriptors (logP, logP2, and IR) and HIV-1 protease inhibitory activity. This model also indicates statistical significance > 99.9% with F-value F(3,44) = 47.86.

Table 2.   Calculated and predicted (LOO) activities of training and test set using equation 5 and 6.

Table 3.  Correlation matrix of physico-chemical descriptors and HIV protease inhibitory activity.

6

n = 48, R = 0.902, R2 = 0.814, R2adj = 0.796, SEE = 0.278, F = 46.90, p < 0.001, Q2 = 0.763, SPRESS = 0.303, SDEP = 0.301, r2m = 0.196.

Equation 6 could explain 81.4% and predict 79.6% of the variance of the HIV-1 protease inhibitory activity data. The calculated HIV-1 protease inhibitory activity values by Equation 6 are given in . This model showed good correlation coefficient (R) of 0.902 between descriptors (logP, logP2, IR, and IR′) and HIV-1 protease inhibitory activity. This model also indicates statistical significance > 99.9% with F-value F(4,43) = 46.90.

Both the models showed good internal predictivity. The predictive ability of the selected models was also confirmed by external r2CVext method. The proposed QSAR model is predictive as it satisfies the conditions r2CVext > 0.5 and R2Pred > 0.620. Both the models satisfied these conditions (Equation 5: r2CVext = 0.624 and R2Pred = 0.843, Equation 6: r2CVext = 0.555 and R2Pred = 0.694). The robustness of this model was checked by Y–randomization test (maximum R2 value is 0.210 and maximum Q2 is 0.080). The low R2 and Q2 values indicate that the good results in our original model are not due to a chance correlation or structural dependency of the training set. But both the models failed to satisfy the condition that r2m should be > 0.5.

Discussion

Both Equations 5 and 6 showed statistical significance and good internal predictivity but Equation 5 showed better external predictivity than Equation 6. Hence the Equation 5 was selected as the best QSAR model to predict the HIV-1 protease inhibitory activity of 4-hydroxy-5,6-dihydropyran-2-one derivatives.

The QSAR shows a parabolic relationship of HIV-1 protease inhibitory activity with the logP. It is interesting to note that the optimum logP value in this case is 4.723. This value lies in the (4.49–6.96) range determined by Garg and PatelCitation5. Negative sign of logP2 indicates that highly hydrophobic groups are not good for improving the activity of the series. The positive coefficient of IR showed that the presence of –(CH2)2C6H5 group at sixth position of dihydropyranone nucleus is essential for better activity.

The proposed model, due to the good internal and external predictive ability, can therefore act as a useful aid to the costly and time consuming experiments for determining the molar concentration of compounds required to achieve 50% inhibition in HIV-1 protease activity. Our results lead to the conclusion that the HIV-1 protease inhibitory activity of 4-hydroxy-5,6-dihydropyran-2-ones can be increased if substituents that bring about changes in the molecules as mentioned above are attached to it.

Acknowledgements

One of the authors V. Ravichandran is thankful to AICTE, New Delhi for providing (QIP) Senior Research Fellowship. The authors are thankful to the reviewers for their valuable suggestions to improve the quality of this paper.

Declaration of interest

The authors report no conflict of interest.

References

  • Clercq ED. Toward improved anti-HIV chemotherapy: therapeutic strategies for intervention with HIV infections. J Med Chem 1995;38:2491–17.
  • Milton J, Slater MJ, Bird AJ, Spinks D, Scott G, Price CE, Downing S, Green DV, Madar S, Bethell R, Stammers DK. Biaryl acids: novel non-nucleoside inhibitors of HIV reverse transcriptase types 1 and 2. Bioorg Med Chem Lett 1998;8:2623–28.
  • Pungpo P, Hannongbua S. Three-dimensional quantitative structure-activity relationships study on HIV-1 reverse transcriptase inhibitors in the class of dipyridodiazepinone derivatives, using comparative molecular field analysis. J Mol Graph Model 2000;18:581–90, 601.
  • Nair AC, Jayatilleke P, Wang X, Miertus S, Welsh WJ. Computational studies on tetrahydropyrimidine-2-one HIV-1 protease inhibitors: improving three-dimensional quantitative structure-activity relationship comparative molecular field analysis models by inclusion of calculated inhibitor-and receptor-based properties. J Med Chem 2002;45:973–83.
  • Kumar S, Jacob RR, Tiwari M. 3D-QSAR study of some 5,6-dihydropyran-2-ones as protease inhibitors. Indian J Pharm Sci 2005;67:30–36.
  • Garg R, Patel D. Hydrophobicity in the design of P2/P2′ tetrahydropyrimidinone HIV protease inhibitors. Bioorg Med Chem Lett 2005;15:3767–70.
  • Bhhatarai B, Garg R. From SAR to comparative QSAR: role of hydrophobicity in the design of 4-hydroxy-5,6-dihydropyran-2-ones HIV-1 protease inhibitors. Bioorg Med Chem 2005;13:4078–84.
  • Buolamwini JK, Assefa H. CoMFA and CoMSIA 3D QSAR and docking studies on conformationally-restrained cinnamoyl HIV-1 integrase inhibitors: exploration of a binding mode at the active site. J Med Chem 2002;45:841–52.
  • Raghavan K, Buolamwini JK, Fesen MR, Pommier Y, Kohn KW, Weinstein JN. Three-dimensional quantitative structure-activity relationship (QSAR) of HIV integrase inhibitors: a comparative molecular field analysis (CoMFA) study. J Med Chem 1995;38:890–7.
  • Ravichandran V, Agrawal RK. Predicting anti-HIV activity of PETT derivatives: CoMFA approach. Bioorg Med Chem Lett 2007;17:2197–202.
  • Ravichandran V, Jain PK, Mourya VK, Agrawal RK. QSAR study on some arylsulfonamides as anti-HIV agents. Med Chem Res 2007;16:7–9.
  • Ravichandran V, Mourya VK, Agrawal RK. QSAR study of novel 1, 1, 3-trioxo [1, 2, 4]-thiadiazine (TTDs) analogues as potent anti-HIV agents. Arkivoc 2007;XIV:204–12.
  • Ravichandran V, Mourya VK, Agrawal RK. QSAR prediction of HIV–1 reverse transcriptase inhibitory activity of benzoxazinone derivatives. Internet Electron J Mol Des 2007;6:363–74.
  • Ravichandran V, Prashanthakumar BR, Sankar S, Agrawal RK. Comparative molecular similarity indices analysis for predicting anti-HIV activity of phenyl ethyl thiourea (PET) derivatives. Med Chem Res 2008;17:1–11.
  • Sahu KK, Ravichandran V, Jain PK, Sharma S, Mourya VK, Agrawal RK. QSAR analysis of chicoric acid derivatives as HIV–1 integrase inhibitors. Acta Chimi Slov 2008;55:138–45.
  • Sahu KK, Ravichandran V, Mourya VK, Agrawal RK. QSAR analysis of caffeoyl naphthalene sulphonamide derivatives as HIV-1 Integrase inhibitors. Med Chem Res 2007;15:418–30.
  • Tait BD, Hagen S, Domagala J, Ellsworth EL, Gajda C, Hamilton HW, Prasad JV, Ferguson D, Graham N, Hupe D, Nouhan C, Tummino PJ, Humblet C, Lunney EA, Pavlovsky A, Rubin J, Gracheck SJ, Baldwin ET, Bhat TN, Erickson JW, Gulnik SV, Liu B. 4-hydroxy-5,6-dihydropyrones. 2. Potent non-peptide inhibitors of HIV protease. J Med Chem 1997;40:3781–3792.
  • Roy K. On some aspects of validations of predictive QSAR models. Expert Opin Drug Discov 2007;2:1567–77.
  • Roy PP, Roy K. On some aspects of variable selection for partial least squares regression models. QSAR Comb Sci 2007;27:302–13.
  • Tropsha A, Gramatica P, Gombar VK. The importance of being earnest: validation is the absolute essential for successful application and interpretation of QSPR models. Quant Struct Act Rel 2003;22:1–9.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.