112
Views
19
CrossRef citations to date
0
Altmetric
Original Articles

A binary QSAR model for classifying neuraminidase inhibitors of influenza A viruses (H1N1) using the combined minimum redundancy maximum relevancy criterion with the sparse support vector machine

, ORCID Icon &
Pages 517-527 | Received 20 Apr 2018, Published online: 23 Jul 2018

References

  • Y. Cong, B.-K. Li, X.-G. Yang, Y. Xue, Y.-Z. Chen, and Y. Zeng, Quantitative structure–activity relationship study of influenza virus neuraminidase A/PR/8/34 (H1N1) inhibitors by genetic algorithm feature selection and support vector regression, Chemom. Intell. Lab. Syst. 127 (2013), pp. 35–42.
  • C.U. Kim, W. Lew, M.A. Williams, H. Liu, L. Zhang, S. Swaminathan, N. Bischofberger, M.S. Chen, D.B. Mendel, C.Y. Tai, W.G. Laver, and R.C. Stevens, Influenza neuraminidase inhibitors possessing a novel hydrophobic interaction in the enzyme active site: Design, synthesis, and structural analysis of carbocyclic sialic acid analogues with potent anti-influenza activity, J. Am. Chem. Soc. 119 (1997), pp. 681–690.
  • P. Chand, Y.S. Babu, S. Bantia, S. Rowland, A. Dehghani, P.L. Kotian, T.L. Hutchison, S. Ali, W. Brouillette, Y. El-Kattan, and T.-H. Lin, Syntheses and neuraminidase inhibitory activity of multisubstituted cyclopentane amide derivatives, J. Med. Chem. 47 (2004), pp. 1919–1929.
  • T.-T. Dao, B.-T. Tung, P.-H. Nguyen, P.-T. Thuong, S.-S. Yoo, E.-H. Kim, S.-K. Kim, and W.-K. Oh, C-Methylated flavonoids from cleistocalyx operculatus and their inhibitory effects on novel influenza A (H1N1) neuraminidase, J. Nat. Prod. 73 (2010), pp. 1636–1642.
  • U. Grienke, M. Schmidtke, J. Kirchmair, K. Pfarr, P. Wutzler, R. Durrwald, G. Wolber, K.R. Liedl, H. Stuppner, and J.M. Rollinger, Antiviral potential and molecular insight into neuraminidase inhibiting diarylheptanoids from Alpinia katsumadai, J. Med. Chem. 53 (2010), pp. 778–786.
  • N. Minovski, S. Zuperl, V. Drgan, and M. Novic, Assessment of applicability domain for multivariate counter-propagation artificial neural network predictive models by minimum euclidean distance space analysis: A case study, Anal. Chim. Acta. 759 (2013), pp. 28–42.
  • A.M. Al-Fakih, Z.Y. Algamal, M.H. Lee, H.H. Abdallah, H. Maarof, and M. Aziz, Quantitative structure–activity relationship model for prediction study of corrosion inhibition efficiency using two-stage sparse multiple linear regression, J. Chemom. 30 (2016), pp. 361–368.
  • A.M. Al-Fakih, M. Aziz, H.H. Abdallah, Z.Y. Algamal, M.H. Lee, and H. Maarof, High dimensional QSAR study of mild steel corrosion inhibition in acidic medium by furan derivatives, Int. J. Electrochem. Sci. 10 (2015), pp. 3568–3583.
  • Z.Y. Algamal and M.H. Lee, A new adaptive L1-norm for optimal descriptor selection of high-dimensional QSAR classification model for anti-hepatitis C virus activity of thiourea derivatives, SAR QSAR Environ. Res. 28 (2017), pp. 75–90.
  • Z.Y. Algamal, M.H. Lee, and A.M. Al-Fakih, High-dimensional quantitative structure–activity relationship modeling of influenza neuraminidase a/PR/8/34 (H1N1) inhibitors based on a two-stage adaptive penalized rank regression, J. Chemom. 30 (2016), pp. 50–57.
  • Z.Y. Algamal, M.H. Lee, A.M. Al-Fakih, and M. Aziz, High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm, SAR QSAR Environ. Res. 27 (2016), pp. 703–719.
  • B. Palacios-Bejarano, G. Cerruela Garcia, I. Luque Ruiz, and M.A. Gomez-Nieto, QSAR model based on weighted MCS trees approach for the representation of molecule data sets, J. Comput. Aided Mol. Des. 27 (2013), pp. 185–201.
  • Y. Qin, H. Deng, H. Yan, and R. Zhong, An accurate nonlinear QSAR model for the antitumor activities of chloroethylnitrosoureas using neural networks, J. Mol. Graph. Model. 29 (2011), pp. 826–833.
  • F. Sahigara, D. Ballabio, R. Todeschini, and V. Consonni, Defining a novel k-nearest neighbours approach to assess the applicability domain of a QSAR model for reliable predictions, J. Cheminf. 5 (2013), pp. 1–9.
  • W.-X. Wang, H. Si, and Z. Zhang, Quantitative structure–activity relationship study on antitumour activity of a series of flavonoids, Molec. Simul. 38 (2012), pp. 38–44.
  • E. Pourbasheer, R. Aalizadeh, T.S. Shokouhi, M.R. Ganjali, P. Norouzi, and J. Shadmanesh, 2D and 3D quantitative structure–activity relationship study of hepatitis C virus NS5B polymerase inhibitors by comparative molecular field analysis and comparative molecular similarity indices analysis methods, J. Chem. Inf. Model. 54 (2014), pp. 2902–2914.
  • V. Ruusmann, S. Sild, and U. Maran, QSAR DataBank – an approach for the digital organization and archiving of QSAR model information, J. Cheminf. 6 (2014), pp. 1–17.
  • K. Zhang, J.M. Hughes-Oliver, and S.S. Young, Analysis of high-dimensional structure–activity screening datasets using the optimal bit string tree, Technometrics 55 (2013), pp. 161–173.
  • Z.Y. Algamal, M.H. Lee, A.M. Al-Fakih, and M. Aziz, High-dimensional QSAR prediction of anticancer potency of imidazo[4,5-b]pyridine derivatives using adjusted adaptive LASSO, J. Chemom. 29 (2015), pp. 547–556.
  • A. Tropsha, Best practices for QSAR model development, validation, and exploitation, Mol. Inform. 29 (2010), pp. 476–488.
  • Y. Li, Y. Kong, M. Zhang, A. Yan, and Z. Liu, Using support vector machine (SVM) for classification of selectivity of H1N1 neuraminidase inhibitors, Mol. Inform. 35 (2016), pp. 116–124.
  • R. Tibshirani, Regression shrinkage and selection via the lasso, J. R. Stat. Soc. Series B, 58 (1996), pp. 267–288.
  • J. Fan and R. Li, Variable selection via nonconcave penalized likelihood and its oracle properties, J. Am. Stat. Assoc. 96 (2001), pp. 1348–1360.
  • H. Zou, and T. Hastie, Regularization and variable selection via the elastic net, J. R. Stat. Soc. Series B 67 (2005), pp. 301–320.
  • H. Zou, The adaptive lasso and its oracle properties, J. Am. Stat. Assoc. 101 (2006), pp. 1418–1429.
  • V. Vapnik, The Nature of Statistical Learning Theory, Springer Science & Business Media, New York, 1995.
  • S. Korkmaz, G. Zararsiz, and D. Goksuluk, Drug/nondrug classification using Support Vector Machines with various feature selection strategies, Comput. Meth. Prog. Biomed. 117 (2014), pp. 51–60.
  • N. Becker, G. Toedt, P. Lichter, and A. Benner, Elastic SCAD as a novel penalization method for SVM classification tasks in high-dimensional data, BMC Bioinform. 12 (2011), pp. 138–151.
  • Y. Liu, H.H. Zhang, C. Park, and J. Ahn, Support vector machines with adaptive Lq penalty, Comput. Stat. Data. Anal. 51 (2007), pp. 6380–6394.
  • C. Park, K.-R. Kim, R. Myung, and J.-Y. Koo, Oracle properties of SCAD-penalized support vector machine, J. Stat. Plan. Infer. 142 (2012), pp. 2257–2270.
  • H.H. Zhang, J. Ahn, X. Lin, and C. Park, Gene selection using support vector machines with non-convex penalty, Bioinformatics, 22 (2006), pp. 88–95.
  • J. Zhu, S. Rosset, T. Hastie, and R. Tibshirani, 1-norm support vector machines, Adv. Neural Inf. Proc. Syst. 16 (2004), pp. 49–56.
  • Z. Liu, S. Lin, and M.T. Tan, Sparse support vector machines with Lp penalty for biomarker identification, IEEE/ACM Trans. Comput. Biol. Bioinform. 7 (2010), pp. 100–107.
  • L. Wang, J. Zhu, and H. Zou, Hybrid huberized support vector machines for microarray classification and gene selection, Bioinformatics, 24 (2008), pp. 412–419.
  • S. Maldonado, R. Montoya, and J. López, Embedded heterogeneous feature selection for conjoint analysis: A SVM approach using L1 penalty, Appl. Intel. 4 (2016), pp. 775–787.
  • P.S. Bradley and O.L. Mangasarian, Feature selection via concave minimization and support vector machines, ICML 1998, pp. 82–90.
  • K. Ikeda and N. Murata, Geometrical properties of Nu support vector machines with different norms, Neur. Comput. 17 (2005), pp. 2508–2529.
  • H. Peng, F. Long, and C. Ding, Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy, IEEE Trans. Pattern Anal. Mach. Intel. 27 (2005), pp. 1226–1238.
  • N. Hoque, D.K. Bhattacharyya, and J.K. Kalita, MIFS-ND: A mutual information-based feature selection method, Expert Syst. Appl. 41 (2014), pp. 6371–6385.
  • G.M. Fung and O.L. Mangasarian, A feature selection newton method for support vector machine classification, Comput. Optim. Applic. 28 (2004), pp. 185–202.
  • C.U. Kim, W. Lew, M.A. Williams, H. Wu, L. Zhang, X. Chen, P.A. Escarpe, D.B. Mendel, W.G. Laver, and R.C. Stevens, Structure−activity relationship studies of novel carbocyclic influenza neuraminidase inhibitors, J. Med. Chem. 41 (1998), pp. 2451–2460.
  • J. Sun, S. Cai, N. Yan, and H. Mei, Docking and 3D-QSAR studies of influenza neuraminidase inhibitors using three-dimensional holographic vector of atomic interaction field analysis, Eur. J. Med. Chem. 45 (2010), pp. 1008–1014.
  • R. Todeschini, V. Consonni, A. Mauri, and M. Pavan, Dragon 6, 2012.
  • Z.Y. Algamal and M.H. Lee, Applying penalized binary logistic regression with correlation based elastic net for variables selection, J. Mod. Appl. Stat. Meth. 14 (2015), pp. 168–179.
  • Z.Y. Algamal and M.H. Lee, High dimensional logistic regression model using adjusted elastic net penalty, Pak. J. Stat. Oper. Res. 11 (2015), pp. 667–676.
  • Z.Y. Algamal, M.H. Lee, A.M. Al-Fakih, and M. Aziz, High-dimensional QSAR classification model for anti-hepatitis C virus activity of thiourea derivatives based on the sparse logistic regression model with a bridge penalty, J. Chemom. 31 (2017), pp. e2889.
  • R 3.5, The R foundation for statistical computing, https://cran.r-project.org/web/packages/penalizedSVM/index.html
  • T. Nguyen, A. Khosravi, D. Creighton, and S. Nahavandi, Hidden Markov models for cancer classification using gene expression profiles, Inform. Sci. 316 (2015), pp. 293–307.
  • M. Reboiro-Jato, F. Díaz, D. Glez-Peña, and F. Fdez-Riverola, A novel ensemble of classifiers that use biological relevant gene sets for microarray classification, Appl. Soft. Comput. 17 (2014), pp. 117–126.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.