
Consistency of QSAR models: Correct split of training and test sets, ranking of models and performance parameters

Pages 683-700 | Received 17 Jun 2015, Accepted 16 Aug 2015, Published online: 05 Oct 2015


Read on this site (7)

D. Kovács, P. Király & G. Tóth. (2021) Sample-size dependence of validation parameters in linear regression models and in QSAR. SAR and QSAR in Environmental Research 32:4, pages 247-268.
Micael Karlberg, Moritz von Stosch & Jarka Glassey. (2018) Exploiting mAb structure characteristics for a directed QbD implementation in early process development. Critical Reviews in Biotechnology 38:6, pages 957-970.
P. De & K. Roy. (2018) Greener chemicals for the future: QSAR modelling of the PBT index using ETA descriptors. SAR and QSAR in Environmental Research 29:4, pages 319-337.
R. Sheikhpour, M. A. Sarram, M. Rezaeian & E. Sheikhpour. (2018) QSAR modelling using combined simple competitive learning networks and RBF neural networks. SAR and QSAR in Environmental Research 29:4, pages 257-276.
Jian-Bo Tong, Yuan Liu, Xue-chun Xiao, Peng Gao & Hai-yin Xu. Exploration of anti-tumour inhibitors from colchicine derivatives based on 3D-QSAR, molecular docking and molecular dynamics simulations. Molecular Simulation 0:0, pages 1-19.

Articles from other publishers (72)

Prachi P. Parvatikar, Sudha Patil, Kedar Khaparkhuntikar, Shruti Patil, Pankaj K. Singh, R. Sahana, Raghavendra V. Kulkarni & Anjanapura V. Raghu. (2023) Artificial intelligence: Machine learning approach for screening large database and drug discovery. Antiviral Research 220, pages 105740.
Har Lal Singh, Naveen Dhingra & Sunita Bhanuka. (2023) Synthesis, spectral, antibacterial and QSAR studies of tin and silicon complexes with Schiff base of amino acids. Journal of Molecular Structure 1287, pages 135670.
Izudin Redžepović & Boris Furtula. (2022) Chemical similarity of molecules with physiological response. Molecular Diversity 27:4, pages 1603-1612.
Amjed S. Al Fahoum, Ansam Omar Abu Al-Haija & Hussam A. Alshraideh. (2023) Identification of Coronary Artery Diseases Using Photoplethysmography Signals and Practical Feature Selection Process. Bioengineering 10:2, pages 249.
Vinay Kumar, Achintya Saha & Kunal Roy. (2023) Multi-target QSAR modeling for the identification of novel inhibitors against Alzheimer's disease. Chemometrics and Intelligent Laboratory Systems 233, pages 104734.
Kamal Tabti, Larbi Elmchichi, Abdelouahid Sbai, Hamid Maghat, Mohammed Bouachrine & Tahar Lakhlifi. (2022) HQSAR, CoMFA, CoMSIA Docking Studies and Simulation MD on Quinazolines/Quinolines Derivatives for DENV Virus Inhibitory Activity. Chemistry Africa 5:6, pages 1937-1958.
Péter Király, Ramóna Kiss, Dániel Kovács, Amine Ballaj & Gergely Tóth. (2022) The Relevance of Goodness‐of‐fit, Robustness and Prediction Validation Categories of OECD‐QSAR Principles with Respect to Sample Size and Model Type. Molecular Informatics 41:11.
Shadi Shayanfar & Ali Shayanfar. (2022) Comparison of various methods for validity evaluation of QSAR models. BMC Chemistry 16:1.
Jian-Bo Tong, Jie Wang, Ding Luo, Xue-Chun Xiao, Hai-Yin Xu, Shuai Bian & Xing Zhang. (2022) QSAR study, molecular docking, and ADMET prediction of vinyl sulfone-containing Nrf2 activator derivatives for treating Parkinson disease. Structural Chemistry 33:4, pages 1109-1131.
Attila Gere, Dorina Szakál & Károly Héberger. (2022) Multiobject Optimization of National Football League Drafts: Comparison of Teams and Experts. Applied Sciences 12:13, pages 6303.
Álmos Orosz, Károly Héberger & Anita Rácz. (2022) Comparison of Descriptor- and Fingerprint Sets in Machine Learning Models for ADME-Tox Targets. Frontiers in Chemistry 10.
Timothy B. Dunn, Gustavo M. Seabra, Taewon David Kim, K. Eurídice Juárez-Mercado, Chenglong Li, José L. Medina-Franco & Ramón Alain Miranda-Quintana. (2021) Diversity and Chemical Library Networks of Large Data Sets. Journal of Chemical Information and Modeling 62:9, pages 2186-2201.
Jian-Bo Tong, Shuai Bian, Xing Zhang & Ding Luo. (2021) QSAR analysis of 3-pyrimidin-4-yl-oxazolidin-2-one derivatives isocitrate dehydrogenase inhibitors using Topomer CoMFA and HQSAR methods. Molecular Diversity 26:2, pages 1017-1037.
Shiyu Liu, Shutao Wang, Chunhai Hu, Xiaoyang Qin, Junzhu Wang & Deming Kong. (2022) Development of a new NIR-machine learning approach for simultaneous detection of diesel various properties. Measurement 187, pages 110293.
Dóra Judit Kiss, Gáspár Pándy-Szekeres & György Miklós Keserű. (2022) Comprehensive Pharmacology, pages 84-114.
Ádám Ipkovich, Károly Héberger & János Abonyi. (2021) Comprehensible Visualization of Multidimensional Data: Sum of Ranking Differences-Based Parallel Coordinates. Mathematics 9:24, pages 3203.
Neda Shakour, Farzin Hadizadeh, Prashant Kesharwani & Amirhossein Sahebkar. (2021) 3D-QSAR Studies of 1,2,4-Oxadiazole Derivatives as Sortase A Inhibitors. BioMed Research International 2021, pages 1-10.
Ramón Alain Miranda-Quintana, Dávid Bajusz, Anita Rácz & Károly Héberger. (2021) Extended similarity indices: the benefits of comparing more than two objects simultaneously. Part 1: Theory and characteristics. Journal of Cheminformatics 13:1.
Oana-Constantina Margin, Eva-Henrietta Dulf, Teodora Mocan & Lucian Mocan. (2021) Modelling in Synthesis and Optimization of Active Vaccinal Components. Nanomaterials 11:11, pages 3001.
Jian-Bo Tong, Ding Luo, Hai-Yin Xu, Shuai Bian, Xing Zhang, Xue-Chun Xiao & Jie Wang. (2021) A computational approach for designing novel SARS-CoV-2 Mpro inhibitors: combined QSAR, molecular docking, and molecular dynamics simulation techniques. New Journal of Chemistry 45:26, pages 11512-11529.
Rahman Abdizadeh, Esfandiar Heidarian, Farzin Hadizadeh & Tooba Abdizadeh. (2021) QSAR Modeling, Molecular Docking and Molecular Dynamics Simulations Studies of Lysine-Specific Demethylase 1 (LSD1) Inhibitors as Anticancer Agents. Anti-Cancer Agents in Medicinal Chemistry 21:8, pages 987-1018.
Cyrus Ahmadi Toussi, Javad Haddadnia & Chérif F. Matta. (2020) Drug design by machine-trained elastic networks: predicting Ser/Thr-protein kinase inhibitors’ activities. Molecular Diversity 25:2, pages 899-909.
Peng Zhou, Qian Liu, Ting Wu, Qingqing Miao, Shuyong Shang, Heyi Wang, Zheng Chen, Shaozhou Wang & Heyan Wang. (2021) Systematic Comparison and Comprehensive Evaluation of 80 Amino Acid Descriptors in Peptide QSAR Modeling. Journal of Chemical Information and Modeling 61:4, pages 1718-1731.
Rahman Abdizadeh, Keihan Ghatreh-Samani, Farzin Hadizadeh & Tooba Abdizadeh. (2021) Structural investigation of isatin-based benzenesulfonamides as carbonic anhydrase isoform IX inhibitors endowed with anticancer activity using molecular modeling approaches. Journal of Molecular Structure 1229, pages 129735.
Khalid M. Kahloot & Peter Ekler. (2021) Algorithmic Splitting: A Method for Dataset Preparation. IEEE Access 9, pages 125229-125237.
Om Silakari & Pankaj Kumar Singh. (2021) Concepts and Experimental Protocols of Modelling and Informatics in Drug Design, pages 29-63.
Attila Gere, Anita Rácz, Dávid Bajusz & Károly Héberger. (2020) Multicriteria decision making for evergreen problems in food science by sum of ranking differences. Food Chemistry, pages 128617.
David Mary Rajathei, Subbiah Parthasarathy & Samuel Selvaraj. (2020) Combined QSAR Model and Chemical Similarity Search for Novel HMG-CoA Reductase Inhibitors for Coronary Heart Disease. Current Computer-Aided Drug Design 16:4, pages 473-485.
Gergely Tóth, Péter Király & Dániel Kovács. (2020) Effect of variable allocation on validation and optimality parameters and on cross-optimization perspectives. Chemometrics and Intelligent Laboratory Systems 204, pages 104106.
Rahman Abdizadeh, Esfandiar Heidarian, Farzin Hadizadeh & Tooba Abdizadeh. (2020) Investigation of pyrimidine analogues as xanthine oxidase inhibitors to treat of hyperuricemia and gout through combined QSAR techniques, molecular docking and molecular dynamics simulations. Journal of the Taiwan Institute of Chemical Engineers 113, pages 72-100.
Jose R. Mora, Yovani Marrero-Ponce, César R. García-Jacas & Amileth Suarez Causado. (2020) Ensemble Models Based on QuBiLS-MAS Features and Shallow Learning for the Prediction of Drug-Induced Liver Toxicity: Improving Deep Learning and Traditional Approaches. Chemical Research in Toxicology 33:7, pages 1855-1873.
Paola Gramatica. (2020) Principles of QSAR Modeling. International Journal of Quantitative Structure-Property Relationships 5:3, pages 61-97.
Qingxiu Hao, Jie Zhou, Li Zhou, Liping Kang, Tiegui Nan, Yi Yu & Lanping Guo. (2020) Prediction the contents of fructose, glucose, sucrose, fructo-oligosaccharides and iridoid glycosides in Morinda officinalis radix using near-infrared spectroscopy. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy 234, pages 118275.
Rahman Abdizadeh, Farzin Hadizadeh & Tooba Abdizadeh. (2020) In silico studies of novel scaffold of thiazolidin-4-one derivatives as anti-Toxoplasma gondii agents by 2D/3D-QSAR, molecular docking, and molecular dynamics simulations. Structural Chemistry 31:3, pages 1149-1182.
Razieh Sheikhpour, Sajjad Gharaghani & Elmira Nazarshodeh. (2020) Sparse feature selection in multi-target modeling of carbonic anhydrase isoforms by exploiting shared information among multiple targets. Chemometrics and Intelligent Laboratory Systems 200, pages 104000.
Nasser Jalili‐Jahani & Azadeh Fatehi. (2020) Multivariate image analysis–quantitative structure‐retention relationship study of polychlorinated biphenyls using partial least squares and radial basis function neural networks. Journal of Separation Science 43:8, pages 1479-1488.
Pushyaraga P. Venugopal, Bratin K. Das, E. Soorya & Debashree Chakraborty. (2019) Effect of hydrophobic and hydrogen bonding interactions on the potency of ß‐alanine analogs of G‐protein coupled glucagon receptor inhibitors. Proteins: Structure, Function, and Bioinformatics 88:2, pages 327-344.
Maria Cristina A. Costa, Pedro O. Mariz Carvalho & Márcia M. C. Ferreira. (2020) Four‐dimensional quantitative structure‐activity analysis of 1,4‐naphthoquinone derivatives tested against HL‐60 human promyelocytic leukemia cells. Journal of Chemometrics 34:2.
Rahman Abdizadeh, Farzin Hadizadeh & Tooba Abdizadeh. (2020) QSAR analysis of coumarin-based benzamides as histone deacetylase inhibitors using CoMFA, CoMSIA and HQSAR methods. Journal of Molecular Structure 1199, pages 126961.
Eric J. Martin, Valery R. Polyakov, Xiang-Wei Zhu, Li Tian, Prasenjit Mukherjee & Xin Liu. (2019) All-Assay-Max2 pQSAR: Activity Predictions as Accurate as Four-Concentration IC50s for 8558 Novartis Assays. Journal of Chemical Information and Modeling 59:10, pages 4450-4459.
Hiromasa Kaneko. (2019) Estimation of predictive performance for test data in applicability domains using y‐randomization. Journal of Chemometrics 33:9.
Anita Rácz, Dávid Bajusz & Károly Héberger. (2019) Multi-Level Comparison of Machine Learning Classifiers and Their Performance Metrics. Molecules 24:15, pages 2811.
Anita Rácz, Dávid Bajusz & Károly Héberger. (2019) Intercorrelation Limits in Molecular Descriptor Preselection for QSAR/QSPR. Molecular Informatics 38:8-9, pages 1800154.
David M. Rajathei, Subbiah Parthasarathy & Samuel Selvaraj. (2019) QSAR Analysis of Multimodal Antidepressants Vortioxetine Analogs Using Physicochemical Descriptors and MLR Modeling. Current Computer-Aided Drug Design 15:4, pages 294-307.
Milica Ž. Karadžić Banjac, Strahinja Z. Kovačević, Lidija R. Jevrić, Sanja O. Podunavac-Kuzmanović & Anamarija I. Mandić. (2019) On the characterization of novel biologically active steroids: Selection of lipophilicity models of newly synthesized steroidal derivatives by classical and non-parametric ranking approaches. Computational Biology and Chemistry 80, pages 23-30.
Attila Gere, Dalma Radványi & Károly Héberger. (2019) Which insect species can best be proposed for human consumption? Innovative Food Science & Emerging Technologies 52, pages 358-367.
Viviana Consonni, Roberto Todeschini, Davide Ballabio & Francesca Grisoni. (2019) On the Misleading Use of QF32 for QSAR Model Comparison. Molecular Informatics 38:1-2, pages 1800029.
Lu Xu, Ou Hu, Yuwan Guo, Mengqin Zhang, Daowang Lu, Chen-Bo Cai, Shunping Xie, Mohammad Goodarzi, Hai-Yan Fu & Yuan-Bin She. (2018) Representative splitting cross validation. Chemometrics and Intelligent Laboratory Systems 183, pages 29-35.
Razieh Sheikhpour, Mehdi Agha Sarram & Elnaz Sheikhpour. (2018) Semi-supervised sparse feature selection via graph Laplacian based scatter matrix for regression problems. Information Sciences 468, pages 14-28.
Tayebeh Baghgoli, Mehdi Mousavi & Behnam Mohseni Bababdani. (2018) Descriptor selection evaluation of binary gravitational search algorithm in quantitative structure-activity relationship studies of benzyl phenyl ether diamidine's antiprotozoal activity and Chalcone's anticancer potency. Chemometrics and Intelligent Laboratory Systems 182, pages 31-40.
Filip Lj. Andrić. (2018) Towards polypotent natural products: The Derringer desirability approach and nonparametric ranking for multicriteria evaluation of essential oils. Journal of Chemometrics 32:10, pages e3050.
Joao Miguel Lourenco & Luiz Lebensztajn. (2018) Post-Pareto Optimality Analysis With Sum of Ranking Differences. IEEE Transactions on Magnetics 54:8, pages 1-10.
Anita Rácz, Marietta Fodor & Károly Héberger. (2018) Development and comparison of regression models for the determination of quality parameters in margarine spread samples using NIR spectroscopy. Analytical Methods 10:25, pages 3089-3099.
Anita Rácz, Dávid Bajusz & Károly Héberger. (2018) Applied Chemoinformatics, pages 471-499.
Yabin Wen, Mohammad Talebi, Ruth I.J. Amos, Roman Szucs, John W. Dolan, Christopher A. Pohl & Paul R. Haddad. (2018) Retention prediction in reversed phase high performance liquid chromatography using quantitative structure-retention relationships applied to the Hydrophobic Subtraction Model. Journal of Chromatography A 1541, pages 1-11.
Maryam Taraji, Paul R. Haddad, Ruth I.J. Amos, Mohammad Talebi, Roman Szucs, John W. Dolan & Christopher A. Pohl. (2018) Chemometric-assisted method development in hydrophilic interaction liquid chromatography: A review. Analytica Chimica Acta 1000, pages 20-40.
Mohammad Amin Valizade Hasanloei, Razieh Sheikhpour, Mehdi Agha Sarram, Elnaz Sheikhpour & Hamdollah Sharifi. (2017) A combined Fisher and Laplacian score for feature selection in QSAR based drug design using compounds with known and unknown activities. Journal of Computer-Aided Molecular Design 32:2, pages 375-384.
Anita Rácz, Attila Gere, Dávid Bajusz & Károly Héberger. (2018) Is soft independent modeling of class analogies a reasonable choice for supervised pattern recognition? RSC Advances 8:1, pages 10-21.
Soo Hyun Park, Mohammad Talebi, Ruth I.J. Amos, Eva Tyteca, Paul R. Haddad, Roman Szucs, Christopher A. Pohl & John W. Dolan. (2017) Towards a chromatographic similarity index to establish localised quantitative structure-retention relationships for retention prediction. II Use of Tanimoto similarity index in ion chromatography. Journal of Chromatography A 1523, pages 173-182.
Soo Hyun Park, Paul R. Haddad, Ruth I.J. Amos, Mohammad Talebi, Roman Szucs, Christopher A. Pohl & John W. Dolan. (2017) Towards a chromatographic similarity index to establish localised Quantitative Structure-Retention Relationships for retention prediction. III Combination of Tanimoto similarity index, log P, and retention factor ratio to identify optimal analyte training sets for ion chromatography. Journal of Chromatography A 1520, pages 107-116.
Razieh Sheikhpour, Mehdi Agha Sarram, Sajjad Gharaghani & Mohammad Ali Zare Chahooki. (2017) Feature selection based on graph Laplacian by using compounds with known and unknown activities. Journal of Chemometrics 31:8, pages e2899.
Maryam Taraji, Paul R. Haddad, Ruth I.J. Amos, Mohammad Talebi, Roman Szucs, John W. Dolan & Christopher A. Pohl. (2017) Use of dual-filtering to create training sets leading to improved accuracy in quantitative structure-retention relationships modelling for hydrophilic interaction liquid chromatographic systems. Journal of Chromatography A 1507, pages 53-62.
Kunal Roy, Pravin Ambure & Rahul B. Aher. (2017) How important is to detect systematic error in predictions and understand statistical applicability domain of QSAR models? Chemometrics and Intelligent Laboratory Systems 162, pages 44-54.
Attila Gere, László Sipos, Sándor Kovács, Zoltán Kókai & Károly Héberger. (2017) Which just-about-right feature should be changed if evaluations deviate? A case study using sum of ranking differences. Chemometrics and Intelligent Laboratory Systems 161, pages 130-135.
Paul G. Tratnyek, Eric J. Bylaska & Eric J. Weber. (2017) In silico environmental chemical science: properties and processes from statistical and computational modelling. Environmental Science: Processes & Impacts 19:3, pages 188-202.
D. Bajusz, A. Rácz & K. Héberger. (2017) Comprehensive Medicinal Chemistry III, pages 329-378.
Károly Héberger, Anita Rácz & Dávid Bajusz. (2017) Advances in QSAR Modeling, pages 89-104.
Watshara Shoombuatong, Philip Prathipati, Wiwat Owasirikul, Apilak Worachartcheewan, Saw Simeon, Nuttapat Anuwongcharoen, Jarl E. S. Wikberg & Chanin Nantasenamat. (2017) Advances in QSAR Modeling, pages 3-55.
Anita Rácz, Károly Héberger & Marietta Fodor. (2016) Quantitative determination and classification of energy drinks using near-infrared spectroscopy. Analytical and Bioanalytical Chemistry 408:23, pages 6403-6411.
Filip Andrić, Dávid Bajusz, Anita Rácz, Sandra Šegan & Károly Héberger. (2016) Multivariate assessment of lipophilicity scales—computational and reversed phase thin-layer chromatographic indices. Journal of Pharmaceutical and Biomedical Analysis 127, pages 81-93.
Filip Andrić, Sandra Šegan, Aleksandra Dramićanin, Helena Majstorović & Dušanka Milojković-Opsenica. (2016) Linear modeling of the soil-water partition coefficient normalized to organic carbon content by reversed-phase thin-layer chromatography. Journal of Chromatography A 1458, pages 136-144.
Alister Tencate, John H. Kalivas & Erik Andries. (2016) Penalty processes for combining roughness and smoothness in spectral multivariate calibration. Journal of Chemometrics 30:4, pages 144-152.
