Filter feature selectors in the development of binary QSAR models

G. Cerruela García (corresponding author), Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, Albert Einstein Building, E-14071 Córdoba, Spain

J. Pérez-Parras Toledano, Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, Albert Einstein Building, E-14071 Córdoba, Spain

A. de Haro García, Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, Albert Einstein Building, E-14071 Córdoba, Spain

N. García-Pedrajas, Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, Albert Einstein Building, E-14071 Córdoba, Spain

ABSTRACT

The application of machine learning methods to the construction of quantitative structure–activity relationship models is a complex computational problem in which dimensionality reduction of the molecular structure representation plays a fundamental role in predicting a target activity. Feature selection as a pre-processing step has been reported to be effective for dimensionality reduction, yielding simpler and more understandable models. In this paper, a comparative performance study of 13 state-of-the-art filter methods for feature selection is conducted. Structure–activity relationship models are constructed using three widely used classifiers and a diverse collection of datasets. The study uses robust statistical tests to compare the algorithms. According to the experimental results, there are substantial differences in performance among the evaluated feature selection methods. The methods that exhibit the best performance are correlation-based feature selection, fast clustering-based feature selection and the set cover method.
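
For readers unfamiliar with the filter approach, the following is a minimal sketch of a univariate filter, not one of the 13 methods evaluated in the paper. It assumes a simple scoring rule (absolute Pearson correlation between each descriptor and the binary activity label) and keeps the `k` highest-scoring descriptors before any classifier is trained. The function names `pearson` and `filter_select` and the toy data are hypothetical and chosen only for illustration.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def filter_select(X, y, k):
    """Return the indices of the k descriptors most correlated (in absolute value) with y."""
    n_features = len(X[0])
    scores = []
    for j in range(n_features):
        column = [row[j] for row in X]
        scores.append(abs(pearson(column, y)))
    # Rank descriptors by score, highest first, and keep the top k.
    ranked = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:k])


# Toy data (made up): 6 molecules, 4 descriptors, binary activity labels.
X = [
    [0.1, 5.0, 1.0, 3.0],
    [0.2, 5.0, 0.0, 1.0],
    [0.1, 5.0, 1.0, 2.0],
    [0.9, 5.0, 0.0, 3.0],
    [0.8, 5.0, 1.0, 1.0],
    [0.9, 5.0, 0.0, 2.0],
]
y = [0, 0, 0, 1, 1, 1]

selected = filter_select(X, y, k=2)  # descriptor indices kept by the filter
X_reduced = [[row[j] for j in selected] for row in X]  # reduced input for a classifier
```

On this toy data the filter keeps descriptors 0 and 2: the constant descriptor 1 and the uncorrelated descriptor 3 both score zero. The selection depends only on the data and not on any classifier, which is what distinguishes a filter from wrapper or embedded approaches.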

Disclosure statement

No potential conflict of interest was reported by the authors.

SUPPLEMENTAL DATA

Supplemental data for this article can be accessed at: https://doi.org/10.1080/1062936X.2019.1588160

Additional information

Funding

This work was supported in part by Project TIN2015-66108-P of the Spanish Ministry of Science and Innovation.
