92
Views
1
CrossRef citations to date
0
Altmetric
Articles

A preprocessing method combined with an ensemble framework for the multiclass imbalanced data classification

ORCID Icon & ORCID Icon
Pages 1178-1185 | Received 13 Sep 2019, Accepted 29 Nov 2019, Published online: 10 Dec 2019
 

Abstract

Skewed distributions appear in many real-world classification problems. Skewed distributions, underrepresented classes, and multiple overlapping regions in multiclass imbalanced datasets deteriorate the performance of existing classification algorithms and approaches. In this context, we combine a novel preprocessing procedure to tackle minority classes in multiclass imbalanced problems with an ensemble framework. The preprocessing method oversamples the minority classes based on normalized probability, and then an ensemble called a stacked generalization framework is used to train the model. The motive behind combining the ensemble framework and the preprocessing procedure is to enhance the overall classification performance of the classifier for multiclass imbalanced problems. Experimental results on 20 multiclass imbalanced datasets show that the proposed preprocessing method with the ensemble framework outperforms the representative approaches in 13 datasets for macro average arithmetic (MAvA) and mean F-measure (MFM) metrics. In the case of state-of-the-art techniques, the proposed approach steered 14 datasets for the MAvA metric and 15 datasets for the MFM metric to success.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Notes on contributors

M. R. Pavan Kumar

Pavan Kumar M R received M. Tech degree from VIT University, Vellore, Tamilnadu. Currently, he is a research scholar in the School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamilnadu, India and pursuing his Ph.D. degree in the field of Data Analytics. His main areas of research include Big Data Analytics, Machine Learning, and Deep Learning.

Prabhu Jayagopal

Dr. Prabhu J received his Ph.D. (Computer science and Engineering) and Master of Computer Science and Engineering from Sathyabama University, Chennai, Tamil Nadu, India. He received a B.Tech degree from the Department of Information Technology, Vellore Engineering College, Affiliated to Madras University in 2004 located at Vellore, Tamil Nadu, India. He worked as an Assistant Professor in various Engineering colleges for more than 13 years. Now he is working as an Associate Professor in the School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India. His research interests are Software Testing, Software Quality Assurance, Software Metrics, and Big Data.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 288.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.