58
Views
5
CrossRef citations to date
0
Altmetric
Original Articles

Fast Support Vector Machine Classification for Large Data Sets

&
Pages 197-212 | Received 14 Jun 2011, Accepted 12 Dec 2011, Published online: 21 Nov 2013
 

Abstract

Normal support vector machine (SVM) algorithms are not suitable for classification of large data sets because of high training complexity. This paper introduces a novel two-stage SVM classification approach for large data sets. Fast clustering techniques are introduced to select the training data from the original data set for the first stage SVM, and a de-clustering technique is then proposed to recover the training data for the second stage SVM. The proposed two-stage SVM classifier has distinctive advantages on dealing with huge data sets such as those in bioinformatics. Finally, we apply the proposed method on several benchmark problems. Experimental results demonstrate that our approach has good classification accuracy while the training is significantly faster than other SVM classifiers.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.