1,241
Views
165
CrossRef citations to date
0
Altmetric
Technical Note

Crop classification by support vector machine with intelligently selected training data for an operational application

&
Pages 2227-2240 | Received 06 Dec 2005, Accepted 05 Apr 2007, Published online: 25 Mar 2008
 

Abstract

The accuracy of a supervised classification is dependent to a large extent on the training data used. The aim in training is often to capture a large training set to fully describe the classes spectrally, commonly with the requirements of a conventional statistical classifier in mind. However, it is not always necessary to provide a complete description of the classes, especially if using a support vector machine (SVM) as the classifier. An SVM seeks to fit an optimal hyperplane between the classes and uses only some of the training samples that lie at the edge of the class distributions in feature space (support vectors). This should allow the definition of the most informative training samples prior to the analysis. An approach to identify informative training samples was demonstrated for the classification of agricultural classes in south‐western part of Punjab state, India. A small, intelligently selected, training dataset was acquired in the field with the aid of ancillary information. This dataset contained the data from training sites that were predicted before the classification to be amongst the most informative for an SVM classification. The intelligent training collection scheme yielded a classification of comparable accuracy, ∼91%, to one derived using a larger training set acquired by a conventional approach. Moreover, from inspection of the training sets it was apparent that the intelligently defined training set contained a greater proportion of support vectors (0.70), useful training sites, than that acquired by the conventional approach (0.41). By focusing on the most informative training samples, the intelligent scheme required less investment in training than the conventional approach and its adoption would have reduced the total financial outlay in classification production and evaluation by ∼26%. Additionally, the analysis highlighted the possibility to further reduce the training set size without any significant negative impact on classification accuracy.

Acknowledgements

We are grateful to Dr P. K. Sharma, Director Punjab Remote Sensing Centre, Ludhiana for providing the facilities and support for the field visit and data used in the study. The help rendered by Mr P. K. Litoria, Dr D. C. Loshali and Mr Surjit Singh for the field visit is also duly acknowledged. The field information provided by agriculture departments, Tribune newspaper and the farming community in the study area is highly appreciated. The authors are also grateful to the Commonwealth Scholarship and Fellowship plan administered by the British Council for provision of a scholarship to support AM's work at the University of Southampton while on leave from Punjab Remote Sensing Centre, India. The research reported was initiated while AM and GMF were at the University of Southampton. The support vector machines were constructed with BSVM (version 2.01) software developed by Chih‐Wei Hsu and Chih‐Jen Lin, National University, Taipei. Finally, we are grateful to the reviewers for their constructive comments on the original submission.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.