Original Articles

Evaluation of Naive Bayes and Support Vector Machines for Wikipedia


