67
Views
1
CrossRef citations to date
0
Altmetric
Articles

Systematic framework for short text classification based on improved TWE and supervised MCFS topic merging strategy

ORCID Icon, ORCID Icon, ORCID Icon &
Pages 401-413 | Received 14 Apr 2019, Accepted 23 Apr 2020, Published online: 06 May 2020

References

  • Zhao S, Xu Z, Liu L, et al. Towards accurate deceptive opinions detection based on word order-preserving CNN. Mathematical Problems in Engineering. 2018;1:1–9.
  • Sriram B, Fuhry D, Demir E, et al. Short text classification in twitter to improve information filtering; 2010 Jul.18-23; Geneva, Switzerland; 2010. p. 841–842.
  • Chen M, Shen D, Shen D. Short text classification improved by learning multi-granularity topics. International Joint Conference on Artificial Intelligence; Barcelona, Catalonia, Spain; 2011. p. 1776–1781.
  • Vo D T, Ock C Y. Learning to classify short text from scientific documents using topic models with various types of knowledge. Expert Syst Appl. 2015;42(3):1684–1698. doi: https://doi.org/10.1016/j.eswa.2014.09.031
  • Zhang L, Li C, Liu J, et al. Graph-based text similarity measurement by exploiting wikipedia as background knowledge, world Academy of Science. Eng Technol. 2011;59:1548–1553.
  • Zhu Y, Li L, Luo L. Learning to classify short text with topic model and external knowledge. In: M Wang, editor. International Conference on knowledge Science, engineering and Management. Berlin, Heidelberg: Springer; 2013. p. 493–503.
  • Song Y, Wang H, Wang Z, et al. Short text conceptualization using a probabilistic knowledgebase; International Joint Conference on Artificial Intelligence; 2011 Jul.16–22; Barcelona, Catalonia, Spain; 2011. p. 2330–2336.
  • Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. J Mach Learn Res. 2003;3:993–1022.
  • Mikolov T, Sutskever I, Chen K, et al. Distributed representations of words and phrases and their compositionality. International Conference on Neural Information Processing Systems; 2013 Dec 5-10; Harrahs and Harveys, Lake Tahoe; 2013. p. 3111–3119.
  • Mikolov T, Yih WT, Zweig G. Linguistic regularities in continuous space word representations; HLT-NAACL; 2013 June 9-14, Atlanta, Georgia.
  • Mikolov T, Chen K, Corrado G, et al. Efficient estimation of word representations in vector space. New York: Computer Science; 2013.
  • Le Q, Mikolov T. Distributed representations of sentences and documents. Proceedings of the 31st International Conference on Machine Learning (ICML-14); 2014 June 21-26; Beijing, China; 2014. p. 1188–1196.
  • Bengio Y, Ducharme R, Vincent P, et al. A neural probabilistic language model. J Mach Learn Res. 2003;3(Feb):1137–1155.
  • Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence. 2014;39(4):640–651.
  • He K, Zhang X, Ren S, et al. Deep Residual Learning for Image Recognition[J]. 2015.
  • Kipf TN, Welling M. Semi-Supervised Classification with Graph Convolutional Networks[J]. 2016.
  • Zhang H, Zhong G. Improving short text classification by learning vector representations of both words and hidden topics. Knowl Based Syst. 2016;102:76–86. doi: https://doi.org/10.1016/j.knosys.2016.03.027
  • Cai D, Zhang C, He X. Unsupervised feature selection for multi-cluster data. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2010 July 25-28; Washington; 2010. p. 333–342.
  • Phan XH, Nguyen CT, Le DT, et al. A hidden topic-based framework toward building applications with short web documents. IEEE Trans Knowl Data Eng. 2011;23(7):961–976. doi: https://doi.org/10.1109/TKDE.2010.27
  • Yan X, Guo J, Lan Y, et al. A biterm topic model for short texts. International Conference on World Wide Web; 2013 May 13-17; Rio de Janeiro, Brazil; 2013.p. 1445–1456.
  • Zhou G, Liu Y, Liu F, et al. Improving question retrieval in community question answering using world knowledge. IJCAI; 2013 Aug 3-9; Beijing, China; vol. 13, 2013. p. 2239–2245.
  • Liu Y, Liu Z, Chua TS, et al. Topical word embeddings in Twenty-Ninth AAAI Conference on Artificial Intelligence; 2015 Jan. 25–30; Hyatt Regency in Austin, Texas, USA; 2015. p. 2418–2424.
  • Law J, Zhuo HH, He J, et al. Ltsg: Latent topical skipgram for mutually learning topic model and vector representations 2017.
  • Kim Y. Convolutional neural networks for sentence classification. Eprint Arxiv. 2014.
  • Socher R, Perelygin A, Wu JY, et al. Recursive deep models for semantic compositionality over a sentiment treebank. 2013.
  • Hochreiter S, Schmidhuber J. Long short-term memory. Cambridge, US: Neural Computation; 1997. p. 1735–1780.
  • Lee JY, Dernoncourt F. Sequential short-text classification with recurrent and convolutional neural networks. 2016. p. 515–520.
  • Lai S, Xu L, Liu K, et al. Recurrent convolutional neural networks for text classification; AAAI; 2015 Jan. 25–30; Hyatt Regency in Austin, Texas, USA; vol. 333, 2015. p. 2267– 2273.
  • Wang P, Xu B, Xu J, et al. Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification. Neurocomputing. 2016;174(PB):806–814. doi: https://doi.org/10.1016/j.neucom.2015.09.096
  • Rodriguez A, Laio A. Machine learning. clustering by fast search and find of density peaks. Science. 2014;344(6191):1492. doi: https://doi.org/10.1126/science.1242072
  • Wang P, Zhang H, Wu Y-F, et al. A robust framework for short text categorization based on topic model and integrated classifier. Neural Networks (IJCNN), 2014 International Joint Conference on IEEE; 2014 July 6-11; Beijing, China; 2014. p. 3534–3539.
  • Wang P, Zhang H, Xu B, et al. Short text feature enrichment using link analysis on topic-keyword graph. Shenzhen, China: Springer Berlin Heidelberg; 2014. p. 79–90.
  • Phan X-H. Jwikidocs: A java-based wikipedia crawling toolkit. [Online]. Available: http://jwebpro.sourceforge.net/.
  • Phan X-H, Nguyen C-T. Gibbslda++: A c/c++ implementation of latent dirichlet allocation (lda). [Online]. Available: http://gibbslda.sourceforge.net/.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.