CFSE: a Chinese short text classification method based on character frequency sub-word enhancement

Xingguang Wanga School of Computer Science and Engineering, Anhui University of Science & Technology, Huainan, People’s Republic of China;b Artificial Intelligence Research Institute of Hefei Comprehensive National Science Center, Hefei, People’s Republic of ChinaView further author information

Shunxiang Zhanga School of Computer Science and Engineering, Anhui University of Science & Technology, Huainan, People’s Republic of China;b Artificial Intelligence Research Institute of Hefei Comprehensive National Science Center, Hefei, People’s Republic of ChinaCorrespondence[email protected]
View further author information

Zichen Maa School of Computer Science and Engineering, Anhui University of Science & Technology, Huainan, People’s Republic of China;b Artificial Intelligence Research Institute of Hefei Comprehensive National Science Center, Hefei, People’s Republic of ChinaView further author information

Yunduo Liua School of Computer Science and Engineering, Anhui University of Science & Technology, Huainan, People’s Republic of China;b Artificial Intelligence Research Institute of Hefei Comprehensive National Science Center, Hefei, People’s Republic of ChinaView further author information

Youqiang Zhanga School of Computer Science and Engineering, Anhui University of Science & Technology, Huainan, People’s Republic of China;b Artificial Intelligence Research Institute of Hefei Comprehensive National Science Center, Hefei, People’s Republic of ChinaView further author information

Article: 2263663 | Received 08 Jun 2023, Accepted 21 Sep 2023, Published online: 06 Oct 2023

Cite this article
https://doi.org/10.1080/09540091.2023.2263663
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

References

Basabain, S., Cambria, E., Alomar, K., & Hussain, A. (2023). Enhancing Arabic-text feature extraction utilizing label-semantic augmentation in few/zero-shot learning. Expert Systems, 40(8), e13329. https://doi.org/10.1111/exsy.13329
Web of Science ®Google Scholar
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
Web of Science ®Google Scholar
Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression trees (CART). Biometrics, 40(3), 358. https://doi.org/10.2307/2530946
Google Scholar
Cover, T., & Hart, P. (1967). Nearest neighbor pattern classification. IEEE Transactions on Information Theory, 13(1), 21–27. https://doi.org/10.1109/TIT.1967.1053964
Web of Science ®Google Scholar
Dai, Y., Shou, L., Gong, M., Xia, X., Kang, Z., Xu, Z., & Jiang, D. (2022). Graph fusion network for text classification. Knowledge-Based Systems, 236, 107659. https://doi.org/10.1016/j.knosys.2021.107659
Web of Science ®Google Scholar
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 4171–4186. https://doi.org/10.18653/v1/N19-1423
Google Scholar
Feng, Y., Qu, B., Xu, H., & Wang, R. (2019). Chinese FastText short text classification method integrating TF-IDF and LDA. Journal of Applied Sciences, 37(3), 378. https://doi.org/10.3969/j.issn.0255-8297.2019.03.008
Google Scholar
Fu, W., Yang, D., Ma, H., & Wu, D. (2022). Short text classification method based on BTM and BERT. Computer Engineering and Design, 43(12), 3421–3427. https://doi.org/10.16208/j.issn1000-7024.2022.12.016
Google Scholar
Jian, L. X. S. Q. S. (2023). Automatic classification of product review texts combining short text extension and BERT. Journal of Information Resources Management, 13(1), 129. https://doi.org/10.13365/j.jirm.2023.01.129
Google Scholar
Joachims, T. (1998). Text categorization with support vector machines: Learning with many relevant features. In C. Nédellec & C. Rouveirol (Eds.), Machine learning: ECML-98 (pp. 137–142). Springer. https://doi.org/10.1007/BFb0026683
Google Scholar
Johnson, R., & Zhang, T. (2017). Deep pyramid convolutional neural networks for text categorization. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 562–570. https://doi.org/10.18653/v1/P17-1052
Google Scholar
Kim, Y. (2014). Convolutional neural networks for sentence classification. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 1746–1751, https://doi.org/10.3115/v1/D14-1181
Google Scholar
Lai, S., Xu, L., Liu, K., & Zhao, J. (2015). Recurrent convolutional neural networks for text classification. Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2267–2273. https://doi.org/10.1109/IJCNN.2019.8852406
Google Scholar
Li, B.-H., Xiang, Y.-X., Feng, D., He, Z.-C., Wu, J.-J., Dai, T.-L., & Li, J. (2022). Short text classification model combining knowledge aware and dual attention. Journal of Software, 33(10), 3565–3581.
Google Scholar
Li, F.-F., Su, P.-Z., Duan, J.-W., Zhang, S.-C., & Mao, X.-L. (2023). Multi-label text classification with enhancing multi-granularity information relations. Journal of Software, 1–18. https://doi.org/10.13328/j.cnki.jos.006802
Google Scholar
Li, S., Deng, M., Shao, Z., Chen, X., & Zheng, Y. (2023). Automatic classification of interactive texts in online collaborative discussion based on multi-feature fusion. Computers and Electrical Engineering, 107, 108648. https://doi.org/10.1016/j.compeleceng.2023.108648
Web of Science ®Google Scholar
Li, X., Zhu, G., Zhang, S., & Wei, Z. (2023). RCRE: Radical-aware causal relationship extraction model oriented in the medical field. International Journal of Computational Science and Engineering, https://doi.org/10.1504/IJCSE.2023.10054227
Web of Science ®Google Scholar
Liu, P., Qiu, X., & Huang, X. (2016). Recurrent neural network for text classification with multi-task learning. Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2873–2879. https://doi.org/10.48550/arXiv.1605.05101
Google Scholar
McCallum, A., & Nigam, K. (1998). A comparison of event models for naive Bayes text classification. AAAI Conference on Artificial Intelligence. https://api.semanticscholar.org/CorpusID:7311285
Google Scholar
Meng, J. S. C., Shan, H., Huang, R., Yan, F., Li, Z., Zheng, G., & Liu, Y. (2023). Text classification model based on dual-channel feature fusion based on XLNet. Journal of Shandong University(Natural Science), 58(5), 36. https://doi.org/10.6040/j.issn.1671-9352.0.2021.790
Google Scholar
Onan, A. (2022). Bidirectional convolutional recurrent neural network architecture with group-wise enhancement mechanism for text sentiment classification. Journal of King Saud University - Computer and Information Sciences, 34(5), 2098–2117. https://doi.org/10.1016/j.jksuci.2022.02.025
Web of Science ®Google Scholar
Rohidin, D., Samsudin, N. A., & Deris, M. M. (2022). Association rules of fuzzy soft set based classification for text classification problem. Journal of King Saud University - Computer and Information Sciences, 34(3), 801–812. https://doi.org/10.1016/j.jksuci.2020.03.014
Web of Science ®Google Scholar
Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513–523. https://doi.org/10.1016/0306-4573(88)90021-0
Web of Science ®Google Scholar
Shunxiang, Z., Aoqiang, Z., Guangli, Z., Zhongliang, W., & KuanChing, L. (2023). Building fake review detection model based on sentiment intensity and PU learning. IEEE Transactions on Neural Networks and Learning Systems, https://doi.org/10.1109/tnnls.2023.3234427
PubMed Web of Science ®Google Scholar
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., & Polosukhin, I. (2017). Attention is all you need. Proceedings of the 31st International Conference on Neural Information Processing Systems, 6000–6010. https://doi.org/10.48550/arXiv.1706.03762
Google Scholar
Wang, C., Jiang, H., Chen, T., Liu, J., Wang, M., Jiang, S., Li, Z., & Xiao, Y. (2022). Entity understanding with hierarchical graph learning for enhanced text classification. Knowledge-Based Systems, 244, 108576. https://doi.org/10.1016/j.knosys.2022.108576
Web of Science ®Google Scholar
Wu, F. L., Gou, J., & Wang, C. (2013). Review of Chinese short text classification. Applied Mechanics and Materials, 336–338, 2171–2174. https://doi.org/10.4028/www.scientific.net/AMM.336-338.2171
Google Scholar
Yan, C., Liu, J., Liu, W., & Liu, X. (2022). Research on public opinion sentiment classification based on attention parallel dual-channel deep learning hybrid model. Engineering Applications of Artificial Intelligence, 116, 105448. https://doi.org/10.1016/j.engappai.2022.105448
Web of Science ®Google Scholar
Zhang, S., Hu, Z., Zhu, G., Jin, M., & Li, K.-C. (2021). Sentiment classification model for Chinese micro-blog comments based on key sentences extraction. Soft Computing, 25(1), 463–476. https://doi.org/10.1007/s00500-020-05160-8
Web of Science ®Google Scholar
Zhang, S., Wang, Y., Zhang, S., & Zhu, G. (2016). Building associated semantic representation model for the ultra-short microblog text jumping in big data. Cluster Computing, 19(3), 1399–1410. https://doi.org/10.1007/s10586-016-0602-9
Web of Science ®Google Scholar
Zhang, S., Yu, H., & Zhu, G. (2022). An emotional classification method of Chinese short comment text based on ELECTRA. Connection Science, 34(1), 254–273. https://doi.org/10.1080/09540091.2021.1985968
Web of Science ®Google Scholar
Zhang, Z., Han, X., Liu, Z., Jiang, X., Sun, M., & Liu, Q. (2019). ERNIE: Enhanced language representation with informative entities. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 1441–1451. https://doi.org/10.18653/v1/P19-1139
Google Scholar
Zhou, P., Shi, W., Tian, J., Qi, Z., Li, B., Hao, H., & Xu, B. (2016). Attention-based bidirectional long short-term memory networks for relation classification. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 207–212. https://doi.org/10.18653/v1/P16-2034
Google Scholar
Zhou, Y., Li, J., Chi, J., Tang, W., & Zheng, Y. (2022). Set-CNN: A text convolutional neural network based on semantic extension for short text classification. Knowledge-Based Systems, 257, 109948. https://doi.org/10.1016/j.knosys.2022.109948
Web of Science ®Google Scholar

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

CFSE: a Chinese short text classification method based on character frequency sub-word enhancement

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

CFSE: a Chinese short text classification method based on character frequency sub-word enhancement

References

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date