467
Views
0
CrossRef citations to date
0
Altmetric
Research Article

CFSE: a Chinese short text classification method based on character frequency sub-word enhancement

, , , &
Article: 2263663 | Received 08 Jun 2023, Accepted 21 Sep 2023, Published online: 06 Oct 2023

References

  • Basabain, S., Cambria, E., Alomar, K., & Hussain, A. (2023). Enhancing Arabic-text feature extraction utilizing label-semantic augmentation in few/zero-shot learning. Expert Systems, 40(8), e13329. https://doi.org/10.1111/exsy.13329
  • Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
  • Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression trees (CART). Biometrics, 40(3), 358. https://doi.org/10.2307/2530946
  • Cover, T., & Hart, P. (1967). Nearest neighbor pattern classification. IEEE Transactions on Information Theory, 13(1), 21–27. https://doi.org/10.1109/TIT.1967.1053964
  • Dai, Y., Shou, L., Gong, M., Xia, X., Kang, Z., Xu, Z., & Jiang, D. (2022). Graph fusion network for text classification. Knowledge-Based Systems, 236, 107659. https://doi.org/10.1016/j.knosys.2021.107659
  • Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 4171–4186. https://doi.org/10.18653/v1/N19-1423
  • Feng, Y., Qu, B., Xu, H., & Wang, R. (2019). Chinese FastText short text classification method integrating TF-IDF and LDA. Journal of Applied Sciences, 37(3), 378. https://doi.org/10.3969/j.issn.0255-8297.2019.03.008
  • Fu, W., Yang, D., Ma, H., & Wu, D. (2022). Short text classification method based on BTM and BERT. Computer Engineering and Design, 43(12), 3421–3427. https://doi.org/10.16208/j.issn1000-7024.2022.12.016
  • Jian, L. X. S. Q. S. (2023). Automatic classification of product review texts combining short text extension and BERT. Journal of Information Resources Management, 13(1), 129. https://doi.org/10.13365/j.jirm.2023.01.129
  • Joachims, T. (1998). Text categorization with support vector machines: Learning with many relevant features. In C. Nédellec & C. Rouveirol (Eds.), Machine learning: ECML-98 (pp. 137–142). Springer. https://doi.org/10.1007/BFb0026683
  • Johnson, R., & Zhang, T. (2017). Deep pyramid convolutional neural networks for text categorization. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 562–570. https://doi.org/10.18653/v1/P17-1052
  • Kim, Y. (2014). Convolutional neural networks for sentence classification. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 1746–1751, https://doi.org/10.3115/v1/D14-1181
  • Lai, S., Xu, L., Liu, K., & Zhao, J. (2015). Recurrent convolutional neural networks for text classification. Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2267–2273. https://doi.org/10.1109/IJCNN.2019.8852406
  • Li, B.-H., Xiang, Y.-X., Feng, D., He, Z.-C., Wu, J.-J., Dai, T.-L., & Li, J. (2022). Short text classification model combining knowledge aware and dual attention. Journal of Software, 33(10), 3565–3581.
  • Li, F.-F., Su, P.-Z., Duan, J.-W., Zhang, S.-C., & Mao, X.-L. (2023). Multi-label text classification with enhancing multi-granularity information relations. Journal of Software, 1–18. https://doi.org/10.13328/j.cnki.jos.006802
  • Li, S., Deng, M., Shao, Z., Chen, X., & Zheng, Y. (2023). Automatic classification of interactive texts in online collaborative discussion based on multi-feature fusion. Computers and Electrical Engineering, 107, 108648. https://doi.org/10.1016/j.compeleceng.2023.108648
  • Li, X., Zhu, G., Zhang, S., & Wei, Z. (2023). RCRE: Radical-aware causal relationship extraction model oriented in the medical field. International Journal of Computational Science and Engineering, https://doi.org/10.1504/IJCSE.2023.10054227
  • Liu, P., Qiu, X., & Huang, X. (2016). Recurrent neural network for text classification with multi-task learning. Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2873–2879. https://doi.org/10.48550/arXiv.1605.05101
  • McCallum, A., & Nigam, K. (1998). A comparison of event models for naive Bayes text classification. AAAI Conference on Artificial Intelligence. https://api.semanticscholar.org/CorpusID:7311285
  • Meng, J. S. C., Shan, H., Huang, R., Yan, F., Li, Z., Zheng, G., & Liu, Y. (2023). Text classification model based on dual-channel feature fusion based on XLNet. Journal of Shandong University(Natural Science), 58(5), 36. https://doi.org/10.6040/j.issn.1671-9352.0.2021.790
  • Onan, A. (2022). Bidirectional convolutional recurrent neural network architecture with group-wise enhancement mechanism for text sentiment classification. Journal of King Saud University - Computer and Information Sciences, 34(5), 2098–2117. https://doi.org/10.1016/j.jksuci.2022.02.025
  • Rohidin, D., Samsudin, N. A., & Deris, M. M. (2022). Association rules of fuzzy soft set based classification for text classification problem. Journal of King Saud University - Computer and Information Sciences, 34(3), 801–812. https://doi.org/10.1016/j.jksuci.2020.03.014
  • Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513–523. https://doi.org/10.1016/0306-4573(88)90021-0
  • Shunxiang, Z., Aoqiang, Z., Guangli, Z., Zhongliang, W., & KuanChing, L. (2023). Building fake review detection model based on sentiment intensity and PU learning. IEEE Transactions on Neural Networks and Learning Systems, https://doi.org/10.1109/tnnls.2023.3234427
  • Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., & Polosukhin, I. (2017). Attention is all you need. Proceedings of the 31st International Conference on Neural Information Processing Systems, 6000–6010. https://doi.org/10.48550/arXiv.1706.03762
  • Wang, C., Jiang, H., Chen, T., Liu, J., Wang, M., Jiang, S., Li, Z., & Xiao, Y. (2022). Entity understanding with hierarchical graph learning for enhanced text classification. Knowledge-Based Systems, 244, 108576. https://doi.org/10.1016/j.knosys.2022.108576
  • Wu, F. L., Gou, J., & Wang, C. (2013). Review of Chinese short text classification. Applied Mechanics and Materials, 336–338, 2171–2174. https://doi.org/10.4028/www.scientific.net/AMM.336-338.2171
  • Yan, C., Liu, J., Liu, W., & Liu, X. (2022). Research on public opinion sentiment classification based on attention parallel dual-channel deep learning hybrid model. Engineering Applications of Artificial Intelligence, 116, 105448. https://doi.org/10.1016/j.engappai.2022.105448
  • Zhang, S., Hu, Z., Zhu, G., Jin, M., & Li, K.-C. (2021). Sentiment classification model for Chinese micro-blog comments based on key sentences extraction. Soft Computing, 25(1), 463–476. https://doi.org/10.1007/s00500-020-05160-8
  • Zhang, S., Wang, Y., Zhang, S., & Zhu, G. (2016). Building associated semantic representation model for the ultra-short microblog text jumping in big data. Cluster Computing, 19(3), 1399–1410. https://doi.org/10.1007/s10586-016-0602-9
  • Zhang, S., Yu, H., & Zhu, G. (2022). An emotional classification method of Chinese short comment text based on ELECTRA. Connection Science, 34(1), 254–273. https://doi.org/10.1080/09540091.2021.1985968
  • Zhang, Z., Han, X., Liu, Z., Jiang, X., Sun, M., & Liu, Q. (2019). ERNIE: Enhanced language representation with informative entities. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 1441–1451. https://doi.org/10.18653/v1/P19-1139
  • Zhou, P., Shi, W., Tian, J., Qi, Z., Li, B., Hao, H., & Xu, B. (2016). Attention-based bidirectional long short-term memory networks for relation classification. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 207–212. https://doi.org/10.18653/v1/P16-2034
  • Zhou, Y., Li, J., Chi, J., Tang, W., & Zheng, Y. (2022). Set-CNN: A text convolutional neural network based on semantic extension for short text classification. Knowledge-Based Systems, 257, 109948. https://doi.org/10.1016/j.knosys.2022.109948