546
Views
10
CrossRef citations to date
0
Altmetric
Application Note

Unsupervised document classification integrating web scraping, one-class SVM and LDA topic modelling

, , &
Pages 574-591 | Received 29 Sep 2020, Accepted 31 Mar 2021, Published online: 27 Apr 2021

Keep up to date with the latest research on this topic with citation updates for this article.

Read on this site (2)

Yichuan Zhao, Chi-Hua Chen, Feng Feng & Dragan Pamucar. (2023) Editorial to the special issue: Statistical Approaches for Big Data and Machine Learning. Journal of Applied Statistics 50:3, pages 451-455.
Read now
Priti Bhardwaj & Niyati Baliyan. A Novel and Effective Multi-Class Classification Method for Imbalanced Medical Transcriptions. IETE Journal of Research 0:0, pages 1-11.
Read now

Articles from other publishers (8)

Guan Bo, Wang Shanshan, Zhang Qing, Pang Bo & Zuo Yan. (2024) Empowering Medical Data Analysis: An Advanced Deep Fusion Model for Sorting Medicine Document. IEEE Access 12, pages 1650-1659.
Crossref
Wenqing Chen & Ting Yang. (2023) A Recommendation System of Personalized Resource Reliability for Online Teaching System under Large-scale User Access. Mobile Networks and Applications 28:3, pages 983-994.
Crossref
Anton Thielmann, Christoph Weisser, Thomas Kneib & Benjamin Säfken. (2023) Coherence based Document Clustering. Coherence based Document Clustering.
Maojian Chen, Xiong Luo, Qiaojuan Peng, Hailun Shen & Ziyang Huang. 2023. Proceedings of 2023 Chinese Intelligent Automation Conference. Proceedings of 2023 Chinese Intelligent Automation Conference 511 518 .
Prajwal Eachempati, Laurent Muzellec & Ashish Kumar Jha. (2022) Examining the relationship between Privacy Setting Policy, Public Discourse, Business Models and Financial Performance of Facebook (2004–2021). Examining the relationship between Privacy Setting Policy, Public Discourse, Business Models and Financial Performance of Facebook (2004–2021).
Gillian Kant, Levin Wiebelt, Christoph Weisser, Krisztina Kis-Katos, Mattias Luber & Benjamin Säfken. (2022) An iterative topic model filtering framework for short and noisy user-generated data: analyzing conspiracy theories on twitter. International Journal of Data Science and Analytics.
Crossref
Naveen S Pagad, Pradeep N, Khalid K. Almuzaini, Manish Maheshwari, Durgaprasad Gangodkar, Piyush Shukla & Musah Alhassan. (2022) Clinical Text Data Categorization and Feature Extraction Using Medical-Fissure Algorithm and Neg-Seq Algorithm. Computational Intelligence and Neuroscience 2022, pages 1-16.
Crossref
Arne Tillmann, Anton Thielmann, Gillian Kant, Christoph Weisser, Benjamin Säfken, Alexander Silbersdorff & Thomas Kneib. (2021) AuDoLab: Automatic document labelling and classification for extremely unbalanced data. Journal of Open Source Software 6:66, pages 3719.
Crossref

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.