1,055
Views
6
CrossRef citations to date
0
Altmetric
Research Article

A review of approaches for topic detection in Twitter

, , &
Pages 747-773 | Received 11 Jul 2019, Accepted 03 Jun 2020, Published online: 28 Jun 2020

References

  • Aggarwal, C. C., & Subbian, K. (2012). Event detection in social streams. Proceedings of the 2012 SIAM international conference on data mining, SIAM. Anaheim, California, USA.
  • Aha, D. W., Kibler, D., & Albert, M. K. (1991). Instance-based learning algorithms. Machine Learning, 6(1), 37–66. https://doi.org/10.1007/BF00153759
  • Akbik, A., Blythe, D., & Vollgraf, R. (2018). Contextual string embeddings for sequence labeling. Proceedings of the 27th International Conference on Computational Linguistics. Santa Fe, New Mexico, USA.
  • Alam, F., Ofli, F., Imran, M., & Aupetit, M. (2018). A Twitter Tale of Three Hurricanes: Harvey, Irma, and Maria. arXiv Preprint. https://arxiv.org/abs/1805.05144
  • Alhawarat, M., & Hegazi, M. (2018). Revisiting K-means and topic modeling, a comparison study to cluster arabic documents. IEEE Access, 6(1), 42740–42749. https://doi.org/10.1109/ACCESS.2018.2852648
  • Allan, J., Carbonell, J. G., Doddington, G., Yamron, J., & Yang, Y. (2003). Topic detection and tracking pilot study final report. Carnegie Mellon University. https://doi.org/10.1184/R1/6626252.v1
  • Allan, J., Harding, S., Fisher, D., Bolivar, A., Guzman-Lara, S., & Amstutz, P. (2005). Taking topic detection from evaluation to practice. Proceedings of the 38th Annual Hawaii International Conference on System Sciences, : Big Island, Hawaii, USA: IEEE.
  • Allan, J., Papka, R., & Lavrenko, V. (1998). On-line new event detection and tracking. In Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval (pp. 37–45). Citeseer.
  • Anandkumar, A., Ge, R., Hsu, D., Kakade, S. M., & Telgarsky, M. (2014). Tensor decompositions for learning latent variable models. The Journal of Machine Learning Research, 15(1), 2773–2832.  https://escholarship.org/uc/item/87b1x7b6
  • Arora, S., Ge, R., & Moitra, A. (2012). Learning topic models–going beyond SVD. 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science, IEEE, New Brunswick, NJ, USA.
  • Asur, S., & Huberman, B. A. (2010). Predicting the future with social media. Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology-Volume 01 Toronto, Ontario, Canada: IEEE Computer Society.
  • Babcock, B., Babu, S., Datar, M., Motwani, R., & Widom, J. (2002). Models and issues in data stream systems. Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. Madison, Wisconsin, USA: ACM.
  • Bahrehdar, A. R., & Purves, R. S. (2018). Description and characterization of place properties using topic modeling on georeferenced tags. Geo-spatial Information Science, 21(3), 173–184. https://doi.org/10.1080/10095020.2018.1493238
  • Becker, H., Naaman, M., & Gravano, L. (2010). Learning similarity metrics for event identification in social media. Proceedings of the third ACM international conference on Web search and data mining. New York, USA: ACM.
  • Benny, A., & Philip, M. (2015). Keyword based tweet extraction and detection of related topics. Procedia Computer Science, 46(4), 364–371. https://doi.org/10.1016/j.procs.2015.02.032
  • Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3(Jan), 993–1022. DOI: 10.1162/jmlr.2003.3.4-5.993
  • Bouras, C., & Tsogkas, V. (2010). Assigning web news to clusters. 2010 Fifth International Conference on Internet and Web Applications and Services. Barcelona, Spain: IEEE.
  • Boyd, D., Golder, S., & Lotan, G. (2010). Tweet, tweet, retweet: Conversational aspects of retweeting on twitter. 2010 43rd Hawaii International Conference on System Sciences. Koloa, Kauai, Hawaii: IEEE.
  • Caliński, T., & Harabasz, J. (1974). A dendrite method for cluster analysis. Communications in Statistics-theory and Methods, 3(1), 1–27. https://doi.org/10.1080/03610927408827101
  • Carter, S., Weerkamp, W., & Tsagkias, M. (2013). Microblog language identification: Overcoming the limitations of short, unedited and idiomatic text. Language Resources and Evaluation, 47(1), 195–215. https://doi.org/10.1007/s10579-012-9195-y
  • Castellanos, A., Cigarrán, J., & García-Serrano, A. (2017). Formal concept analysis for topic detection: A clustering quality experimental analysis. Information Systems, 66(4), 24–42. https://doi.org/10.1016/j.is.2017.01.008
  • Cataldi, M., Di Caro, L., & Schifanella, C. (2010). Emerging topic detection on twitter based on temporal and social terms evaluation. Proceedings of the tenth international workshop on multimedia data mining. Washington, DC, USA: ACM.
  • Chemudugunta, C., Smyth, P., & Steyvers, M. (2007). Modeling general and specific aspects of documents with a probabilistic topic model. Advances in neural information processing systems.
  • Chen, Y., Xu, B., Hao, H., Zhou, S., & Cao, J. (2013). User-defined hot topic detection in microblogging. Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service. Huangshan, China: ACM.
  • Cigarrán, J., Castellanos, Á., & García-Serrano, A. (2016). A step forward for topic detection in twitter: An FCA-based approach. Expert Systems with Applications, 57(15), 21–36. https://doi.org/10.1016/j.eswa.2016.03.011
  • Collobert, R., & Weston, J. (2008). A unified architecture for natural language processing: Deep neural networks with multitask learning. Proceedings of the 25th international conference on Machine learning. Helsinki, Finland: ACM.
  • Cristianini, N., & Shawe-Taylor, J. (2000). An introduction to support vector machines and other kernel-based learning methods. Cambridge university press.
  • Cutting, D. R., Karger, D. R., Pedersen, J. O., & Tukey, J. W. (1992). Scatter/gather: A cluster-based approach to browsing large document collections. Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval. Copenhagen, Denmark: ACM.
  • Dai, X.-Y., Chen, Q.-C., Wang, X.-L., & Xu, J. (2010). Online topic detection and tracking of financial news based on hierarchical clustering. 2010 International Conference on Machine Learning and Cybernetics. Qingdao, China: IEEE.
  • Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. https://arxiv.org/abs/1810.04805
  • Dubes, R. C., & Jain, A. K. (1988). Algorithms for clustering data. Prentice hall Englewood Cliffs.
  • Elman, J. L. (1991). Distributed representations, simple recurrent networks, and grammatical structure. Machine Learning, 7(2–3), 195–225. https://doi.org/10.1007/BF00114844
  • Finkel, J. R., Grenager, T., & Manning, C. (2005). Incorporating non-local information into information extraction systems by Gibbs sampling. Proceedings of the 43rd annual meeting on association for computational linguistics, Association for Computational Linguistics, Ann Arbor, Michigan.
  • Fiscus, J. G., & Doddington, G. R. (2002). Topic detection and tracking evaluation overview. In J. Allan (Ed.), Topic detection and tracking (pp. 17–31). Springer.
  • Frey, B. J., & Dueck, D. (2007). Clustering by passing messages between data points. science, 315(5814), 972–976. https://doi.org/10.1126/science.1136800
  • Fung, G. P. C., Yu, J. X., Yu, P. S., & Lu, H. (2005). Parameter free bursty events detection in text streams. Proceedings of the 31st international conference on Very large data bases, VLDB Endowment, Trondheim, Norway.
  • Ganter, B., & Wille, R. (2012). Formal concept analysis: Mathematical foundations. Springer Science & Business Media.
  • Gardner, M., Grus, J., Neumann, M., Tafjord, O., Dasigi, P., Liu, N., Peters, M., Schmitz, M., & Zettlemoyer, L. (2018). Allennlp: A deep semantic natural language processing platform. arXiv Preprint. https://arxiv.org/abs/1803.07640v2
  • Gencoglu, O. (2018). Deep representation learning for clustering of health tweets. https://arxiv.org/abs/1901.00439
  • Geng, X., Zhang, Y., Jiao, Y., & Mei, Y. (2017). A novel hybrid clustering algorithm for microblog topic detection. AIP Conference Proceedings. Hubei Province, China: AIP Publishing.
  • Givoni, I., Chung, C., & Frey, B. J. (2012). Hierarchical affinity propagation. https://arxiv.org/abs/1202.3722
  • Guo, X., Xiang, Y., Chen, Q., Huang, Z., & Hao, Y. (2013). LDA-based online topic detection using tensor factorization. Journal of Information Science, 39(4), 459–469. https://doi.org/10.1177/0165551512473066
  • Harris, Z. S. (1954). Distributional structure. Word, 10(2–3), 146–162. https://doi.org/10.1080/00437956.1954.11659520
  • Hasan, M., Orgun, M. A., & Schwitter, R. (2016). TwitterNews±: A framework for real time event detection from the Twitter data stream. International Conference on Social Informatics. Bellevue, USA: Springer.
  • Hasan, M., Orgun, M. A., & Schwitter, R. (2019). Real-time event detection from the Twitter data stream using the TwitterNews+ Framework. Information Processing & Management, 56(3), 1146–1165. https://doi.org/10.1016/j.ipm.2018.03.001
  • He, Q., Chang, K., & Lim, E.-P. (2007). Analyzing feature trajectories for event detection. Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. Amsterdam, Netherlands: ACM.
  • Hooper, R., & Paice, C. (2005). The Lancaster stemming algorithm. University of Lancester.
  • Hu, Y., John, A., Seligmann, D. D., & Wang, F. (2012). What were the tweets about? Topical associations between public events and twitter feeds. Sixth International AAAI Conference on Weblogs and Social Media. Dublin, Ireland
  • Hu, Y., Yi, Y., Yang, T., & Pan, Q. (2018). Short text classification with a convolutional neural networks based method. 2018 15th International Conference on Control, Automation, Robotics and Vision (ICARCV), Marina Bay Sands Expo and Convention Centre, Singapore, IEEE.
  • Humphreys, L., Gill, P., Krishnamurthy, B., & Newbury, E. (2013). Historicizing new media: A content analysis of Twitter. Journal of Communication, 63(3), 413–431. https://doi.org/10.1111/jcom.12030
  • Imran, M., Elbassuoni, S., Castillo, C., Diaz, F., & Meier, P. (2013). Extracting information nuggets from disaster-related messages in social media. Iscram.
  • Imran, M., Mitra, P., & Castillo, C. (2016). Twitter as a lifeline: Human-annotated twitter corpora for NLP of crisis-related messages. https://arxiv.org/abs/1605.05894
  • Indyk, P., & Motwani, R. (1998). Approximate nearest neighbors: Towards removing the curse of dimensionality. Proceedings of the thirtieth annual ACM symposium on Theory of computing. Dallas, Texas, USA: ACM.
  • Ishikawa, S., Arakawa, Y., Tagashira, S., & Fukuda, A. (2012). Hot topic detection in local areas using Twitter and Wikipedia. ARCS 2012. Muenchen, Germany: IEEE.
  • Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., & Fei-Fei, L. (2014). Large-scale video classification with convolutional neural networks. Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. Columbus, Ohio.
  • Kim, H.-G., Lee, S., & Kyeong, S. (2013). Discovering hot topics using Twitter streaming data social topic detection and geographic clustering. 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013). Niagara Ontario, Canada: IEEE.
  • Kleinberg, J. (2003). Bursty and hierarchical structure in streams. Data Mining and Knowledge Discovery, 7(4), 373–397. https://doi.org/10.1023/A:1024940629314
  • Korenius, T., Laurikkala, J., Järvelin, K., & Juhola, M. (2004). Stemming and lemmatization in the clustering of Finnish text documents. Proceedings of the thirteenth ACM international conference on Information and knowledge management. Washington D.C., USA: ACM.
  • Kuznetsov, S. O. (2007). On stability of a formal concept. Annals of Mathematics and Artificial Intelligence, 49(1–4), 101–115. https://doi.org/10.1007/s10472-007-9053-6
  • Kuzuetsov, S. (1990). Stability as an estimate of thie degree of substantiation of hypotheses derived on the basis of operational, Nauchno-Tekhnicheskaya Informatslya, Seriya 2, 24(12), 21–29. https://www.researchgate.net/publication/265673866_Stability_as_an_estimate_of_the_degree_of_substantiation_of_hypotheses_derived_on_the_basis_of_operational_similarity
  • Kwak, H., Lee, C., Park, H., & Moon, S. (2010). What is Twitter, a social network or a news media? Proceedings of the 19th international conference on World wide web. Raleigh, North Carolina, USA: AcM.
  • Le Cessie, S., & Van Houwelingen, J. C. (1992). Ridge estimators in logistic regression. Journal of the Royal Statistical Society. Series C, Applied Statistics, 41(1), 191–201. https://www.jstor.org/stable/2347628
  • Lee, K., Palsetia, D., Narayanan, R., Patwary, M. M. A., Agrawal, A., & Choudhary, A. (2011). Twitter trending topic classification. 2011 IEEE 11th International Conference on Data Mining Workshops. Vancouver, Canada: IEEE.
  • Li, H.-F., Lee, S.-Y., & Shan, M.-K. (2004). An efficient algorithm for mining frequent itemsets over the entire history of data streams. Proc. of First International Workshop on Knowledge Discovery in Data Streams. Pisa, Italy
  • Li, S., Lv, X., Wang, T., & Shi, S. (2010). The key technology of topic detection based on K-means. 2010 International Conference on Future Information Technology and Management Engineering. Changzhou, China: IEEE.
  • Liao, Y., Moshtaghi, M., Han, B., Karunasekera, S., Kotagiri, R., Baldwin, T., Harwood, A., & Pattison, P. (2012). Mining micro-blogs: Opportunities and challenges. In Abraham A. (Ed.), Computational social networks (pp. 129–159). Springer.
  • Liu, Y., Liu, Z., Chua, T.-S., & Sun, M. (2015). Topical word embeddings. Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, Texas, USA.
  • Lovins, J. B. (1968). Development of a stemming algorithm. Mechanics Translations & Computer. Linguistics, 11(1–2), 22–31. http://chuvyr.ru/MT-1968-Lovins.pdf
  • Ma, G. (2018). Tweets classification with BERT in the field of disaster management. Stanford, CA 94305, Department of Civil Engineering Stanford University.
  • Marwick, B. (2013). Discovery of emergent issues and controversies in anthropology using text mining, topic modeling, and social network analysis of microblog content. In Y. Zhao & Y. Cen (Eds.), Data Mining Applications with R (pp. 63–93). Elsevier.
  • McCallum, A. K. (2002). “Mallet: A machine learning for language toolkit.” http://mallet.cs.umass.edu
  • McMinn, A. J., Moshfeghi, Y., & Jose, J. M. (2013). Building a large-scale corpus for evaluating event detection on twitter. Proceedings of the 22nd ACM international conference on Information & Knowledge Management. San Francisco, California, USA: ACM.
  • Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. https://arxiv.org/abs/1301.3781.
  • Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems.
  • Milstein, S., Lorica, B., Magoulas, R., Hochmuth, G., Chowdhury, A., & O’Reilly, T. (2009). Twitter and the micro-messaging revolution: Communication, connections, and immediacy–140 characters at a time. O’Reilly Media.
  • Mori, M., Miura, T., & Shioya, I. (2004). Extracting events from web pages. AISTA’04: Proceedings of the International Conference on Advances in Intelligent Systems-Theory and Applications (AISTA). Centre de Recherche Public Henri Tudor, Luxembourg-Kirchberg, Luxembourg
  • Mori, M., Miura, T., & Shioya, I. (2006). Topic detection and tracking for news web pages. Proceedings of the 2006 IEEE/WIC/ACM international conference on web intelligence, IEEE Computer Society.
  • Murtagh, F. (1983). A survey of recent advances in hierarchical clustering algorithms. The Computer Journal, 26(4), 354–359. https://doi.org/10.1093/comjnl/26.4.354
  • Naveed, N., Gottron, T., Kunegis, J., & Alhadi, A. C. (2011). Searching microblogs: Coping with sparsity and document quality. Proceedings of the 20th ACM international conference on Information and knowledge management. Hong Kong: ACM.
  • Nguyen, D. Q., Billingsley, R., Du, L., & Johnson, M. (2015). Improving topic models with latent feature word representations. Transactions of the Association for Computational Linguistics, 3(1), 299–313. https://doi.org/10.1162/tacl_a_00140
  • Nigam, K., McCallum, A. K., Thrun, S., & Mitchell, T. (2000). Text classification from labeled and unlabeled documents using EM. Machine Learning, 39(2–3), 103–134. https://doi.org/10.1023/A:1007692713085
  • Nur’Aini, K., Najahaty, I., Hidayati, L., Murfi, H., & Nurrohmah, S. (2015). Combination of singular value decomposition and K-means clustering methods for topic detection on Twitter. 2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS). Depok, Indonesia: IEEE.
  • O’Connor, B., Krieger, M., & Ahn, D. (2010). Tweetmotif: Exploratory search and topic summarization for twitter. Fourth International AAAI Conference on Weblogs and Social Media, George Washington University in Washington, DC, USA.
  • Olteanu, A., Castillo, C., Diaz, F., & Vieweg, S. (2014). Crisislex: A lexicon for collecting and filtering microblogged communications in crises. Eighth International AAAI Conference on Weblogs and Social Media, Ann Arbor, Michigan, USA
  • Page, L., Brin, S., Motwani, R., & Winograd, T. (1999). The PageRank citation ranking: Bringing order to the web. Stanford InfoLab.
  • Pennington, J., Socher, R., & Manning, C. (2014). Glove: Global vectors for word representation. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), Doha, Qatar.
  • Peters, M. E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., & Zettlemoyer, L. (2018). Deep contextualized word representations. https://arxiv.org/abs/1802.05365.
  • Petrović, S., Osborne, M., & Lavrenko, V. (2010). Streaming first story detection with application to twitter. Human language technologies: The 2010 Annual conference of the north american chapter of the association for computational linguistics, Association for Computational Linguistics, Los Angeles, California, USA.
  • Phuvipadawat, S., & Murata, T. (2010). Breaking news detection and tracking in Twitter. 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology. Los Alamitos, CA, US: IEEE.
  • Piczak, K. J. (2015). Environmental sound classification with convolutional neural networks. 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP). Boston, USA: IEEE.
  • Piczak, K. J. (2019). Sound classification with convolutional neural networks. The Institute of Computer Science.
  • Porter, M. F. (2006). An algorithm for suffix stripping. Program, 40(3), 211–218. https://doi.org/10.1108/00330330610681286
  • Prabowo, R., Thelwall, M., Hellsten, I., & Scharnhorst, A. (2008). Evolving debates in online communication: A graph analytical approach. Internet Research, 18(5), 520–540. https://doi.org/10.1108/10662240810912765
  • Quinlan, J. R. (1996). Improved use of continuous attributes in C4. 5. Journal of Artificial Intelligence Research, 4(1), 77–90. https://doi.org/10.1613/jair.279
  • Rakesh, V., Reddy, C. K., & Singh, D. (2013). Location-specific tweet detection and topic summarization in twitter. Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. Niagara Ontario, Canada: ACM.
  • Rill, S., Reinel, D., Scheidt, J., & Zicari, R. V. (2014). PoliTwi: Early detection of emerging political topics on twitter and the impact on concept-level sentiment analysis. Knowledge-Based Systems, 69(15), 24–33. https://doi.org/10.1016/j.knosys.2014.05.008
  • Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1988). Learning representations by back-propagating errors. Cognitive Modeling, 5(3), 1. https://doi.org/10.1038/323533a0
  • Sakaki, T., Okazaki, M., & Matsuo, Y. (2010). Earthquake shakes Twitter users: Real-time event detection by social sensors. Proceedings of the 19th international conference on World wide web. Raleigh, North Carolina, USA: ACM.
  • Salamon, J., & Bello, J. P. (2017). Deep convolutional neural networks and data augmentation for environmental sound classification. IEEE Signal Processing Letters, 24(3), 279–283. https://doi.org/10.1109/LSP.2017.2657381
  • Salton, G., & McGill, M. J. (1983). Introduction to modern information retrieval. mcgraw-hill.
  • Sankaranarayanan, J., Samet, H., Teitler, B. E., Lieberman, M. D., & Sperling, J. (2009). Twitterstand: News in tweets. Proceedings of the 17th acm sigspatial international conference on advances in geographic information systems. Seattle, USA: ACM.
  • Sayyadi, H., & Raschid, L. (2013). A graph analytical approach for topic detection. ACM Transactions on Internet Technology (TOIT), 13(2), 4. https://doi.org/10.1145/2542214.2542215
  • Schubert, E., Weiler, M., & Kriegel, H.-P. (2014). Signitrend: Scalable detection of emerging topics in textual streams by hashed significance thresholds. Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. New York, USA: ACM.
  • Sproat, R., Black, A. W., Chen, S., Kumar, S., Ostendorf, M., & Richards, C. (2001). Normalization of non-standard words. Computer Speech & Language, 15(3), 287–333. https://doi.org/10.1006/csla.2001.0169
  • Sridhar, V. K. R. (2015). Unsupervised topic modeling for short texts using distributed representations of words. Proceedings of the 1st workshop on vector space modeling for natural language processing, Denver, Colorado, USA
  • Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., & Demirbas, M. (2010). Short text classification in twitter to improve information filtering. Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval. Geneva, Switzerland: ACM.
  • Steyvers, M., & Griffiths, T. (2007). Probabilistic topic models. Handbook of Latent Semantic Analysis, 427(7), 424–440. http://173.236.226.255/tom/papers/SteyversGriffiths.pdf
  • Suzuki, J., & Nagata, M. (2015). A unified learning framework of skip-grams and global vectors. Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), Beijing, China.
  • Teevan, J., Ramage, D., & Morris, M. R. (2011). # TwitterSearch: A comparison of microblog search and web search. Proceedings of the fourth ACM international conference on Web search and data mining. Hong Kong: ACM.
  • Tembhurnikar, S. D., & Patil, N. N. (2015). Topic detection using BNgram method and sentiment analysis on twitter dataset. 2015 4th International Conference on Reliability, Infocom Technologies and Optimization (ICRITO)(Trends and Future Directions). Noida, India: IEEE.
  • Titov, I., & McDonald, R. (2008). Modeling online reviews with multi-grain topic models. Proceedings of the 17th international conference on World Wide Web. Beijing, China: ACM.
  • Toda, H., & Kataoka, R. (2005). A search result clustering method using informatively named entities. Proceedings of the 7th annual ACM international workshop on Web information and data management. Bremen, Germany: ACM.
  • Uysal, I., & Croft, W. B. (2011). User oriented tweet ranking: A filtering approach to microblogs. Proceedings of the 20th ACM international conference on Information and knowledge management. Glasgow Scotland, UK: ACM.
  • Wang, F., Orton, K., Wagenseller, P., & Xu, K. (2018). Towards understanding community interests with topic modeling. IEEE Access, 6(1), 24660–24668. https://doi.org/10.1109/ACCESS.2018.2815904
  • Wang, L., Qiao, Y., Li, W., Xu, C., & Tang, X. (2018). Video classification method and apparatus, US Patent App. 15/167,388. Google Patents
  • Webster, J. J., & Kit, C. (1992). Tokenization as the initial phase in NLP. COLING 1992 Volume 4: The 15th International Conference on Computational Linguistics, Nantes, France.
  • Wille, R. (1992). Concept lattices and conceptual knowledge systems. Computers & Mathematics with Applications, 23(6–9), 493–515. https://doi.org/10.1016/0898-1221(92)90120-7
  • Wille, R. (2009). Restructuring lattice theory: An approach based on hierarchies of concepts. International Conference on Formal Concept Analysis. Darmstadt, Germany: Springer.
  • Winarko, E., & Pulungan, R. (2019). Trending topics detection of Indonesian tweets using BN-grams and Doc-p. Journal of King Saud University-Computer and Information Sciences, 31(2), 266–274. https://doi.org/10.1016/j.jksuci.2018.01.005
  • Xie, W., Zhu, F., Jiang, J., Lim, E.-P., & Wang, K. (2016). Topicsketch: Real-time bursty topic detection from twitter. IEEE Transactions on Knowledge and Data Engineering, 28(8), 2216–2229. https://doi.org/10.1109/TKDE.2016.2556661
  • Xu, G., Yu, Z., & Qi, Q. (2018). Efficient sensitive information classification and topic tracking based on tibetan Web pages. IEEE Access, 6(1), 55643–55652. https://doi.org/10.1109/ACCESS.2018.2870122
  • Yan, D., Hua, E., & Hu, B. (2016). An improved single-pass algorithm for Chinese microblog topic detection and tracking. 2016 IEEE International Congress on Big Data (BigData Congress). San Francisco, CA, USA: IEEE.
  • Yan, X., Guo, J., Lan, Y., & Cheng, X. (2013). A biterm topic model for short texts. Proceedings of the 22nd international conference on World Wide Web. Rio de Janeiro, Brazil: ACM.
  • Yang, Y., Pierce, T., & Carbonell, J. G. (1982). A study on retrospective and on-line event detection. SIGIR '98: Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information, August 1998, 28–36. https://doi.org/10.1145/290941.290953
  • Yin, H., Cui, B., Lu, H., Huang, Y., & Yao, J. (2013). A unified model for stable and temporal topic detection from social media data. 2013 IEEE 29th International Conference on Data Engineering (ICDE). Brisbane, QLD, Australia: IEEE.
  • Yu, Y., Li, M., & Fu, Y. (2018). Forest type identification by random forest classification combined with SPOT and multitemporal SAR data. Journal of Forestry Research, 29(5), 1407–1414. https://doi.org/10.1007/s11676-017-0530-4
  • Zhang, C., Pan, X., Li, H., Gardiner, A., Sargent, I., Hare, J., & Atkinson, P. M. (2018). A hybrid MLP-CNN classifier for very fine resolution remotely sensed image classification. ISPRS Journal of Photogrammetry and Remote Sensing, 140(6), 133–144. https://doi.org/10.1016/j.isprsjprs.2017.07.014
  • Zhang, C., Wang, H., Cao, L., Wang, W., & Xu, F. (2016). A hybrid term–term relations analysis approach for topic detection. Knowledge-Based Systems, 93(3), 109–120. https://doi.org/10.1016/j.knosys.2015.11.006
  • Zhang, D., & Li, S. (2011). Topic detection based on K-means. 2011 International Conference on Electronics, Communications and Control (ICECC). Ningbo, China: IEEE.
  • Zhang, M., Li, W., & Du, Q. (2018). Diverse region-based CNN for hyperspectral image classification. IEEE Transactions on Image Processing, 27(6), 2623–2634. https://doi.org/10.1109/TIP.2018.2809606
  • Zhao, D., Chang, Z., Du, N., & Guo, S. (2018). Classification for social media short text based on word distributed representation. International Conference on Web Information Systems and Applications. Taiyuan, China: Springer.
  • Zhou, P., Han, J., Cheng, G., & Zhang, B. (2019). Learning compact and discriminative stacked autoencoder for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing, 57(7), 4823–4833. doi:https://doi.org/10.1109/TGRS.2019.2893180.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.