224
Views
3
CrossRef citations to date
0
Altmetric
Articles

Storing, preprocessing and analyzing tweets: finding the suitable noSQL system

ORCID Icon, ORCID Icon & ORCID Icon
Pages 586-595 | Received 28 May 2020, Accepted 01 Nov 2020, Published online: 17 Nov 2020

References

  • Kim J, Hastak M. Social network analysis: characteristics of online social networks after a disaster. Int J Inf Manage. 2018;38(1):86–96.
  • Thoma B, Sanders JL, Lin M, et al. The social media index: measuring the impact of emergency medicine and critical care websites. Western J Emergency Med. 2015;16(2):242–249.
  • Ahsan M, Sharma TP. Spams classification and their diffusibility prediction on twitter through sentiment and topic models. Int J Computers Appl. 2020; 42(5): 1-12. doi: https://doi.org/10.1080/1206212X.2020.1758430.
  • Memon I, Chen L, Majid A, et al. Travel recommendation using geo-tagged photos in social media for tourist. Wireless Personal Commun. 2015;80(4):1347–1362.
  • Sotiriadis MD, Van Zyl C. Electronic word-of-mouth and online reviews in tourism services: the use of twitter by tourists. Electronic Commerce Res. 2013;13(1):103–124.
  • Meehan K, Lunney T, Curran K, et al.Context-aware intelligent recommendation system for tourism. Pervasive Computing and Communications Workshops (PERCOM Workshops), 2013 IEEE International Conference on; IEEE; 2013. p. 328–331.
  • Shimada K, Inoue S, Maeda H, et al. Analyzing tourism information on twitter for a local city. 2011 First ACIS International Symposium on Software and Network Engineering; IEEE; 2011. p. 61–66
  • Fowler B, Godin J, Geddy M. Teaching case introduction to nosql in a traditional database course. J Inform Syst Education. 2016;27(2):99–103.
  • Amghar S, Cherdal S, Mouline SData integration and nosql systems: A state of the art. Proceedings of the 4th International Conference on Big Data and Internet of Things; 2019. p. 1–6
  • Leavitt N. Will nosql databases live up to their promise?. Computer. 2010;43(2):00–00.
  • DeCandia G, Hastorun D, Jampani M, et al. Dynamo: amazon's highly available key-value store. ACM SIGOPS operating systems review; Vol. 41, ACM; 2007. p. 205–220.
  • Amghar S, Cherdal S, Mouline SWhich nosql database for iot applications? 2018 International Conference on Selected Topics in Mobile and Wireless Networking (MoWNeT); IEEE; 2018. p. 131–137
  • Pokorny J. Nosql databases: a step to database scalability in web environment. Int J Web Information Syst. 2013;9(1):69–82.
  • Han J, Haihong E, Le G, et al. Survey on nosql database.Pervasive computing and applications (ICPCA), 2011 6th international conference on; IEEE; 2011. p. 363–366.
  • Redis. redis.io. Accessed: 2020-11-10
  • Lakshman A, Malik P. Cassandra: a decentralized structured storage system. ACM SIGOPS Operating Syst Rev. 2010;44(2):35–40.
  • Apache cassandra. Available From: http://cassandra.apache.org/doc/latest/cql/index.html. Accessed: 2020-11-10
  • Couchbase. Available From: https://docs.couchbase.com/home/. Accessed: 2020-11-10
  • Miller JJ. Graph database applications and concepts with neo4j. Proceedings of the Southern Association for Information Systems Conference, Atlanta, GA, USA; Vol. 2324, 2013. p. 36.
  • Taylor RC. An overview of the hadoop/mapreduce/hbase framework and its current applications in bioinformatics. BMC bioinformatics. Vol. 11, BioMed Central, 2010, p. S1.
  • Shoro AG, Soomro TR. Big data analysis: apache spark perspective. Global J Computer Sci Technol. 2015; 15(1): 8–16.
  • Iqbal MH, Soomro TR. Big data analysis: apache storm perspective. Int J Computer Trends and Technol. 2015;19(1):9–14.
  • Zaharia M, Xin RS, Wendell P, et al. Apache spark: a unified engine for big data processing. Commun ACM. 2016;59(11):56–65.
  • R website. Available From: https://cran.r-project.org/. Accessed: 2020-11-10
  • Sanner MF. Python: a programming language for software integration and development. J Mol Graph Model. 1999;17(1):57–61.
  • Chang M-LE, Chua HN. Sql and nosql database comparison. Future of Information and Communication Conference; Springer; 2018. p. 294–310.
  • ul Haque A, Mahmood T, Ikram N. Performance comparison of state of art nosql technologies using apache spark. Proceedings of SAI Intelligent Systems Conference; Springer; 2018. p. 563–576.
  • Pereira DA, de Morais WO, de Freitas EP. Nosql real-time database performance comparison. Int J Parallel, Emergent and Distributed Syst. 2018;33(2):144–156.
  • Celesti A, Lay-Ekuakille A, Wan J, et al. Information management in iot cloud-based tele-rehabilitation as a service for smart cities: comparison of nosql approaches. Measurement. 2020;151:107218.
  • Tang E, Fan Y. Performance comparison between five nosql databases. Cloud Computing and Big Data (CCBD), 2016 7th International Conference on; IEEE; 2016. p. 105–109
  • Chandra DG. Base analysis of nosql database. Future Generation Computer Syst. 2015;52:13–21.
  • Corbellini A, Mateos C, Zunino A, et al. Persisting big-data: the nosql landscape. Inf Syst. 2017;63:1–23.
  • Kan CY. Cassandra data modeling and analysis. Packt Publishing Ltd; Birmingham-UK; 2014.
  • Zahid A, Masood R, Awais Shibli M. Security of sharded nosql databases: A comparative analysis. 2014 Conference on Information Assurance and Cyber Security (CIACS); IEEE; 2014. p. 1–8.
  • Gu Y, Shen S, Wang J, et al. Application of nosql database mongodb. Consumer Electronics-Taiwan (ICCE-TW), 2015 IEEE International Conference on; IEEE; 2015. p. 158–159.
  • Scholar PG. Performance enhancement of data processing using multiple intelligent cache in hadoop
  • Aydin A, Anderson KBatch to real-time: Incremental data collection & analytics platform. Proceedings of the 50th Hawaii International Conference on System Sciences; 2017
  • Apache storm. Available From: https://storm.apache.org. Accessed: 2020-11-10
  • Song H, Ferry N, Høgenes J, et al. Model-driven engineering for the configuration and deployment of data processing applications. Proceedings of the 5th International Conference on Model-Driven Engineering and Software Development; Port, Portugal: February 19–21, 2017.
  • Lewis IM. Key-value storage systems (and beyond) with python. The Python Papers. 2010;5(3):1–5.
  • Dede E, Sendir B, Kuzlu P, et al. Processing cassandra datasets with hadoop-streaming based approaches. IEEE Trans Services Comput. 2016;9(1):46–58.
  • Vasavi S, Priya MP, Gokhale AA. Framework for geospatial query processing by integrating cassandra with hadoop. Knowledge Computing and Its Applications; Springer; 2018. p. 131–160.
  • Chaudhari AA, Mulay P. Scsi: Real-time data analysis with cassandra and spark. Big Data Processing Using Spark in Cloud; Springer; 2019. p. 237–264.
  • Lladós J, Cores F, Guirado F. Scalable consistency in t-coffee through apache spark and cassandra database. J. Comput. Biol. 2018;25(8):894–906.
  • Huang J, Zhang L. The big data processing platform for intelligent agriculture. AIP Conference Proceedings; Vol. 1864, AIP Publishing; 2017. p. 020033.
  • Dede E, Govindaraju M, Gunter D, et al. Performance evaluation of a mongodb and hadoop platform for scientific data analysis. Proceedings of the 4th ACM workshop on Scientific cloud computing; ACM; 2013. p. 13–20
  • Boulmakoul A, Karim L, Laarabi MH, et al. Mongodb-hadoop distributed and scalable framework for spatio-temporal hazardous materials data warehousing. 7th International Congress on Environmental Modelling and Software (iEMSs); 2014
  • Mongodb documentation. Available From: https://docs.mongodb.com/. Accessed: 2020-11-10
  • Le M-K, Chang H-T, Chang Y-M, et al. An efficient multilevel healthy cloud system using spark for smart clothes. Computer Symposium (ICS), 2016 International; IEEE; 2016, p. 182–186
  • Felipe A, Palacio O, Wauters T, et al. Scalable distributed traffic monitoring for enterprise networks with spark streaming. ECCWS2018, the 17th European Conference on Cyber Warfare and Security; 2018. p. 563–569.
  • Cai J, Jin Z. Real-time calculating over self-health data using storm. 2015 4th International Conference on Mechatronics, Materials, Chemistry and Computer Engineering; Atlantis Press; 2015
  • Hassan MK, Shakthi SP, Sasikala R. Sentimental analysis of amazon reviews using naïve bayes on laptop products with mongodb and r. IOP Conference Series: Materials Science and Engineering; Vol. 263, IOP Publishing; 2017. p. 042090.
  • Nayak A. MongoDB cookbook. Packt Publishing Ltd; Birmingham-UK ; 2014.
  • Asri H, Mousannif H, Al Moatassime H. Real-time miscarriage prediction with spark. Procedia Comput Sci. 2017;113:423–428.
  • Kagramanyan G, Poliseno L. Document stores and couchbase. 2017
  • Pratiba D, Shobha G, LalithKumar H, et al. Distributed web crawlers using hadoop. Int J Appl Eng Res. 2017;12(24):15187–15195.
  • Lv X, Cheng X, Tian S, et al. Storage and parallel topology processing of the power network based on neo4j. Control Conference (CCC), 2017 36th Chinese; IEEE; 2017. p. 10703–10707.
  • Neo4j. Available From: https://neo4j.com/developer/. Accessed: 2020-11-10
  • Do P, Pham P. W-pathsim++: the novel approach of topic-driven similarity search in large-scaled heterogeneous network with the support of spark-based datalog. 2018 10th International Conference on Knowledge and Systems Engineering (KSE); IEEE; 2018, p. 102–106.
  • Cattuto C, Quaggiotto M, Panisson A, et al. Time-varying social networks in a graph database: a neo4j use case. First international workshop on graph data management experiences and systems; ACM; 2013, p. 11
  • Drakopoulos G, Kanavos A. Tensor-based document retrieval over neo4j with an application to pubmed mining. Information, Intelligence, Systems & Applications (IISA), 2016 7th International Conference on; IEEE; 2016. p. 1–6.
  • Angadi S, Reddy VS. Multimodal sentiment analysis using relieff feature selection and random forest classifier. Int J Computers and Appl. 2019; 41(7):1–9. doi:https://doi.org/10.1080/1206212X.2019.1658054
  • Kabir M, Jahangir Kabir Mir Md, Xu S, et al. An empirical research on sentiment analysis using machine learning approaches. Intl J Computers and Appl. 2019; 41(6):1–9.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.