Communication Methods and Measures, Volume 12, 2018, Issue 2-3: Computational Methods for Communication Science


More than Bags of Words: Sentiment Analysis with Word Embeddings

Elena Rudkowsky, Faculty of Computer Science, University of Vienna, Vienna, Austria (corresponding author)

http://orcid.org/0000-0003-3193-6242

Martin Haselmayer, Department of Government, University of Vienna, Vienna, Austria

http://orcid.org/0000-0002-7765-5158

Matthias Wastian, Center for Computational Complex Systems, Technical University of Vienna, Vienna, Austria

Marcelo Jenny, Department of Political Science, University of Innsbruck, Innsbruck, Austria

http://orcid.org/0000-0003-1535-9094

Štefan Emrich, Drahtwarenhandlung (dwh) GmbH, Vienna, Austria

Michael Sedlmair, Computer Science, Jacobs University Bremen, Bremen, Germany

Pages 140-157 | Published online: 10 Apr 2018

https://doi.org/10.1080/19312458.2018.1455817


References

Aaldering, L., & Vliegenthart, R. (2016). Political leaders and the media. Can we measure political leadership images in newspapers using computer-assisted content analysis? Quality and Quantity, 50(5), 1871–1905.
Al-Rfou, R., Perozzi, B., & Skiena, S. (2013, August). Polyglot: Distributed word representations for multilingual NLP. Proceedings of the seventeenth conference on computational natural language learning (pp. 183–192). Sofia, Bulgaria. Association for Computational Linguistics. Retrieved from http://www.aclweb.org/anthology/W13-3520
Baumeister, R. F., Bratslavsky, E., Finkenauer, C., & Vohs, K. D. (2001). Bad is stronger than good. Review of General Psychology, 5(4), 323–370. doi:10.1037/1089-2680.5.4.323
Benoit, K., Conway, D., Lauderdale, B., Laver, M., & Mikhaylov, S. (2016). Crowd-sourced text analysis: Reproducible and agile production of political data. American Political Science Review, 110(2), 278–295. doi:10.1017/S0003055416000058
Boumans, J. W., & Trilling, D. (2016). Taking stock of the toolkit: An overview of relevant automated content analysis approaches and techniques for digital journalism scholars. Digital Journalism, 4(1), 8–23. doi:10.1080/21670811.2015.1096598
Burscher, B., Odijk, D., Vliegenthart, R., De Rijke, M., & De Vreese, C. H. (2014). Teaching the computer to code frames in news: Comparing two supervised machine learning approaches to frame analysis. Communication Methods and Measures, 8(3), 190–206. doi:10.1080/19312458.2014.937527
Ceron, A., Curini, L., & Iacus, S. M. (2015). Using sentiment analysis to monitor electoral campaigns: Method matters-evidence from the United States and Italy. Social Science Computer Review, 33(1), 3–20. doi:10.1177/0894439314521983
Ceron, A., Curini, L., & Iacus, S. M. (2016). First- and second-level agenda setting in the Twittersphere: An application to the Italian political debate. Journal of Information Technology and Politics, 13(2), 159–174.
Ceron, A., Curini, L., & Iacus, S. M. (2017). Politics and big data: Nowcasting and forecasting elections with social media. London, UK: Routledge.
Chollet, F. (2015). Keras. Retrieved from https://github.com/fchollet/keras
Diakopoulos, N., Naaman, M., & Kivran-Swaine, F. (2010). Diamonds in the rough: Social media visual analytics for journalistic inquiry. 2010 IEEE symposium on Visual Analytics Science and Technology (VAST) (pp. 115–122), Salt Lake City, UT.
Fayyad, U., Piatetsky-Shapiro, G., & Smyth, P. (1996). From data mining to knowledge discovery in databases. AI Magazine, 17(3), 37.
Fritzinger, F., & Fraser, A. (2010). How to avoid burning ducks: Combining linguistic analysis and corpus statistics for German compound processing. Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and Metrics MATR (pp. 224–234), Uppsala, Sweden.
Gold, V., Rohrdantz, C., & El-Assady, M. (2015). Exploratory text analysis using lexical episode plots. In E. Bertini, J. Kennedy, & E. Puppo (Eds.), Eurographics Conference on Visualization (EuroVis) - short papers, Cagliari, Italy. The Eurographics Association. doi:10.2312/eurovisshort.20151130
Goldberg, Y. (2016). A primer on neural network models for natural language processing. Journal of Artificial Intelligence Research, 57, 345–420.
Greene, Z., Ceron, A., Schumacher, G., & Fazekas, Z. (2016). The nuts and bolts of automated text analysis: Comparing different document pre-processing techniques in four countries. Retrieved from osf.io/ghxj8
Gregory, M. L., Chinchor, N., Whitney, P., Carter, R., Hetzler, E., & Turner, A. (2006). User-directed sentiment analysis: Visualizing the affective content of documents. In Proceedings of the workshop on sentiment and subjectivity in text (pp. 23–30), Sydney, Australia.
Grimmer, J., & Stewart, B. M. (2013). Text as data: The promise and pitfalls of automatic content analysis methods for political texts. Political Analysis, 21(3), 267–297. doi:10.1093/pan/mps028
Harris, Z. S. (1954). Distributional structure. Word, 10(2–3), 146–162. doi:10.1080/00437956.1954.11659520
Haselmayer, M., Hirsch, L., & Jenny, M. (2017). Love is blind. Partisan bias in the perception of positive and negative campaign messages. Paper prepared for presentation at the 7th Annual Conference of the European Political Science Association (EPSA), June 22–24, Milan, Italy.
Haselmayer, M., & Jenny, M. (2017). Sentiment analysis of political communication: Combining a dictionary approach with crowdcoding. Quality and Quantity, 51(6), 2623–2646. doi:10.1007/s11135-016-0412-4
Helms, L. (2008). Studying parliamentary opposition in old and new democracies: Issues and perspectives. The Journal of Legislative Studies, 14(1–2), 6–19. doi:10.1080/13572330801920788
Hopkins, D. J., & King, G. (2010). A method of automated nonparametric content analysis for social science. American Journal of Political Science, 54(1), 229–247. doi:10.1111/ajps.2010.54.issue-1
Kandel, S., Heer, J., Plaisant, C., Kennedy, J., Van Ham, F., Riche, N. H., … Buono, P. (2011). Research directions in data wrangling: Visualizations and transformations for usable and credible data. Information Visualization, 10(4), 271–288. doi:10.1177/1473871611415994
Kingma, D., & Ba, J. (2014). Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Kleinnijenhuis, J., Schultz, F., Oegema, D., & Van Atteveldt, W. (2013). Financial news and market panics in the age of high-frequency sentiment trading algorithms. Journalism, 14(2), 271–291. doi:10.1177/1464884912468375
Kotzias, D., Denil, M., De Freitas, N., & Smyth, P. (2015, August). From group to individual labels using deep features. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 597–606), Sydney, Australia.
Krippendorff, K. (2013). Content analysis: An introduction to its methodology (3rd ed.). Los Angeles, CA: Sage.
Le, Q., & Mikolov, T. (2014). Distributed representations of sentences and documents. International Conference on Machine Learning (ICML) (pp. 1188–1196), Beijing, China.
Lind, F., Gruber, M., & Boomgaarden, H. G. (2017). Content analysis by the crowd: Assessing the usability of crowdsourcing for coding latent constructs. Communication Methods and Measures, 11(3), 191–209. doi:10.1080/19312458.2017.1317338
Loper, E., & Bird, S. (2002). NLTK: The natural language toolkit. ACL workshop on effective tools and methodologies for teaching natural language processing and computational linguistics, Philadelphia, PA.
Lowe, W., & Benoit, K. (2013). Validating estimates of latent traits from textual data using human judgment as a benchmark. Political Analysis, 21(3), 298–313. doi:10.1093/pan/mpt002
Lucas, C., Nielsen, R. A., Roberts, M. E., Stewart, B. M., Storer, A., & Tingley, D. (2015). Computer-assisted text analysis for comparative politics. Political Analysis, 23(2), 254–277. doi:10.1093/pan/mpu019
Maas, A. L., Daly, R. E., Pham, P. T., Huang, D., Ng, A. Y., & Potts, C. (2011). Learning word vectors for sentiment analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human language technologies (ACL 2011) (pp. 142–150), Portland, OR.
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. CoRR. Retrieved from http://arxiv.org/abs/1301.3781
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013, December). Distributed representations of words and phrases and their compositionality. In Proceedings of the 26th International Conference on Neural Information Processing Systems (pp. 3111–3119), Lake Tahoe, NV.
Moraes, R., Valiati, J. F., & Neto, W. P. G. (2013). Document-level sentiment classification: An empirical comparison between SVM and ANN. Expert Systems with Applications, 40(2), 621–633. doi:10.1016/j.eswa.2012.07.059
Mozetič, I., Grčar, M., & Smailović, J. (2016). Multilingual Twitter sentiment classification: The role of human annotators. PLoS One, 11(5), e0155036. doi:10.1371/journal.pone.0155036
Müller, W. C. (1993). Executive–legislative relations in Austria: 1945–1992. Legislative Studies Quarterly, 18(4), 467–494. doi:10.2307/439851
Müller, W. C., Jenny, M., Dolezal, M., Steininger, B., Philipp, W., & Westphal, S. (2001). Die österreichischen Abgeordneten: Individuelle Präferenzen und politisches Verhalten. Wien, Austria: WUV Universitätsverlag.
Nasukawa, T., & Yi, J. (2003). Sentiment analysis: Capturing favorability using natural language processing. Proceedings of the 2nd international conference on knowledge capture (pp. 70–77), Sanibel Island, FL.
Nobata, C., Tetreault, J., Thomas, A., Mehdad, Y., & Chang, Y. (2016). Abusive language detection in online user content. Proceedings of the 25th international conference on world wide web (pp. 145–153), Montréal, Canada.
Pang, B., Lee, L., & Vaithyanathan, S. (2002). Thumbs up? Sentiment classification using machine learning techniques. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-2002) (pp. 79–86), Philadelphia, PA.
Parikh, A. P., Täckström, O., Das, D., & Uszkoreit, J. (2016). A decomposable attention model for natural language inference. Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (pp. 2249–2255), Austin, TX.
Parliament Austria. (2013). Parliamentary speeches from the Austrian National Parliament. Retrieved from https://www.parlament.gv.at/PERK/NRBRBV/NR/STENO/
Pennington, J., Socher, R., & Manning, C. D. (2014). GloVe: Global vectors for word representation. Proceedings of the Empirical Methods in Natural Language Processing (EMNLP 2014) (Vol. 14, pp. 1532–1543). Retrieved from https://nlp.stanford.edu/projects/glove/
Raschka, S. (2014). Naive Bayes and text classification I - Introduction and theory. arXiv preprint arXiv:1410.5329.
Rauh, C., De Wilde, P., & Schwalbach, J. (2017). The ParlSpeech data set: Annotated full-text vectors of 3.9 million plenary speeches in the key legislative chambers of seven European states. Harvard Dataverse, V1. doi:10.7910/DVN/E4RSP9
Rheault, L., Beelen, K., Cochrane, C., & Hirst, G. (2016). Measuring emotion in parliamentary debates with automated textual analysis. PLoS One, 11(12), e0168843. doi:10.1371/journal.pone.0168843
Rozin, P., & Royzman, E. B. (2001). Negativity bias, negativity dominance, and contagion. Personality and Social Psychology Review, 5(4), 296–320. doi:10.1207/S15327957PSPR0504_2
Russell, M., & Gover, D. (2017). Legislation at Westminster: Parliamentary actors and influence in the making of British law. Oxford, UK: Oxford University Press.
Salton, G., & McGill, M. J. (1986). Introduction to modern information retrieval. New York, NY: McGraw-Hill, Inc.
Slapin, J. B., & Proksch, O. (2014). Words as data: Content analysis in legislative studies. In S. Martin, K. Strom, & T. Saalfeld (Eds.), The Oxford handbook of legislative studies. Oxford, UK: Oxford University Press.
Socher, R., Perelygin, A., Wu, J. Y., Chuang, J., Manning, C. D., Ng, A. Y., & Potts, C. (2013). Recursive deep models for semantic compositionality over a sentiment treebank. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 1631–1642), Seattle, WA.
Soroka, S., & McAdams, S. (2015). News, politics, and negativity. Political Communication, 32(1), 1–22. doi:10.1080/10584609.2014.881942
Soroka, S., Young, L., & Balmas, M. (2015). Bad news or mad news? Sentiment scoring of negativity, fear, and anger in news content. The Annals of the American Academy of Political and Social Science, 659(1), 108–121. doi:10.1177/0002716215569217
Stolte, C., Tang, D., & Hanrahan, P. (2002). Polaris: A system for query, analysis, and visualization of multidimensional relational databases. IEEE Transactions on Visualization and Computer Graphics, 8(1), 52–65. doi:10.1109/2945.981851
Tang, D., Wei, F., Yang, N., Zhou, M., Liu, T., & Qin, B. (2014). Learning sentiment-specific word embedding for Twitter sentiment classification. ACL (Vol. 1, pp. 1555–1565), Baltimore, MD.
Turney, P. D., & Pantel, P. (2010). From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research, 37, 141–188.
Van Atteveldt, W., Kleinnijenhuis, J., & Ruigrok, N. (2008b). Parsing, semantic networks, and political authority using syntactic analysis to extract semantic relations from Dutch newspaper articles. Political Analysis, 16(4), 428–446. doi:10.1093/pan/mpn006
Van Atteveldt, W., Kleinnijenhuis, J., Ruigrok, N., & Schlobach, S. (2008a). Good news or bad news? Conducting sentiment analysis on Dutch text to distinguish between positive and negative relations. Journal of Information Technology and Politics, 5(1), 73–94. doi:10.1080/19331680802154145
Van Atteveldt, W., Sheafer, T., Shenhav, S. R., & Fogel-Dror, Y. (2017). Clause analysis: Using syntactic information to automatically extract source, subject, and predicate from texts with an application to the 2008–2009 Gaza War. Political Analysis, 25(2), 207–222. doi:10.1017/pan.2016.12
Wilkerson, J., & Casas, A. (2017). Large-scale computerized text analysis in political science: Opportunities and challenges. Annual Review of Political Science, 20(1), 529–544. doi:10.1146/annurev-polisci-052615-025542
Wang, H., Can, D., Kazemzadeh, A., Bar, F., & Narayanan, S. (2012). A system for real-time Twitter sentiment analysis of 2012 U.S. presidential election cycle. Proceedings of the ACL 2012 System Demonstrations (pp. 115–120), Jeju Island, Korea.
Wueest, B., Clematide, S., Bünzli, A., Laupper, D., & Frey, T. (2011). Electoral campaigns and relation mining: Extracting semantic network data from newspaper articles. Journal of Information Technology and Politics, 8(4), 444–463. doi:10.1080/19331681.2011.567387
Wulczyn, E., Thain, N., & Dixon, L. (2016). Ex machina: Personal attacks seen at scale. CoRR, abs/1610.08914. Retrieved from http://arxiv.org/abs/1610.08914
Young, L., & Soroka, S. (2012). Affective news: The automated coding of sentiment in political texts. Political Communication, 29(2), 205–231. doi:10.1080/10584609.2012.671234
