Using Computational Text Analysis Tools to Study African Online News Content



  • Afrobarometer. 2019. Rounds 5, 6 & 7. Cape Town.
  • Atteveldt, W. van, J. Strycharz, D. Trilling, and K. Welbers. 2019. “Toward open computational communication science: A practical road map for reusable data and code.” International Journal of Communication 13: 3935–3954.
  • Aukia, J., J. Heimonen, T. Pahikkala, and T. Salakoski. 2017. “Automated quantification of Reuters news using a receiver operating characteristic curve analysis: The Western media image of China.” Global Media and China 2 (3–4): 251–268. doi:10.1177/2059436418754890.
  • Barberá, P. 2015. “Birds of the same feather Tweet together: Bayesian ideal point estimation using Twitter data.” Political Analysis 23 (01): 76–91. doi:10.1093/pan/mpu011.
  • Baum, M. A., and Y. M. Zhukov. 2019. “Media ownership and News coverage of International conflict.” Political Communication 36 (1): 36–63. doi:10.1080/10584609.2018.1483606.
  • Bengfort, B., R. Bilbro, and T. Ojeda. 2018. Applied Text Analysis With Python: Enabling Language-Aware Data Products With Machine Learning. Sebastopol: O’Reilly.
  • Benoit, K., K. Watanabe, H. Wang, P. Nulty, A. Obeng, S. Müller, and A. Matsuo. 2018. “Quanteda: An R package for the quantitative analysis of textual data.” Journal of Open Source Software 3 (30): 774. doi:10.21105/joss.00774.
  • Blei, D. M., A. Y. Ng, and M. I. Jordan. 2003. “Latent dirichlet allocation.” Journal of Machine Learning Research 3 (Jan): 993–1022.
  • Boumans, J. W., and D. Trilling. 2016. “Taking stock of the Toolkit: An overview of relevant automated content analysis approaches and techniques for digital journalism scholars.” Digital Journalism 4 (1): 8–23. doi:10.1080/21670811.2015.1096598.
  • Bouvier, G., and D. Machin. 2018. “Critical discourse analysis and the challenges and opportunities of social media.” Review of Communication 18 (3): 178–192. doi:10.1080/15358593.2018.1479881.
  • Burgess, J., A. Bruns, and L. Hjorth. 2013. “Emerging methods for digital media research: An introduction.” Journal of Broadcasting & Electronic Media 57 (1): 1–3. doi:10.1080/08838151.2012.761706.
  • Charlesworth, A., and E. L. Tonkin. 2016. “If You Find Yourself in a Hole, Stop Digging: Legal and Ethical Issues of Text/Data Mining in Research.” In Working with Text, edited by E. L. Tonkin, and G. J. L. Tourte, 61–88. Amsterdam: Chandos Publishing.
  • Davies, M. 2019. The best of both worlds: Multi-billion word “dynamic” corpora [Application/pdf]. 23–28. https://doi.org/10.14618/IDS-PUB-9023.
  • De Grove, F., K. Boghe, and L. De Marez. 2020. “(What) can journalism studies learn from supervised machine learning?” Journalism Studies 21 (7): 912–927. doi:10.1080/1461670X.2020.1743737.
  • Dowle, M., and A. Srinivasan. 2019. data.table: Extension of “data.frame” (Version 1.12.2) [R]. Retrieved from https://CRAN.R-project.org/package=data.table.
  • Feinerer, I., K. Hornik, and D. Meyer. 2008. “Text Mining Infrastructure in R.” Journal of Statistical Software 25 (1): 1–54. doi:10.18637/jss.v025.i05.
  • Fengler, S., M. Kreutler, M. Alku, B. Barlovac, M. Bastian, S. S. Bodrunova, … R. Zguri. 2020. “The Ukraine conflict and the European media: A comparative study of newspapers in 13 European countries.” Journalism 21 (3): 399–422. doi:10.1177/1464884918774311.
  • Franklin, B. 2013. “Editorial.” Digital Journalism 1 (1): 1–5. doi:10.1080/21670811.2012.740264.
  • Gabore, S. M., and X. Deng. 2018. “Do National and International media cover the same event differently? The online media framing of irreecha festival tragedy.” Communicatio 44 (1): 55–70. doi:10.1080/02500167.2018.1441889.
  • Gerber, A. S., and D. P. Green. 2012. Field experiments: Design, analysis, and interpretation. 1st ed. New York: W. W. Norton.
  • Grasland, C. 2019. “International news flow theory revisited through a space–time interaction model: Application to a sample of 320,000 international news stories published through RSS flows by 31 daily newspapers in 2015.” International Communication Gazette 82 (3): 231–259. doi:10.1177/1748048518825091.
  • Grimmer, J., and B. M. Stewart. 2013. “Text as data: The promise and pitfalls of automatic content analysis methods for political texts.” Political Analysis 21 (3): 267–297. doi:10.1093/pan/mps028.
  • Guo, L., K. Mays, S. Lai, M. Jalal, P. Ishwar, and M. Betke. 2019. “Accurate, fast, but not always cheap: evaluating “crowdcoding” as an alternative approach to analyze social Media Data.” Journalism & Mass Communication Quarterly 97 (3): 811–834. doi:10.1177/1077699019891437.
  • Guo, L., and C. Vargo. 2020. ““Fake News” and emerging online media ecosystem: An integrated intermedia agenda-setting analysis of the 2016 U.S. presidential election.” Communication Research 47 (2): 178–200. doi:10.1177/0093650218777177.
  • Haim, M. 2020. “Agent-based Testing: An automated approach toward artificial reactions to human behavior.” Journalism Studies 21 (7): 895–911. doi:10.1080/1461670X.2019.1702892.
  • Hopp, F. R., J. A. Schaffer, J. T. Fisher, and R. Weber. 2019. “iCoRe: The GDELT Interface for the Advancement of Communication Research.” Computational Communication Research 1 (1): 13–44.
  • Hutchinson, J. 2016. “An introduction to digital media research methods: How to research and the implications of new media data.” Communication Research and Practice 2 (1): 1–6. doi:10.1080/22041451.2016.1155307.
  • Jurafsky, D., and J. H. Martin. 2018. Speech and Language Processing. An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Retrieved from https://web.stanford.edu/~jurafsky/slp3/.
  • Kananovich, V. 2018. “Framing the taxation-democratization link: An automated content analysis of cross-national newspaper data.” The International Journal of Press/Politics 23 (2): 247–267. doi:10.1177/1940161218771893.
  • Krippendorff, K. 2019. Content Analysis: An Introduction to Its Methodology. 4th ed. Thousand Oaks: Sage.
  • Lane, H., C. Howard, and H. M. Hapke. 2019. Natural Language Processing In Action: Understanding, Analyzing, And Generating text with Python. Shelter Island: Manning Publications Co.
  • Lang, A. 2019. “Spatial dialectics: Pursuing geospatial imaginaries with word embedding models and mapping.” Modernism/Modernity Print Plus 4 (2). doi:10.26597/mod.0116.
  • Laver, M., K. Benoit, and J. Garry. 2003. “Extracting policy positions from political texts using words as data.” American Political Science Review 97 (2): 311–331. doi:10.1017/S0003055403000698.
  • Lazer, D., A. Pentland, L. Adamic, S. Aral, A.-L. Barabási, D. Brewer, … M. V. Alstyne. 2009. “Computational social science.” Science 323 (5915): 721–723. doi:10.1126/science.1167742.
  • Lewis, S. C., R. Zamith, and A. Hermida. 2013. “Content Analysis in an Era of Big Data: A Hybrid approach to computational and manual methods.” Journal of Broadcasting & Electronic Media 57 (1): 34–52. doi:10.1080/08838151.2012.761702.
  • Lind, F., J.-M. Eberl, T. Heidenreich, and H. G. Boomgaarden. 2019. “When the journey is as important as the Goal: A roadmap to multilingual dictionary construction.” International Journal of Communication 13: 4000–4020.
  • Liu, B. 2015. Sentiment Analysis: Mining Opinions, Sentiments, and Emotions. New York: Cambridge University Press.
  • Lucas, C., R. Nielsen, M. Roberts, B. Stewart, A. Storer, and D. Tingley. 2015. “Computer assisted text analysis for comparative politics.” Political Analysis 23 (2): 254–277.
  • Lucas, C., and D. Tingley. 2014. TranslateR: Bindings for the Google and Microsoft Translation APIs (Version 1.0) [R]. Retrieved from https://CRAN.R-project.org/package=translateR.
  • Lukito, J., J. Suk, Y. Zhang, L. Doroshenko, S. J. Kim, M.-H. Su, … C. Wells. 2019. “The Wolves in Sheep’s clothing: How Russia’s Internet Research Agency Tweets Appeared in U.S. News as Vox Populi.” The International Journal of Press/Politics 25 (2): 196–216. doi:10.1177/1940161219895215.
  • Madrid-Morales, D. 2018. “African News with Chinese Characteristics: A Case Study of CGTN Africa.” (PhD Thesis, City University of Hong Kong). Retrieved from http://lbms03.cityu.edu.hk/theses/ftt/phd-com-23837390.pdf
  • McEnery, T., and A. Hardie. 2012. Corpus Linguistics: Method, Theory And Practice. Cambridge; New York: Cambridge University Press.
  • Nicholls, T. 2019. “Detecting Textual Reuse in News Stories, At Scale.” International Journal of Communication 13: 4173–4197.
  • Pasquale, F. 2015. The Black box society: The Secret Algorithms That Control Money And Information. Cambridge: Harvard University Press.
  • Possler, D., S. Bruns, and J. Niemann-Lenz. 2019. “Data Is the New Oil—But How Do We Drill It? Pathways to Access and Acquire Large Data Sets in Communication Science.” International Journal of Communication 13: 3894–3911.
  • Proksch, S.-O., W. Lowe, J. Wäckerle, and S. Soroka. 2019. “Multilingual Sentiment Analysis: A New Approach to Measuring Conflict in Legislative Speeches.” Legislative Studies Quarterly 44 (1): 97–131. doi:10.1111/lsq.12218.
  • R Core Team. 2017. R: A language and environment for statistical computing (Version 3.4.3). Retrieved from https://www.R-project.org/.
  • Reber, U. 2019. “Overcoming language barriers: Assessing the potential of machine translation and topic modeling for the comparative analysis of multilingual text corpora.” Communication Methods and Measures 13 (02): 102–125. doi:10.1080/19312458.2018.1555798.
  • Roberts, M. E., B. M. Stewart, D. Tingley, C. Lucas, J. Leder-Luis, S. K. Gadarian, … D. G. Rand. 2014. “Structural topic models for open-ended survey responses.” American Journal of Political Science 58 (4): 1064–1082. doi:10.1111/ajps.12103.
  • Silge, J., and D. Robinson. 2016. “Tidytext: Text mining and analysis using tidy data principles in R.” The Journal of Open Source Software 1 (3). doi:10.21105/joss.00037.
  • Snelson, C. L. 2016. “Qualitative and mixed methods social media research: A review of the literature.” International Journal of Qualitative Methods 15 (1): 1–15. doi:10.1177/1609406915624574.
  • Soroka, S., L. Young, and M. Balmas. 2015. “Bad News or Mad News? Sentiment scoring of negativity, fear, and anger in news content.” The ANNALS of the American Academy of Political and Social Science 659 (1): 108–121. doi:10.1177/0002716215569217.
  • Spirling, A., and P. L. Rodriguez. 2019. Word Embeddings: What works, what doesn’t, and how to tell the difference for applied research. New York.
  • Trilling, D., and J. G. F. Jonkman. 2018. “Scaling up content analysis.” Communication Methods and Measures 12 (2–3): 158–174. doi:10.1080/19312458.2018.1447655.
  • Weiss, G., and R. Wodak. 2007. Critical discourse analysis: Theory and interdisciplinarity. Houndmills: Palgrave.
  • Welbers, K., W. Van Atteveldt, and K. Benoit. 2017. “Text Analysis in R.” Communication Methods and Measures 11 (4): 245–265. doi:10.1080/19312458.2017.1387238.
  • Whitney, D. C., R. S. Sumpter, and D. McQuail. 2004. “News media production: Individuals, organizations and institutions.” In The SAGE handbook of media studies, edited by J. Downing, D. McQuail, P. Schlesinger, and E. Wartella, 393–410. Thousand Oaks: Sage.
  • Wickham, H. 2016. ggplot2: Elegant Graphics for Data Analysis. New York: Springer-Verlag.
  • Young, L., and S. Soroka. 2012. “Affective News: The automated coding of sentiment in political texts.” Political Communication 29 (2): 205–231. doi:10.1080/10584609.2012.671234.
  • Zamith, R. 2017. “Capturing and analyzing liquid content: A computational process for freezing and analyzing mutable documents.” Journalism Studies 18 (12): 1489–1504. doi:10.1080/1461670X.2016.1146083.
  • Zhang, L., and B. Liu. 2017. “Sentiment Analysis and opinion mining.” In Encyclopedia of Machine Learning and Data Mining, edited by C. Sammut, and G. I. Webb, 1152–1161. Boston: Springer. doi:10.1007/978-1-4899-7687-1_907.
