1,955
Views
4
CrossRef citations to date
0
Altmetric
Articles

Noise Pollution: A Multi-Step Approach to Assessing the Consequences of (Not) Validating Search Terms on Automated Content Analyses

ORCID Icon, ORCID Icon & ORCID Icon

Figures & data

Table 1. Results of the systematic literature review.

Figure 1. Research design.

Note. The terms “validated” and “non-validated” text corpus, DFM, and topic model reflect that they were constructed using the validated or non-validated search term. They do not refer to post-hoc validation procedures, like the validation of results from unsupervised procedures such as topic modeling approaches.

Figure 1. Research design.Note. The terms “validated” and “non-validated” text corpus, DFM, and topic model reflect that they were constructed using the validated or non-validated search term. They do not refer to post-hoc validation procedures, like the validation of results from unsupervised procedures such as topic modeling approaches.

Figure 2. Comparing issue attention to climate change.

Figure 2. Comparing issue attention to climate change.

Figure 3. Comparing features with the largest difference in relative frequencies.

Figure 3. Comparing features with the largest difference in relative frequencies.

Figure 4. Comparing the most prevalent topics.

Figure 4. Comparing the most prevalent topics.

Figure 5. Comparing topical prevalence for the ten topics with the most similar matches.

Figure 5. Comparing topical prevalence for the ten topics with the most similar matches.

Figure 6. Comparing temporal development of topic pairs (highest and lowest cosine similarities).

Figure 6. Comparing temporal development of topic pairs (highest and lowest cosine similarities).
Supplemental material

Supplemental Material

Download PDF (584.7 KB)

SupplementaryMaterialR3_DJ.docx

Download MS Word (327.1 KB)