441
Views
0
CrossRef citations to date
0
Altmetric
Research Articles

Transparent generosity. Introducing the impresso interface for the exploration of semantically enriched historical newspapers

ORCID Icon, ORCID Icon & ORCID Icon

Figures & data

Table 1. impresso’s user requirements gathering process. Overview of the involvement and contributions of stakeholders in history and libraries.

Table 2. Retrieve relevant content: Search, filter, and discover.

Table 3. Understand what is there: Aggregate, document, and contextualize.

Table 4. Compare and assess significance.

Table 5. Externalize: Export, link, and embed.

Table 6. Create transparency through documentation and educational materials.

Figure 1. Search pill with suggestions for the keyword “atom” (blue) including entity mentions (orange), topics (green) linked named entities (yellow), user-created collections (blue).

Figure 1. Search pill with suggestions for the keyword “atom” (blue) including entity mentions (orange), topics (green) linked named entities (yellow), user-created collections (blue).

Figure 2. A Query for “atom” and the linked entity “Otto Hahn” with suggested keywords based on word embeddings (left), the human readable summary of the executed query (centre top, highlighted) and snippet previews of search results (centre).

Figure 2. A Query for “atom” and the linked entity “Otto Hahn” with suggested keywords based on word embeddings (left), the human readable summary of the executed query (centre top, highlighted) and snippet previews of search results (centre).

Figure 3. Image similarity ranking reveals image reuse between 1984 and 1996 (images blurred for copyright reasons).

Figure 3. Image similarity ranking reveals image reuse between 1984 and 1996 (images blurred for copyright reasons).

Figure 4. Searching for instances of text reuse in the nuclear power collection revealed a pro-nuclear PR campaign published between May and November 1977 in Swiss French-language newspapers.

Figure 4. Searching for instances of text reuse in the nuclear power collection revealed a pro-nuclear PR campaign published between May and November 1977 in Swiss French-language newspapers.

Figure 5. Experimental recommender system with option to assign weight on co-occurring entities, temporal proximity, topics, and text reuse.

Figure 5. Experimental recommender system with option to assign weight on co-occurring entities, temporal proximity, topics, and text reuse.

Figure 6. Newspaper page with information on publication periods, available issues, and indication of missing pages.

Figure 6. Newspaper page with information on publication periods, available issues, and indication of missing pages.

Figure 7. Distribution of articles over time for the query on nuclear power content.

Figure 7. Distribution of articles over time for the query on nuclear power content.

Figure 8. Preview of the distribution of a linked named entity in the corpus.

Figure 8. Preview of the distribution of a linked named entity in the corpus.

Figure 9. Overview of the distribution of a linked named entity in the corpus and contexts in which it appears.

Figure 9. Overview of the distribution of a linked named entity in the corpus and contexts in which it appears.

Figure 10. Facsimile view of a newspaper page with marginalia and searchable table of content on the left.

Figure 10. Facsimile view of a newspaper page with marginalia and searchable table of content on the left.

Figure 11. Using the Inspect component to contrast the distribution of “bikini” in French and German language content.

Figure 11. Using the Inspect component to contrast the distribution of “bikini” in French and German language content.

Figure 12. Using the Compare component to assess the proportion of content related to nuclear power and nuclear weapons in German-language newspapers.

Figure 12. Using the Compare component to assess the proportion of content related to nuclear power and nuclear weapons in German-language newspapers.

Figure 13. N-Gram frequencies of “atom”, “atomenergie” and “kernenergie”.

Figure 13. N-Gram frequencies of “atom”, “atomenergie” and “kernenergie”.

Table 7. Overview of the integration of main semantic enrichments in impresso Search page, Inspect & Compare and availability of dedicated components for exploration.

Figure 14. Example of an i-button with link to corresponding FAQ article.

Figure 14. Example of an i-button with link to corresponding FAQ article.