218
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Disentangling the role of algorithm awareness and knowledge in digital inequalities: an empirical validation of an explanatory model

, ORCID Icon, & ORCID Icon
Received 29 May 2023, Accepted 14 May 2024, Published online: 10 Jun 2024

ABSTRACT

Algorithms have become almost ubiquitous on the internet. They shape the way internet users engage with internet services across diverse online domains and what outcomes they obtain from their internet uses. While scholars have extensively studied the societal implications of algorithms in the digital world, only a few studies investigated the relationship between digital inequalities and algorithm awareness and knowledge. Little is known about how ubiquity of internet access and disparities in skills affect algorithm awareness and knowledge of internet users, and how they affect their internet uses and outcomes. To fill this important gap, this study presents and tests an explanatory model to assert how algorithm awareness and knowledge are related to the three levels of digital inequalities. The data were collected with a face-to-face survey (response rate = 54%) on a representative sample of internet users (N = 802) in Slovenia in 2022. Results of path analysis confirmed the sequential paths between the three levels of digital inequalities, suggesting that ubiquity of internet access strongly determines internet skills and internet uses, which in turn affect tangible internet outcomes. While ubiquity of internet access affected only algorithm awareness, internet skills predicted both algorithm awareness and knowledge. Importantly, algorithm knowledge was a significant determinant of internet uses. Age, education, and income moderated some of the relationships in the model. Overall, the study demonstrates that research on digital inequalities and related interventions need to address algorithm awareness and knowledge, and also consider how social inequalities among internet users shape them.

Introduction

Digital inequalities have long been an issue of concern for researchers. Starting with the so-called digital divide, studies focused on access to computers and the internet (e.g., Helsper, Citation2021; van Dijk, Citation2020). However, the field quickly diversified to examine different types and quality of access (first-level digital inequalities) (van Deursen & van Dijk, Citation2019), and differences in the types and levels of internet skills and internet uses (second-level digital inequalities) (DiMaggio et al., Citation2004; Robinson et al., Citation2015). More recently, scholars have been concerned with a new dimension of digital inequalities – whether people are aware of algorithms and what they know about them (Eslami et al., Citation2015, Citation2017; Hamilton et al., Citation2014; Rader & Gray, Citation2015). In fact, Gran et al. (Citation2021) argue that

[b]eing aware of [algorithms] and navigating consciously on the internet infrastructure could be seen as a new and reinforced level of digital divide. It is more subtle, more difficult to cope with than other skills, and at least as powerful as other skills and usage-based divisions. (pp. 1791–1792)

When examining both general awareness of algorithms, and how much participants understood about how algorithms work (i.e., algorithm knowledge), Fouquaert and Mechant (Citation2022) showed that general awareness, but not understanding of algorithms’ inner workings, was positively associated with a more critical stance toward algorithms. Their findings support recent conceptual propositions of Dogruel (Citation2021), suggesting that algorithm awareness and knowledge represent the two key dimensions of algorithm literacyFootnote1 since they affect the online activities of the largest number of internet users (Dogruel, Citation2021; Dogruel et al., Citation2022). Accordingly, we will focus only on these two dimensions of algorithm literacy in this study.

Despite increasing research on algorithm awareness and knowledge (Dogruel, Citation2021) among users of various online platforms and services (e.g., social networking sites and search engines), only a few studies focus on these two concepts as facets of digital inequality research (e.g., Cotter & Reisdorf, Citation2020). As such, it is unclear whether and to what extent algorithm awareness and knowledge play a role in the sequential pathways leading from first- and second level to third-level digital inequalities, that is, the outcomes that individuals gain from using the internet (van Deursen et al., Citation2017).

The limited empirical evidence suggests that higher levels of internet skills are positively correlated with algorithm awareness and knowledge (e.g., Reisdorf & Blank, Citation2021). Likewise, a positive association was found between internet uses and algorithm awareness and knowledge. Both frequency and breadth (i.e., variety) of internet uses were positively associated with a better understanding of algorithms (Cotter & Reisdorf, Citation2020; Oeldorf-Hirsch & Neubaum, Citation2023).

The current study contributes by comprehensively examining the role of algorithm awareness and algorithm knowledge in sequential paths of digital inequalities. It builds on the model of compound and sequential digital exclusion – MCSDE (van Deursen et al., Citation2017) – to test how ubiquity of internet access and internet skills affect algorithm awareness and knowledge, and how algorithm awareness and knowledge, in turn, affect the variety of internet users’ engagement with the internet, resulting in different levels of achieved tangible internet outcomes. In line with MCSDE, we also investigate whether internet users’ digital inequalities interact with their demographic and socio-economic characteristics. The model was tested with path analysis using a representative subsample of internet users (N = 802) in Slovenia. The results confirm the proposed model, indicating that algorithm awareness and knowledge should be incorporated not only in conceptual models of digital inequalities but should also be part of any effective digital inclusion intervention.

Background

The MCSDE (van Deursen et al., Citation2017) assumes that the three levels of digital inequalities are related with sequential paths, which determine the level of digital deprivation among internet users. For instance, sequential digital deprivation is indicated when users who lack a particular digital resource (e.g., lack of skills) are also affected by shortage of another digital resource (e.g., low levels of internet use). Following the sequential logic of the MCSDE, we assume that algorithm awareness and knowledge represent two digital resources and thus can be hypothetically related to the different levels of digital inequalities (Hargittai et al., Citation2020). In this section, we explore how first-level digital inequalities (i.e., ubiquity of internet access) are related to second-level digital inequalities (i.e., internet skills and uses). We then examine algorithm awareness and knowledge, and how these concepts are related to digital inequalities, and in particular to skills, breadth of internet uses, and internet outcomes (third-level digital inequalities).

Internet access, uses and skills

First-level digital inequalities concern themselves with access to both devices and the internet. While access was initially conceptualized along the yes/no axis of having a computer and/or the internet, the field soon shifted to understanding access as a gradation of no access to high levels of access. In particular, Helsper (Citation2021) pointed out the importance of ubiquity of internet access, which refers to whether someone owns a number of (different) devices and has internet access through various means, such as through an internet service provider at home, mobile access through a data plan, and at various other places, such as work, school, or public places (van Deursen & van Dijk, Citation2019; van Dijk, Citation2020). As part of access, we also have to consider the quality of a connection or a device and whether users have the means and autonomy to maintain both (Helsper, Citation2021). All three aspects of access have been shown to be unevenly distributed across populations, with those who are better off socio-economically having more ubiquitous and better-quality access (Robinson et al., Citation2015; van Deursen et al., Citation2017).

As researchers realized that digital inequalities are about more than just binary notions of access, they also shifted their attention toward second-level digital inequalities, focusing on digital and internet skills as well as differences in usage (DiMaggio et al., Citation2004). Internet skills are often higher among younger people, those who have higher educational qualifications, higher incomes, and work in occupations that utilize a lot of digital media (e.g., Scheerder et al., Citation2017). There has been a plethora of evidence linking access and skills – meaning that those who have more and better access opportunities also have better internet skills (Hargittai et al., Citation2019; Scheerder et al., Citation2017; van Deursen et al., Citation2017). In line with prior research findings and in accordance with the MCSDE, we hypothesize ():

H1: Ubiquity of internet access positively affects internet skills.

There has been an equally vast amount of research on the different types, breadth, and amount of internet uses – the second aspect of second-level digital inequalities (Blank & Grošelj, Citation2014). A range of studies have shown that uses, much like access and skills, are not equal across different populations. Those who are already better off use the internet both more broadly and for more capital-enhancing types of uses (Büchi et al., Citation2016; Hargittai & Hinnant, Citation2008; Robinson et al., Citation2015). Additional studies have demonstrated that those with more access points use the internet for a wider range of activities than those who only access the internet through mobile phones (Reisdorf et al., Citation2022). Therefore, we hypothesize:

H2: Ubiquity of internet access has a positive effect on breadth of internet uses.

Additionally, various studies based on the MCSDE have demonstrated a positive link between levels of internet skills and the breadth and types of internet uses (Helsper, Citation2021; Petrovčič et al., Citation2022; van Deursen et al., Citation2017). Therefore, we assume that:

H3: Internet skills have a positive effect on breadth of internet uses.

Algorithm awareness and knowledge and digital inequalities

Various studies have examined to what extent internet users are aware that algorithms affect what they are seeing online (i.e., algorithm awareness) and how much they know or understand about how algorithms work (i.e., algorithm knowledge) (Dogruel, Citation2021).

Early studies have focused on algorithm awareness on social media (Eslami et al., Citation2015; Hamilton et al., Citation2014; Rader & Gray, Citation2015) and search engines like Google (Powers, Citation2017), often among college students. These studies showed that awareness was low among participants although it appears to be increasing over time, partly due to increased media coverage of the issue. Increased awareness was associated with stronger feelings of control over the features of the specific website and changes in use (Eslami et al., Citation2017; Rader & Gray, Citation2015). More recent investigations on algorithm awareness have diversified platforms to include social media like Instagram, TikTok, and more general awareness with reference to algorithms that users encounter in their day-to-day online dealings. Overall, these studies converge to demonstrate that algorithm awareness varies across platforms and that greater awareness often goes along with modified behaviors (Fletcher & Nielsen, Citation2019; Fouquaert & Mechant, Citation2022; Siles et al., Citation2022).

Accordingly, several scholars have recently argued that considering the ubiquity of algorithms across platforms, algorithm awareness and knowledge are new aspects of digital inequalities that need to be considered alongside more ‘traditional’ concepts, such as access, skills, and usage. For instance, Wei and Yan (Citation2023) showed that users who use personal computers alongside mobile devices to go online have a higher level of algorithm awareness than mobile-only users because they can observe the consequences of algorithm curation for content recommendation across different devices. Following this evidence, we hypothesize:

H4: Ubiquity of internet access positively affects algorithm awareness.

H5: Ubiquity of internet access positively affects algorithm knowledge.

Focusing on Google Search, both Cotter and Reisdorf (Citation2020) and Reisdorf and Blank (Citation2021), examined US internet users’ understanding of how algorithms related to search engines work (i.e., algorithm knowledge). Both studies found that those with higher self-rated internet skills had higher algorithm knowledge (Cotter & Reisdorf, Citation2020; Reisdorf & Blank, Citation2021). Moreover, in their qualitative study of internet users in the US and four European countries (Germany, Hungary, Bosnia, and Serbia), Gruber et al. (Citation2021) examined whether people were aware of algorithms and what they understood about them with a particular focus on voice assistants. They found that broader previous internet experience and general internet skills increased algorithm awareness and knowledge. In a separate paper based on the same data, Hargittai et al. (Citation2019) make a strong argument about why algorithm awareness and knowledge should be measures of broader digital skills, and how future studies could account for these factors.

We concur that algorithm awareness and knowledge are part of a larger set of proficiencies needed to engage with the internet. However, we also hold that internet skills are distinct from and in fact a prerequisite for being able to encounter, make sense of, and use algorithms online (Pinski & Benlian, Citation2024). Indeed, internet skills relate to the ability to use the internet efficiently (van Deursen et al., Citation2016), while algorithm awareness and knowledge enable users to identify the presence and consequences of algorithms online (Dogruel, Citation2021). Hence, we position internet skills as antecedents of algorithm awareness and knowledge in the MCSDE, proposing the following two hypotheses:

H6: Internet skills have a positive effect on algorithm awareness.

H7: Internet skills have a positive effect on algorithm knowledge.

Cotter and Reisdorf (Citation2020) as well as Lomborg and Kapsch (Citation2020), argued that algorithm awareness affects knowledge. Without awareness, users do not have the basis for building an understanding of algorithms and developing a deeper insight into the principles and methods that underlie algorithms as well as their social and political implications. We hypothesize:

H8: Algorithm awareness has a positive effect on algorithm knowledge.

Although we understand that internet skills and uses are in a recursive relationship – i.e., they influence each other – prior sequential models have placed internet skills as a precursor for broader internet engagement (Helsper, Citation2021; Petrovčič et al., Citation2022; van Deursen et al., Citation2017). In this sense, algorithm awareness and knowledge play the role of mediators between internet skills and uses because, according to DeVito (Citation2021, p. 3), they refer to ‘the capacity and opportunity to be aware of both the presence and impact of algorithmically-driven systems on self- or collaboratively-identified goals, and the capacity and opportunity to crystalize this understanding into a strategic use of these systems to accomplish said goals.’ Hence, we assume that:

H9: Algorithm awareness positively affects breadth of internet uses.

H10: Algorithm knowledge positively affects breadth of internet uses.

Tangible internet outcomes

We assume that algorithm awareness and knowledge do not have a direct association with internet outcomes, but rather that breadth of internet uses mediates the relationship between algorithm awareness and knowledge and internet outcomes (DeVito, Citation2021). Several studies based on the MCSDE have linked broader internet uses with more beneficial outcomes of using the internet (van Deursen & Helsper, Citation2015, Citation2018). For example, van Deursen et al. (Citation2017) found that broader economic internet use, such as selling something online or looking for a job online, resulted in more beneficial economic outcomes, such as saving money or finding a better paid position. Although of course not all broader internet uses are beneficial and overuse can result in reduced well-being (Büchi et al., Citation2019; Gui & Büchi, Citation2021), in this study, following the MCSDE, we focus on specific capital-enhancing uses and outcomes related to digital deprivation. Therefore, we suggest:

H11: Breadth of internet uses has a positive effect on internet outcomes.

Figure 1. Explanatory model with hypothesized path relationships.

Figure 1. Explanatory model with hypothesized path relationships.

The moderating role of demographic and socio-economic characteristics

The MCSDE also suggests that disparities among internet users at each level of digital inequalities interact with their demographic and socio-economic characteristics such as gender, age, education, occupation, and income (Helsper, Citation2021; van Deursen et al., Citation2017). The level of awareness and knowledge about algorithms has also been correlated with these demographic and socio-economic variables (e.g., Cotter & Reisdorf, Citation2020; Dogruel et al., Citation2022; Gran et al., Citation2021; Wei & Yan, Citation2023; Ytre-Arne & Moe, Citation2021). For instance, Gran et al. (Citation2021) found differences in algorithm awareness by age, education, and gender, with younger internet users, those with higher education, and men being more aware of algorithms. Moreover, Wei and Yan (Citation2023) showed that the level of algorithm awareness was positively correlated with occupation and income. However, since the effect of personal and positional characteristics of users can vary across sequential paths of digital exclusion (van Deursen et al., Citation2017; van Dijk, Citation2020), we pose the following RQ:

RQ1: How do the hypothesized relationships in the proposed explanatory model interact with gender, age, education, occupation, and income differences among internet users?

Methods

Procedures and data

The data used in this study were collected from the 2022 wave of the Slovenian Public Opinion Survey. A face-to-face survey was conducted between April and August 2022. Participants aged 18 + years were selected from the Central Register of Population (CRP) using two-stage random sampling with stratification by type of settlement and the statistical region of residence. The survey was completed by 1001 respondents, yielding a response rate of 54% after non-eligible units were excluded (AAPOR, Citation2016). The socio-demographic characteristics of the respondents closely mirrored the characteristics of the general population when compared with data from the CRP. Ethical review and approval were waived for this study by the Ethics Committee of the Faculty of Social Sciences, University of Ljubljana, as the questionnaire did not cover any sensitive issues and did not collect any personally identifiable data. In line with the General Data Protection Regulations, informed consent was obtained from all respondents involved in the study.

The sample characteristics are presented in . In this study, we analyzed a subsample of respondents who had used the internet in the last three months (N = 802; 80.1%).

Table 1. Socio-demographic characteristics of the sample.

Measures

Internet outcomes were measured on a 10-item inventory adapted from Helsper et al. (Citation2015) and validated in Slovenia by Petrovčič et al. (Citation2022). The inventory asserted four types of achieved tangible outcomes that the respondents performed online for private purposes in the previous 12 months: economic, cultural, social, and personal (Table OS1 in Online Supplement). Each item was measured on a 5-point Likert scale ranging from 1 (strongly disagree) to 5 (strongly agree). The respondents were also provided with the possibility of choosing the ‘Not applicable’ response option (recoded to missing values in the analysis) if they did not undertake the activity in question. Since internet outcomes were defined as a formative construct, we conducted a principal component analysis (PCA) to extract a single component for all items (Petter et al., Citation2007). All items had > .50 correlation with the component, indicating that all items sufficiently contribute to the construct (Hair et al., Citation2018). The scale was not affected by multicollinearity (max[VIF] < 2.0; Table OS1). Accordingly, the scores on the internet outcomes scale were calculated by averaging across the 10 items with higher score values indicating higher achieved tangible outcomes.

Breadth of internet uses was assessed using a 17-item inventory adapted from Blank and Grošelj (Citation2014) and validated by Reisdorf et al. (Citation2021) in Slovenia. Four items assessed cultural, social, and personal uses respectively, and five items measured economic uses. All items were scored on a 6-point ordinal scale ranging from 1 (never) to 6 (several times a day). Since the aim of this scale is to capture the variability – rather than intensity – of internet uses, all answers were recoded into a dichotomous answering scale (‘never’ = 0; any other response = 1), reflecting whether the internet was used for a particular activity or not (Blank & Grošelj, Citation2014). As internet uses have been defined as a formative construct (Reisdorf et al., Citation2021), the same validation procedure as for internet outcomes was undertaken. The PCA indicated that the 17 items sufficiently contributed to the construct (Table OS2). There were no issues with multicollinearity (max[VIF] < 2.2). Accordingly, a single formative composite score of internet uses was obtained by calculating the average number of activities per dimension, and then averaging across all four dimensions. Users with a higher composite score engaged in a higher percentage of different online activities.

An adapted short version of the Algorithm Literacy Scale – ALS (Dogruel et al., Citation2022) was used to assess algorithm awareness and knowledge. Five items were used to measure awareness and five for knowledge. Items measuring awareness asked for which purposes and in what (media) products algorithms are used. Items asserting knowledge tested participants’ understanding of how algorithms work and what implications they can have for users (Dogruel et al., Citation2022). All items were formulated as a test with true-/false-statements. Correct answers were coded as 1 whereas incorrect answers, including ‘I do not know’, as 0 (Dogruel et al., Citation2022). Both scales were validated by using the Rasch model (Emberstone & Reise, Citation2013). We first examined the difficulties of the items. For items measuring algorithm awareness difficulties ranged from P = .39 to P = .64, while for algorithm knowledge from P = .08 to P = .52. We then fitted a Rasch model on both sets individually. Based on item fit statistics and local dependence we excluded one item from each scale (Table OS3). The modified 4-item scales were good fitting with adequate reliability (awareness: KR20 = .83; knowledge: KR20 = .75). The scale scores for both dimensions of the short ALS were obtained by summing the respondent’s correct answers. Higher values of scores indicated higher levels of algorithm awareness and knowledge.

Internet skills were assessed using a short form of the Internet Skill Scale – ISS (van Deursen et al., Citation2016), validated in Slovenia by Grošelj et al. (Citation2021). In this study, ISS was considered a second-order factor with four first-order dimensions corresponding to operational, information navigation, social, and creative skills (Grošelj et al., Citation2021). Each dimension was measured with five Likert-type items. The respondents answered on a scale from 1 (not at all true of me) to 5 (very true of me) and included an ‘I do not understand what you mean by that’ response option which was recoded to missing values. Items measuring information navigational skills were reversed before the analysis. Following a confirmatory factor analysis (CFA), we dropped three poorly performing items. The second-order model indicated sufficient fit to the data χ2(df) = 424.265(115), p < .001, CFI = .947, RMSEA = .068 (90% CI = .061–.075), SRMR = .057 (Table OS4 and Figure OS1) and had adequate reliability (Cronbach’s alpha = .90; Composite reliability = .79). Accordingly, the items for each dimension were averaged to obtain subscale scores, and then the mean of the subscale scores was calculated to obtain a single scale score with higher values indicating higher level of internet skills.

Ubiquity of internet access was assessed using an adapted Material Internet Access scale (van Deursen & van Dijk, Citation2015). Respondents were asked how often they used each of the following five devices to access the internet: desktop or laptop PC, tablet PC, smartphone, smart TV, other internet-based devices such as game console or electronic reader. They provided answers on a 6-point ordinal frequency-based scale ranging from 1 (never) to 6 (several times a day). All answers were transformed into a dichotomous answering scale, reflecting whether each of the five devices was used to access the internet. All recoded items were averaged into a single score that reflects the number of devices used to access the internet with a higher value of the score indicating more ubiquitous access.

Data analysis

Hypotheses in the model were tested with path analysis (Kline, Citation2015), since some concepts were defined as formative constructs (i.e., internet uses, internet outcomes). Before conducting the path analysis, we screened the data for missing values, variable distributions, and their bivariate correlations (). As 347 (43.3%) responses had at least one missing value, the model was estimated by using the full information maximum likelihood (FIML) to avoid dropping such units (Kline, Citation2015). FIML was selected as the variables in the model can explain each other’s missingness (Table OS5) and because FIML provides less biased and more powerful estimates than other missing data techniques (Newman, Citation2014). To accommodate the non-normal distributions of the observed variables the robust maximum likelihood (MLR) estimator was used (Kline, Citation2015). All analyzes were conducted in R (R Core Team, Citation2022) using the eRm package for analysis of the ALS (Mair et al., Citation2021) and the lavaan package (Rosseel, Citation2012) for CFA and path analysis.

Table 2. Descriptive statistics and correlations among observed variables.

To answer the RQ1, we conducted a multi-group analysis (MGA) to test if path estimates differ across gender (male and female), age (18–44, 45–64, 65+), educational (high school or less and college or more), occupational (low- and high-skill job), and income (above and below average) groups. Following Hair et al. (Citation2018), we first tested the model fit for each of these groups. Next, to assess potential differences between groups, we compared an unconstrained model (i.e., where path estimates in each group are freely estimated) to the constrained model (i.e., where path estimates are set to equal across groups) using the chi square difference (Δχ2) test. A non-significant result indicates that the constraints do not worsen the model fit and hence the estimates can be considered equal across groups. Otherwise, the fit is worse, and estimates cannot be considered equal. In such cases, we tested each individual path estimate for (in)equality and subsequently assessed a modified constrained model where invariant estimates were constrained while non-invariant estimates were freely estimated.

Results

The tested model demonstrated an excellent fit to the data (N = 802): χ2(df) = 8.395(4), p = .078, CFI = .996, RMSEA = .046 (90% CI = .000–.095); SRMR = .024 ( and ). Ubiquity of internet access had a significant positive effect on internet skills (β = .452), breadth of internet uses (β = .319), and algorithm awareness (β = .136). Results also demonstrated significant direct positive effects of internet skills on breadth of internet uses (β = .409), algorithm awareness (β = .366), and algorithm knowledge (β = .280). Algorithm knowledge was positively affected by algorithm awareness (β = .482) and had a significant positive effect on breadth of internet uses (β = .115). Internet outcomes were positively affected by breadth of internet uses (β = .602). All statistically significant effects were demonstrated at p ≤ .001 level. The effect of ubiquity of internet access on algorithm knowledge and the effect of algorithm awareness on internet uses could not be confirmed.

Figure 2. Path analysis results (standardized estimates).

Figure 2. Path analysis results (standardized estimates).

Table 3. Summary of parameter estimates and hypotheses testing (direct effects).

Results of MGA showed that the model had acceptable fit in all groups (Table OS6). There were no differences in path estimates between men and women or between users with low- and high-skill jobs, but there were some significant differences (at p < .05 level) in the case of age, educational, and income groups (Table OS7 and OS8). Specifically, the effect of ubiquity of internet access on internet skills significantly differed across all three groups, being the weakest among younger, well educated, and high-income users and the strongest among older, less educated, and low-income users. Age also moderated the effect of internet skills on breadth of internet uses, which was weaker among younger compared to middle-aged and older users, and the effect of algorithm awareness on algorithm knowledge, which was stronger among younger and middle-aged users compared to older users. Interestingly, although income moderated the effects of ubiquity of internet access on algorithm knowledge this effect was non-significant within both high-income and low-income user groups ().

Table 4. Moderated unstandardized parameter estimates from (modified) constrained models across age, educational, and income groups.

Discussion

Main findings

Various hypotheses in our model (H1, H2, H3, and H11) confirmed results of the MCSDE in previous empirical research, that is, ubiquity of access positively affects internet skills and breadth of internet use, internet skills positively affect breadth of internet use, and breadth of internet use positively affects internet outcomes (e.g., Petrovčič et al., Citation2022; van Deursen et al., Citation2017). What is new in our expanded model is the inclusion of algorithm awareness and knowledge. Based on prior research in this area (Gruber et al., Citation2021; Hargittai et al., Citation2020), we situated algorithm awareness and knowledge as digital resources akin to internet skills that mediate between access and uses in the MCSDE. Following that logic, we tested the effects of both ubiquity of internet access and internet skills on algorithm awareness and knowledge. Both the strong model fit, and the validation of the tested hypotheses confirm that this approach makes sense, with some minor caveats.

For example, to our knowledge, there has been no prior research that examined the relationship between ubiquity of internet access and algorithm awareness and knowledge. Our results confirm the findings of Wei and Yan (Citation2023) that ubiquitous access has a positive effect on algorithm awareness (supporting H4); however, we did not find a statistically significant effect on algorithm knowledge (H5 was not supported). Even though a higher number of access devices may increase someone’s awareness of the presence of algorithms, it does not necessarily mean they understand more about how these algorithms work. Similarly, whereas algorithm awareness has no significant effect on breadth of use (H9 was not supported), algorithm knowledge is positively related to breadth of use (supporting H10). This highlights an important facet of research in this area. While both algorithm awareness and knowledge are part of broader algorithm literacy – and in fact, algorithm awareness has a strong effect on algorithm knowledge (supporting H8) – the differences in results support the notion that these concepts are different and should be considered separate parts of algorithm literacy (Dogruel et al., Citation2022), similar to the differentiation of other types of internet skills, such as the five dimensions in the ISS model (Grošelj et al., Citation2021; van Deursen et al., Citation2016).

This brings up another important argument posed by Hargittai et al. (Citation2020) and Gruber et al. (Citation2021). Both works argue that given the ubiquity of algorithms in our online experiences, algorithm literacy or skills, including the two dimensions of algorithm awareness and knowledge examined in this study, should be considered an integral component of the broader understanding of digital skills at large. Our analysis confirms that overall higher internet skills have a strong positive effect on both algorithm awareness and knowledge (supporting H6 and H7). Since our results indicate that internet skills affect internet uses directly and indirectly through algorithm knowledge, future conceptual models should consider defining algorithm literacy as an integral aspect of second-level digital inequalities. This is important not only for a better understanding of the mediating roles of algorithm awareness and knowledge between internet skills and uses, but also for developing initiatives aimed at improving the behavioral dimensions of algorithm literacy such as coping strategies that internet users need to limit and/or avoid the negative implications of algorithmic curation (Dogruel, Citation2021).

Our analyzes also found the moderating effect of age, education, and income but not gender or occupation (RQ1). The moderation effects indicate that addressing digital inequalities among disadvantaged groups (older, less educated, lower income) will have a disproportionally greater effect on digital inclusion than among better-off groups. For example, a one-point increase in internet skills will increase algorithm knowledge by 0.6 points among those with below average income, but by 0.3 points among those with above average income. The only exception was the weaker effect of algorithm awareness on algorithm knowledge among older users, compared to their younger counterparts. However, this finding is in line with prior literature (Fletcher & Nielsen, Citation2019; Ytre-Arne & Moe, Citation2021) suggesting that older users who gain awareness about algorithmic curation show less motivation to proactively acquire more knowledge about algorithms than younger users. Therefore, targeted interventions for older users are more so needed to improve their understanding of algorithms on the internet.

Overall, our study tested an expansion of the MCSDE with algorithm awareness and knowledge. The strong model fit and the overall results confirm the importance of algorithm awareness and knowledge in sequential models of digital inequality, echoing calls from other researchers (e.g., Gran et al., Citation2021) to include these (and possibly other) dimensions of algorithm literacy as an integral part of digital equity in a world that increasingly relies on algorithms and artificial intelligence to create, maintain, and run uncountable platforms and apps that internet users engage with on a daily basis (e.g., Cotter & Reisdorf, Citation2020; Gruber et al., Citation2021). Whether internet users are aware of algorithms and know how they work – to the extent that the latter is possible – is heavily dependent on their ubiquity of access and their general levels of internet skills. Not only that, but algorithm awareness and especially algorithm knowledge appear to influence how broadly they engage with the internet. Prior research has shown that algorithm awareness and algorithm knowledge can be tied to modified behaviors, such as avoiding or trying to manipulate platforms in certain ways (Fouquaert & Mechant, Citation2022; Siles et al., Citation2022). Our study confirms these results and supports the notion that algorithm awareness and knowledge affect the way that people utilize the internet, which, in turn, has an effect on the tangible outcomes of their internet use.

Limitations and future research

There are some limitations that need to be considered. While Slovenia is an average performing EU country in terms of internet skills and uses (Eurostat, Citation2022), no comparable data exists in terms of algorithm awareness and knowledge in the EU. Moreover, this study focused on two dimensions of algorithm literacy (i.e., awareness and knowledge). While this is appropriate as they affect the online activities of the largest number of internet users (Dogruel, Citation2021; Dogruel et al., Citation2022), future studies examining additional dimensions are nevertheless warranted. In addition, while the sequentiality of the paths in the explanatory model was derived directly from the assumptions of the MCSDE, our research design could not verify these causal claims. Relatedly, the current study could also not test the potential bidirectional causal relationship between algorithm awareness and knowledge and internet uses as proposed by some scholars (e.g., Swart, Citation2021). Indeed, a post-hoc analyzes of reverse (Table OS9) and mediational (Table OS10) models showed that the fit of such models was equal or close to the proposed model, indicating that longitudinal or experimental studies should elucidate the causal and bidirectional effects among the concepts in the explanatory model.

Conclusion

Algorithms have become ubiquitous, and they are here to stay. This also means that policymakers and practitioners in the area of digital inequalities need to take algorithm awareness and knowledge seriously and include them in efforts to improve digital engagement among the population. Internet users need to understand how their content is curated, what this means for the content they see online, and how this affects their lives both online and offline. To that extent, it is not enough to integrate algorithm awareness and knowledge into the conceptual models of digital inequalities. They should be also part of any serious digital inclusion intervention. Our findings suggest that awareness-raising initiatives regarding the implications of algorithmic curation would be particularly valuable for digitally disadvantaged groups, such as older adults or low-income groups. Not only would this address low levels of algorithm awareness, but it would also enable them to gain knowledge about the consequences of algorithmic curation across various online domains (e.g., social media, online news aggregators). Increasing the levels of algorithm awareness and knowledge among the population is critical for both coping with algorithmically curated online environments on an individual level and for raising the public’s understanding of potential risks and benefits that algorithmic curation and decision-making bring for society in general.

Supplemental material

Supplemental Material

Download MS Word (439.5 KB)

Acknowledgement

We thank Nuša Cekuta for her support in data analysis. We also thank both reviewers for their constructive and insightful comments that helped us improve the article.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The data that support the findings of this study are openly available in ADP – Social Science Data Archives at https://doi.org/10.17898/ADP_SJM221_V1

, reference number SJM221.

Additional information

Funding

This work was supported by the Slovenian Research Agency under Grants no. BI-US/22-24-055, J5-2558, P5-0399, V5-2275, and the Young Research fellowship of the last author.

Notes on contributors

Andraž Petrovčič

Andraž Petrovčič (Ph.D. University of Ljubljana) is an associate professor of social informatics in the Faculty of Social Sciences, University of Ljubljana, Slovenia. His research focuses on socio-technical aspects of older adults’ interactions with internet technologies in the context of digital inequalities [email: [email protected]].

Bianca C. Reisdorf

Bianca C. Reisdorf (D.Phil. University of Oxford) is an associate professor in the Department of Communication Studies, University of North Carolina Charlotte, USA. Her work focuses on the intersection of inequalities and digital media and the internet among marginalized populations, as well as proxy internet use and how internet users look for and evaluate information from various media sources.

Vasja Vehovar

Vasja Vehovar (Ph.D. University of Ljubljana) is a full professor of statistics in the Faculty of Social Sciences, University of Ljubljana, Slovenia. His research focuses on internet research, information society indicators, statistics, missing data, sample collection, social science methodology, and web survey methodology.

Jošt Bartol

Jošt Bartol is a doctoral student of statistics at the University of Ljubljana, Slovenia, and a research fellow at the Centre for Social Informatics at the Faculty of Social Sciences of the University of Ljubljana, Slovenia. His research centres on information privacy on the internet and scale development.

Notes

1 Due to space limitations, we do not provide a discussion of other dimensions of algorithm literacy (e.g., critical evaluation, coping tactics, creation and design). For a detailed discussion of the concept of algorithm literacy, see Dogruel (Citation2021) or Shin et al. (Citation2022).

References

  • AAPOR. (2016). Standard definitions: Final disposition of case codes and outcome rates for surveys. https://aapor.org/wp-content/uploads/2022/11/Standard-Definitions20169theditionfinal.pdf
  • Blank, G., & Grošelj, D. (2014). Dimensions of internet use: Amount, variety, and types. Information, Communication & Society, 17(4), 417–435. https://doi.org/10.1080/1369118X.2014.889189
  • Büchi, M., Festic, N., & Latzer, M. (2019). Digital overuse and subjective well-being in a digitized society. Social Media + Society, 5(4), Article 2056305119886031. https://doi.org/10.1177/2056305119886031
  • Büchi, M., Just, N., & Latzer, M. (2016). Modeling the second-level digital divide: A five-country study of social differences in internet use. New Media & Society, 18(11), 2703–2722. https://doi.org/10.1177/1461444815604154
  • Cotter, K., & Reisdorf, B. C. (2020). Algorithmic knowledge gaps: A new dimension of (digital) inequality. International Journal of Communication, 14, 745–765.
  • DeVito, M. A. (2021). Adaptive folk theorization as a path to algorithmic literacy on changing platforms. Proceedings of the ACM on Human-Computer Interaction, 5(CSCW2), 1–38. https://doi.org/10.1145/3476080
  • DiMaggio, P., Hargittai, E., Celeste, C., & Shafer, S. (2004). Digital inequality: From unequal access to differentiated use. In K. Neckerman (Ed.), Social inequality (pp. 355–400). Russell Sage Foundation.
  • Dogruel, L. (2021). What is algorithm literacy? A conceptualization and challenges regarding its empirical measurement. In M. Taddicken & C. Schumann (Eds.), Algorithms and communication (pp. 67–93). Digital Communication Research. https://doi.org/10.48541/dcr.v9.3
  • Dogruel, L., Masur, P., & Joeckel, S. (2022). Development and validation of an algorithm literacy scale for internet users. Communication Methods and Measures, 16(2), 115–133. https://doi.org/10.1080/19312458.2021.1968361
  • Emberstone, S. E., & Reise, S. P. (2013). Item response theory. Psychology Press.
  • Eslami, M., Rickman, A., Vaccaro, K., Aleyasen, A., Vuong, A., Karahalios, K., Hamilton, K., & Sandvig, C. (2015). I always assumed that I wasn’t really that close to [her]”: Reasoning about invisible algorithms in news feeds. In Proceedings of the 33rd annual ACM conference on human factors in computing systems (pp. 153–162). ACM. https://doi.org/10.1145/2702123.2702556
  • Eslami, M., Vaccaro, K., Karahalios, K., & Hamilton, K. (2017). Be careful; things can be worse than they appear”: understanding biased algorithms and users’ behavior around them in rating platforms. Proceedings of the International AAAI Conference on Web and Social Media, 11(1), 62–71. https://doi.org/10.1609/icwsm.v11i1.14898
  • Eurostat. (2022). How many citizens had basic digital skills in 2021? https://ec.europa.eu/eurostat/web/products-eurostat-news/-/ddn-20220330-1
  • Fletcher, R., & Nielsen, R. K. (2019). Generalised scepticism: How people navigate news on social media. Information, Communication & Society, 22(12), 1751–1769. https://doi.org/10.1080/1369118X.2018.1450887
  • Fouquaert, T., & Mechant, P. (2022). Making curation algorithms apparent: A case study of ‘Instawareness’ as a means to heighten awareness and understanding of Instagram’s algorithm. Information, Communication & Society, 25(12), 1769–1789. https://doi.org/10.1080/1369118X.2021.1883707
  • Gran, A.-B., Booth, P., & Bucher, T. (2021). To be or not to be algorithm aware: A question of a new digital divide? Information, Communication & Society, 24(12), 1779–1796. https://doi.org/10.1080/1369118X.2020.1736124
  • Grošelj, D., van Deursen, A. J. A. M., Dolničar, V., Burnik, T., & Petrovčič, A. (2021). Measuring internet skills in a general population: A large-scale validation of the short Internet Skills Scale in Slovenia. The Information Society, 37(2), 63–81. https://doi.org/10.1080/01972243.2020.1862377
  • Gruber, J., Hargittai, E., Karaoglu, G., & Brombach, L. (2021). Algorithm awareness as an important internet skill: The case of voice assistants. International Journal of Communication, 15, 1770–1788. https://ijoc.org/index.php/ijoc/article/view/15941
  • Gui, M., & Büchi, M. (2021). From use to overuse: Digital inequality in the age of communication abundance. Social Science Computer Review, 39(1), 3–19. https://doi.org/10.1177/0894439319851163
  • Hair, J. F., Jr., Black, W. C., Babbin, B. J., & Anderson, R. E. (2018). Multivariate data analysis. Cengage.
  • Hamilton, K., Karahalios, K., Sandvig, C., & Eslami, M. (2014). A path to understanding the effects of algorithm awareness. In Chi ‘14 extended abstracts on human factors in computing systems (pp. 631–642). ACM. https://doi.org/10.1145/2559206.2578883
  • Hargittai, E., Gruber, J., Djukaric, T., Fuchs, J., & Brombach, L. (2020). Black box measures? How to study people’s algorithm skills. Information, Communication & Society, 23(5), 764–775. https://doi.org/10.1080/1369118X.2020.1713846
  • Hargittai, E., & Hinnant, A. (2008). Digital inequality. Communication Research, 35(5), 602–621. https://doi.org/10.1177/0093650208321782
  • Hargittai, E., Piper, A. M., & Morris, M. R. (2019). From internet access to internet skills: Digital inequality among older adults. Universal Access in the Information Society, 18(4), 881–890. https://doi.org/10.1007/s10209-018-0617-5
  • Helsper, E. J. (2021). The digital disconnect: The social causes and consequences of digital inequalities. SAGE.
  • Helsper, E. J., van Deursen, A. J. A. M., & Eynon, R. (2015). Tangible outcomes of internet use: From digital skills to tangible outcomes project report. Oxford Internet Institute. http://www.oii.ox.ac.uk/research/projects/?id=112
  • International Labour Organization - ILO. (2016, June 21). ISCO – International standard classification of occupations. https://www.ilo.org/public/english/bureau/stat/isco/isco08/index.htm
  • Kline, R. B. (2015). Principles and practice of structural equation modeling. Guilford Press.
  • Lomborg, S., & Kapsch, P. H. (2020). Decoding algorithms. Media, Culture & Society, 42(5), 745–761. https://doi.org/10.1177/0163443719855301
  • Mair, P., Hatzinger, R., & Maier, M. J. (2021). eRm: Extended Rasch Modelling [Version 1.0-2] [Computer software]. https://cran.r-project.org/package = eRm
  • Newman, D. A. (2014). Missing data: Five practical guidelines. Organizational Research Methods, 17(4), 372–411. https://doi.org/10.1177/1094428114548590
  • Oeldorf-Hirsch, A., & Neubaum, G. (2023). What do we know about algorithmic literacy? The status quo and a research agenda for a growing field. New Media & Society. Advance online publication. https://doi.org/10.1177/14614448231182662
  • Petrovčič, A., Reisdorf, B. C., Prevodnik, K., & Grošelj, D. (2022). The role of proxy internet use in sequential pathways of digital exclusion: An empirical test of a conceptual model. Computers in Human Behavior, 128, Article 107083. https://doi.org/10.1016/j.chb.2021.107083
  • Petter, S., Straub, D., & Rai, A. (2007). Specifying formative constructs in information systems research. MIS Quarterly, 31(4), 623–656. https://doi.org/10.2307/25148814
  • Pinski, M., & Benlian, A. (2024). AI literacy for users–A comprehensive review and future research directions of learning methods, components, and effects. Computers in Human Behavior: Artificial Humans, 2(1), 100062. https://doi.org/10.1016/j.chbah.2024.100062
  • Powers, E. (2017). My news feed is filtered? Digital Journalism, 5(10), 1315–1335. https://doi.org/10.1080/21670811.2017.1286943
  • Rader, E., & Gray, R. (2015). Understanding user beliefs about algorithmic curation in the facebook news feed. In Proceedings of the 33rd annual ACM conference on human factors in computing systems (pp. 173–182). https://doi.org/10.1145/2702123.2702174
  • R Core Team. (2022). R: A language and environment for statistical computing [version 4.2.2]. [Computer software]. R Foundation for Statistical Computing.
  • Reisdorf, B. C., & Blank, G. (2021). Algorithmic literacy and platform trust. In E. Hargittai (Ed.), Handbook of digital inequality (pp. 341–357). Edward Elgar Publishing. https://doi.org/10.4337/9781788116572.00032
  • Reisdorf, B. C., Fernandez, L., Hampton, K. N., Shin, I., & Dutton, W. H. (2022). Mobile phones will not eliminate digital and social divides: How variation in internet activities mediates the relationship between type of internet access and local social capital in Detroit. Social Science Computer Review, 40(2), 288–308. https://doi.org/10.1177/0894439320909446
  • Reisdorf, B. C., Petrovčič, A., & Grošelj, D. (2021). Going online on behalf of someone else: Characteristics of internet users who act as proxy users. New Media & Society, 23(8), 2409–2429. https://doi.org/10.1177/1461444820928051
  • Robinson, L., Cotten, S. R., Ono, H., Quan-Haase, A., Mesch, G., Chen, W., Schulz, J., Hale, T. M., & Stern, M. J. (2015). Digital inequalities and why they matter. Information Communication & Society, 18(5), 569–582. https://doi.org/10.1080/1369118X.2015.1012532
  • Rosseel, Y. (2012). Lavaan: An R package for structural equation modeling. Journal of Statistical Software, 48(2), 1–36. https://doi.org/10.18637/jss.v048.i02
  • Scheerder, A., van Deursen, A. J. A. M., & van Dijk, J. A. G. M. (2017). Determinants of internet skills, uses and outcomes. A systematic review of the second- and third-level digital divide. Telematics and Informatics, 34(8), 1607–1624. https://doi.org/10.1016/j.tele.2017.07.007
  • Shin, D., Rasul, A., & Fotiadis, A. (2022). Why am I seeing this? Deconstructing algorithm literacy through the lens of users. Internet Research, 32(4), 1214–1234. https://doi.org/10.1108/INTR-02-2021-0087
  • Siles, I., Valerio-Alfaro, L., & Meléndez-Moran, A. (2022). Learning to like TikTok … and not: Algorithm awareness as process. New Media & Society. Advance online publication. https://doi.org/10.1177/14614448221138973
  • Swart, J. (2021). Experiencing algorithms: How young people understand, feel about, and engage with algorithmic news selection on social media. Social Media + Society, 7(2), 205630512110088. https://doi.org/10.1177/20563051211008828
  • UNESCO. (2012). International standard classification of education ISCED 2011. http://uis.unesco.org/sites/default/files/documents/international-standard-classification-of-education-isced-2011-en.pdf
  • van Deursen, A. J. A. M., & Helsper, E. J. (2015). The third-level digital divide: Who benefits most from being online?. In L. Robinson, S. R. Cotten, J. Schulz, T. M. Hale, & A. Williams (Eds.), Communication and information technologies annual (studies in media and communications (Vol. 10, pp. 29–52). Emerald. https://doi.org/10.1108/S2050-206020150000010002
  • van Deursen, A. J. A. M., & Helsper, E. J. (2018). Collateral benefits of internet use: Explaining the diverse outcomes of engaging with the internet. New Media & Society, 20(7), 2333–2351. https://doi.org/10.1177/1461444817715282
  • van Deursen, A. J. A. M., Helsper, E. J., & Eynon, R. (2016). Development and validation of the Internet Skills Scale (ISS). Information, Communication & Society, 19(6), 804–823. https://doi.org/10.1080/1369118X.2015.1078834
  • van Deursen, A. J. A. M., Helsper, E. J., Eynon, R., & van Dijk, J. A. G. M. (2017). The compoundness and sequentiality of digital inequality. International Journal of Communication, 11, 452–473. https://ijoc.org/index.php/ijoc/article/view/5739
  • van Deursen, A. J. A. M., & van Dijk, J. A. G. M. (2015). Toward a multifaceted model of internet access for understanding digital divides: An empirical investigation. The Information Society, 31(5), 379–391. https://doi.org/10.1080/01972243.2015.1069770
  • van Deursen, A. J. A. M., & van Dijk, J. A. G. M. (2019). The first-level digital divide shifts from inequalities in physical access to inequalities in material access. New Media & Society, 21(2), 354–375. https://doi.org/10.1177/1461444818797082
  • van Dijk, J. A. G. M. (2020). The digital divide. Wiley.
  • Wei, S., & Yan, P. (2023). Measuring users’ awareness of content recommendation algorithm: A survey on Douyin users in rural China. In I. Sserwanga, A. Goulding, H. Moulaison-Sandy, J. T. Du, A. L. Soares, V. Hessami, & R. D. Frank (Eds.), Information for a better world: Normality, virtuality, physicality, inclusivity. iConference 2023. Lecture notes in computer science (Vol. 13971, pp. 197–220). Springer. https://doi.org/10.1007/978-3-031-28035-1_15
  • Ytre-Arne, B., & Moe, H. (2021). Folk theories of algorithms: Understanding digital irritation. Media, Culture & Society, 43(5), 807–824. https://doi.org/10.1177/0163443720972314