389
Views
0
CrossRef citations to date
0
Altmetric
Research Article

AI-Driven Contextual Advertising: Toward Relevant Messaging Without Personal Data

ORCID Icon & ORCID Icon

Abstract

In programmatic advertising, bids are increasingly based on knowledge of the surrounding media context. This shift toward contextual advertising is in part a counter-reaction to the current dependency on personal data, which is problematic from legal and ethical standpoints. The transition is accelerated by developments in artificial intelligence (AI), which allow for a deeper semantic analysis of the context and, by extension, more effective ad placement. We survey existing literature on the influence of context on the reception of an advertisement, focusing on three context factors: the applicability of the content and the ad, the affective tone of the content, and the involvement of the consumer. We then discuss how AI can leverage these priming effects to optimize ad placement through techniques such as reinforcement learning, data clustering, and sentiment analysis. This helps close the gap between the state of the art in advertising technology and the AI-driven targeting methodologies described in prior academic research.

Introduction

In this work, we denote by contextual advertising (CA) the practice of optimizing advertising effectiveness by placing ads in favorable media contexts (Zhang and Katona Citation2012).Footnote1 The worldwide market for CA was valued at about US $100 billion in 2017 and at US $200 billion in 2022 (Research and Markets Citation2023b). It is expected to continue to increase to around US $560 billion by 2030. This growth is predicted to happen at an average annual rate of 14 percent between 2022 and 2030 (Research and Markets Citation2023a).

There is a substantial literature on the impact of context on advertising effectiveness (see, for example, Yi Citation1990; Norris and Colman Citation1992; Zhang and Katona Citation2012; Song Citation2014). However, the interplay between ad and context is complex, and empirical studies at times yield contradictory results.Footnote2 Despite this, CA is used for online media, where bidders in auctions of digital ad space use data about the media context to inform their decisions. The practice is already established in search advertising to include sponsored links among the answers to search queries. CA is now becoming a commodity also for display advertising, that is, the promotional third-party banners that finance a large part of the internet (Wood Citation2022).

The increasing use of CA can be ascribed to two factors. First is the societal concern for consumer privacy and fairness that led to the General Data Protection Regulation and the California Consumer Privacy Act. Apple now requires iPhone users’ consent before allowing tracking across apps and websites, and Google plans to remove third-party cookies from Chrome (Dave Citation2021). This motivates the industry to move away from personal data and behavioral tracking (Ipsos Citation2021), making CA—which requires neither—an attractive alternative. Rather than looking at previously recorded data, CA assumes that consumers’ content requests indicate their current state of mind and product preferences and places ads accordingly.

A second driver of CA is the abundance of analyzable media content and ad performance data in online environments, which makes it possible to harness artificial intelligence (AI) for targeting purposes. Google alone serves 29.8 billion ad impressions every single day, spread over its own site and the 200 million websites in its advertising network (Venture Beat Citation2023). Given such volumes of traffic, automation is becoming a necessity to orchestrate global-scale campaigns. Moreover, in contrast to the case for autonomous vehicles and medical diagnostics, the consequences of misclassifications are generally acceptable. This fault tolerance encourages experimentation and lowers the barriers for AI applications in online advertising. In the case of CA, modern AI technology can analyze media content beyond the superficial level of keywords and provide a deeper and more refined profiling of contexts. It can detect signals like topic (Zhao et al. Citation2021), sentiment (Birjali, Kasri, and Beni-Hssane Citation2021), and visual complexity (Saraee, Jalal, and Betke Citation2020) and optimize bidding in programmatic real-time auctions for advertising space (Qin and Jiang Citation2019). While AI is already being applied to CA, it is still at an early stage of adoption.

The purpose of this article is to explore the numerous ways in which AI can be applied to CA. For advertising researchers, an understanding of the implementation, benefits, and limitations of the technology is required to discern future research needs. Furthermore, our synthesis of advertising and technological research with regard to AI-driven CA provides a bridge between theory and practice.

Many advertising platforms already offer services for CA that involve AI technology. We want to take you behind the curtain by describing the underlying methods that are used to classify content and optimize targeting, incorporating relevant academic work on AI within advertising. We begin by reviewing the advertising literature on media context effects and, in doing so, identify three context factors that have been shown to correspond to advertising effectiveness. We then explain how these context factors can be detected and quantified through AI-based methods, before moving on to (1) the optimization of ad delivery through reinforcement learning, (2) the utilization of historical data through transfer learning, and (3) the attainment of deeper contextual understanding through large language models (LLMs). We conclude with a research agenda, in which we raise the need for research on the influence of context on ad evaluation in online media as well as studies on how AI and algorithms can meet the marketing goals of advertisers in a brand-safe, transparent, and computationally effective manner.

The influence of context: A review of advertising literature

CA predates the internet. Advertisers have long understood that context can be used to address groups of consumers with specific interests or shared challenges. Furthermore, a fitting context can increase the memorability of an ad or increase the persuasiveness of the ad message (Shen and Chen Citation2007; Yi Citation1990; Song Citation2014; Furnham, Bergland, and Gunter Citation2002). For example, by placing ads for public transport on a billboard next to a congested highway, the message reaches people facing a problem to which the ad provides a solution. To target customers with an interest in nature, a company selling tents may advertise their products in a magazine about trail running. From here on, we take a favorable media context (with respect to a given ad) to be one whose features increase the likelihood that a specific advertising objective is achieved. It is thus not enough that the context and the ad contain have overlapping keyword sets, for example, if this has no impact on the target advertising metric.

The notion of advertising effectiveness is multifaceted and can be measured in different ways. Previous studies have considered metrics such as ad and brand attitude, recall, and purchase intention (see, e.g., Cheong, de Gregorio, and Kim Citation2010). Three underlying context factors have received particular attention: (1) the applicability of the content and the ad, (2) the affective tone of the content, and (3) the involvement of the consumer. We briefly summarize previous literature on each of these in order. While early works mainly focused on print and TV media, attention has come to shift toward display advertising, the rectangular ads that pervade websites and social media. In the following sections, we discuss results from both offline and online media.

Applicability

A premise behind online CA is that the content a consumer chooses to engage with can be indicative of their interests and needs. Instead of relying on personal data, contextual advertisers indirectly target personal preference by pursuing article–ad applicability: placements where the content is relevant to the ad. Placing ads based on applicability has been shown to improve clickthrough rates (CTR) across several online content types, including third-party websites (Chakrabarti, Agarwal, and Josifovski Citation2008; Tagami et al. Citation2013), blog posts (Fan and Chang Citation2009), news sites (Oh, Lee, and Lee Citation2012), e-magazines (Zanjani, Diamond, and Chan Citation2011), and social media (Mao and Zhang Citation2015). Applicability has also been shown to improve consumers’ attitude toward advertised brands and their recall of ads in numerous channels, including print media (Shen and Chen Citation2007; Yi Citation1990), TV (Furnham, Bergland, and Gunter Citation2002), and display advertising (Huang Citation2014; Song Citation2014).

Applicability cannot only give an idea of the consumer’s interests and needs but can also create a priming effect: a subconscious influence on the consumer’s cognitive and behavioral reactions (Higgins Citation1996). By inducing feelings or emphasizing attributes, a media context can lead the consumer toward certain associations, which in turn affects how an ad is perceived. In a study by Yi (Citation1990), subjects were shown an article together with a companion ad for a car, which emphasized the large size of the car. When the topic of the article was “safety in air travel,” the subjects tended to associate the size of the car with safety; when the topic was “oil entrepreneurs,” the subjects instead associated the size with poor fuel economy. Yi (Citation1990) concluded that the theme of the priming article influenced the subjects’ attitude toward the brand and advertisement as well as their purchase intention. Because of this priming effect, brands frequently advertise in high-status contexts, such as in luxury magazines and at exclusive events, in the hope of gaining prestige by association. There is support for the effectiveness of this approach (Fuchs Citation1964) and that these types of implicit contextual cues are moderated by the applicability in associations between a medium and the advertised brand (Dahlén Citation2005).

While the potential benefit of applicability is evident, it is important to bear in mind that different topics give rise to varying advertising effectiveness (Oh, Lee, and Lee Citation2012). Moreover, the fact that current CA systems have only a superficial understanding of context can lead to unfortunate placements that damage the advertised brand. For example, although articles containing the keyword “family” are as a rule favorable contexts for childcare products, they can also concern domestic abuse and other types of trauma. A number of factors mediating the priming effect induced by applicability further complicate the effective use of contextual ads. These factors include ad clutter (Oh, Lee, and Lee Citation2012), ad complexity (Song Citation2014), the level of arousal the ad produces (Belanche, Flavián, and Pérez-Rueda Citation2017), and the task orientation and product involvement of the consumer (Zanjani, Diamond, and Chan Citation2011).

Affective tone

The value of using emotions to engage with consumers is well known in marketing, and advertisers generally strive to evoke positive feelings through their messaging. Similarly, the affective tone of the surrounding media context can induce feelings that impact satisfaction, customers’ loyalty, and decision-making. This idea is supported by Kwortnik and Ross (Citation2007), who found that when consumers are exposed to ads for vacation trips, their rational evaluations are influenced by the emotional resonance of the ads. The conceptual basis is affective priming, that is, the observation that the affective tone of media content creates a cognitive response that shapes the perception of the ads. Several studies have confirmed that positive content improves attitudes and purchase intention in both print media and television (Pelsmacker, Geuens, and Anckaert Citation2002; Goldberg and Gorn Citation1987; Kamins, Marks, and Skinner Citation1991; Yi Citation1990; LaTour and Latour Citation2009). However, congruity in tone between ad and content (e.g., positive–positive or negative–negative) leads to similar effects (Pelsmacker, Geuens, and Anckaert Citation2002; Kamins, Marks, and Skinner Citation1991). Meanwhile, negative emotions generated by media content can stimulate analytical processing of advertising messages. For instance, in a study by Shapiro, Macinnis, and Park (Citation2013), consumers took a more critical stance to a restaurant advertisement if they had first been exposed to a radio segment that dampened their mood.

Content involvement

Priming theory suggests that when consumers appreciate and are engaged in media content, their perception of accompanying ads improves (Goldberg and Gorn Citation1987). However, if they are so deeply emerged that they fail to notice the ads, the effect of positive priming is lost. Advertisers employ a range of methods to shift the reader’s attention away from the main content and toward their ads by, for example, the choice of colors (Fernandez and Rosen Citation2000) or positioning in the media (Huang Citation2018), but these may spoil the user experience. There is a tension between stakeholder interests: The consumers want the best possible user experience, which requires engagement and immersion, while the advertisers prioritize their marketing goals, not seldom at the cost of the user’s immersion. The publishers need to strike a balance between the two objectives, where both consumers and advertisers are sufficiently satisfied to continue interacting with the publisher.

The effects of content involvement on advertising effectiveness depend on the type of medium being consumed. Studies have found that TV viewers who are engaged in what they are watching show increased ad recall and more positive attitude toward the ad served (Pelsmacker, Geuens, and Anckaert Citation2002; Tavassoli, Shultz, and Fitzsimons Citation1995). Among readers of print media, on the other hand, a higher degree of involvement reduces ad recall (Norris and Colman Citation1992; Pelsmacker, Geuens, and Anckaert Citation2002). Norris and Colman (Citation1992) conjecture that this difference can be explained by the medium in which ads are presented. In TV, ads temporarily replace the media stream, forcing themselves upon the consumer’s attention, whereas in print media, an absorbing article is more likely to keep the consumer’s attention away from accompanying ads. If this conjecture is correct, then the effect of content involvement in flow-based social media such as TikTok and Instagram should bare greater similarity to the case for TV than for print, as the ads are spliced into the main media stream. Finally, in online display advertising, Tai and Chang (Citation2005) found that higher levels of involvement improved CTR, ad attitudes, and conversion rates (i.e., purchases, roughly speaking). The effects were moderated by the type of web content and were more pronounced for entertainment and learning content than for news content.

In summary, differences across media channels and the interplay of various context factors produce a complex environment. The promise of AI within CA lies in disentangling this complexity and identifying favorable ad placements using the richness of contextual data available online.

The application of AI to CA

The volume and speed at which online advertising space is bought and sold, in combination with the economic values at stake, have motivated immense investments in process automation. This has made online advertising one of the early success stories for applied AI (Qin and Jiang Citation2019) and has contributed to the technological developments that have led to today’s LLMs. Initial uses have focused on personalization, but AI is expected to also assume a significant role in context-centric advertising (Choi and Lim Citation2020). The same technologies that make it possible to automatically caption images with text, or realize visual-linguistic question answering, can help uncover the dynamics between ads and context (see, e.g., Chen et al. Citation2016; Hou et al. Citation2017) and by doing so improve advertising effectiveness. Initial solutions have already been marketed by leading advertising platforms, and the technology is gradually becoming more sophisticated.

In this article, when discussing AI applications, we are referring to systems that are trained on data rather than explicitly programmed.Footnote3 In “Identifying context factors using AI,” we explore techniques through which AI learns to discern contextual factors. Additionally, in “Optimizing ad targeting through reinforcement learning,” we delve into how reinforcement learning can optimize ad placements toward an advertising goal over the course of an ad campaign.

A brief introduction to programmatic advertising

The vast majority of online advertisements are traded in programmatic auctions (Garnett Citation2018; Busch Citation2016), where advertising space is sold by digital publishers to advertisers. Because the auctions are conducted on the basis of individual viewings, they need to be completed within a matter of milliseconds, and the number of daily auctions is counted in tens of billions. Moreover, the value of a won auction is revealed immediately to the buyer: Either the ad is clicked or it is not. The combination of speed, scale, and direct feedback makes programmatic auctions a promising application area for AI.

Programmatic advertising takes off from the observation that every time a consumer requests some piece of content from an online publisher, there is an opportunity to display an ad. The consumer’s exposure to the ad is called an impression. Before serving the requested content, the publisher solicits bids on the impression from advertisers through a digital ad exchange. The winning advertiser provides an ad, which is embedded in the content and sent to the consumer. The time for the whole transaction is approximately 300 ms (a blink of an eye takes 300–400 ms). Two characteristic features are granularity and automation (Busch, Citation2016, 8). Granularity enables advertisers to make optimal use of their budget by inspecting each individual impression and bidding in proportion to its expected value in terms of, for example, consumer activation as measured in number of clicks. This detail-oriented view is only viable due to automation, which avoids expensive and time-consuming manual processing.

For these reasons, one of the most common uses of AI in real-time bidding is to classify impressions into categories that have predictive value for advertising effectiveness. When such a classification is applied to consumer data, we talk about segmentation of audiences. When instead the classification is made with respect to the media content, we have AI-driven CA.

Identifying context factors using AI

Let us now discuss how AI can detect and quantify the context factors presented in “The influence of context: a review of advertising literature.” The main approaches are detection of applicability through topic identification (Jelodar et al. Citation2019), recognition of affective tone through sentiment analysis (Yadav and Vishwakarma Citation2020), and estimation of the consumer’s involvement through regression analysis (Bates and Watts Citation1988) and deep learning (Schulz and Behnke Citation2012) from correlating factors such as time spent on the website, scroll pace, or likeness to content previously consumed (Kwon et al. Citation2019). Among these, the majority of attention has been directed toward detecting applicability. However, there is a growing realization of the importance of also acknowledging sentiment and content involvement. Throughout this section, we complement the theoretical presentation with concrete examples from industry.

Detecting applicability

Recall that applicability refers to the topical relevance between an ad and its surrounding context. Media publishers are, to varying degrees, tagging their articles with categories such as “sports,” “economy,” and “lifestyle.” These metadata are then provided as part of auctioned impressions, so that the buyers can make better-informed bids. However, manual categorization is time-consuming, and it is difficult to maintain consistency among annotators within a publication, let alone among publications. The categories can therefore only provide a high-level and rather imprecise understanding of contexts. This can mean that ads appear on pages that are seemingly aligned with the target category but adopt a different perspective than that anticipated by the advertiser. For example, when events become newsworthy, it is often because something has gone wrong; this means that news content tagged with the label “leadership,” for example, often involves failed leadership, such as harassment cases or derailed negotiations.

Topic detection, alternatively referred to as topic modeling or topic analysis, employs ML to assign labels or categories to content items (e.g., textual documents or websites) based on their respective topics or themes. A standard approach is latent Dirichlet allocation (LDA), a probabilistic model that tries to simultaneously represent each document as a distribution over topics and each topic as a distribution over words (Blei, Ng, and Jordan Citation2003). There are also methods that draw on deep learning, where different types of neural networks—commonly recurrent neural networks (Wang et al. Citation2019) or long short-term memory neural networks (Luan and Lin Citation2019)—are trained on large sets of documents and then used for categorization tasks. These methods save human labor by automating the topic labeling and ensure a more coherent use of metadata tags (Xu et al. Citation2019).

Topic identification can also be initiated by an advertising brand to detect media contexts relevant for them. For example, “strong industrial leadership” is too niched a category to be meaningful for many publishers to tag proactively, since it is not likely that they can reliably offer updated content in the category. However, a brand advertising through a range of publishers may—with the support of a contracted data provider—be able to identify a sufficient number of articles matching this description to run a successful campaign. Examples could be a company targeting positive reviews of their own brand or of articles mentioning their main competitor. In a broader context, this customized targeting allows brands to define and establish ownership over their own target categories. Topic identification realized through deep learning can also find abstract categories such as “relaxation” and “self-fulfilment,” which are difficult to reduce to individual keywords in article text (Liu et al. Citation2020), and help advertisers avoid negative reviews of their products or topics deemed brand-unsafe, such as content relating to war, violence, pornography, and so on (Zhang et al. Citation2008).

Topics can also be discovered by grouping articles by similarity and using the resulting clusters as pseudo topics (Aggarwal and Zhai Citation2012). It may, for example, be found that a certain cluster is particularly useful to promote household services, while another is better to promote bank loans. Analogously, we may find that some ads work better in positive contexts, while others benefit from emotionally charged media. In both cases, clustering can be a good alternative to direct topic identification, as it may be computationally less demanding to sort new articles into existing clusters than test for a vast set of fixed topics. A challenge with clustering is that the definition of similarity used to group media items must be carefully constructed, so that the resulting clusters form what we would see as natural categories (Harrando Citation2022).

To conclude this section, we note that topic-based targeting is widely applied in commercial solutions. Examples abound, ranging from Google Ads offering categorizing of content across the web into niched topics like “home improvement” to Spotify allowing targeting of music associated with daily moments such as working out, cooking, or partying. Furthermore, a number of AdTech companies provide contextual targeting solutions by crawling the web, scanning URLs, and categorizing content (Seb Citation2023). Differentiation occurs in terms of, for example, categorization of multimodal data; breadth and nuance of topic targeting options; reach across websites and apps; and brand safety—avoiding content and publishers that could damage the advertiser’s brand reputation.

Detecting affective tone

Data management platforms increasingly use sentiment analysis to offer contextual targeting based on emotion, assigning content labels such as “happy,” “serious,” “amusing,” “loving,” and “hopeful” (Sobreanis Citation2021; IAS Citation2020). Academic research has studied the advertising effect of the affective tone of YouTube content and used computational approaches to select suitable ads and insertion points in videos (Wen et al. Citation2022; Yadati, Katti, and Kankanhalli Citation2014). There is evidence to support that such practices improve the user experience and lead to increased attention and recall of ads, when compared to contextual systems that do not account for content emotion (Yadati, Katti, and Kankanhalli Citation2014). A significant amount of research has also been devoted to sentiment targeting of user-generated content; see, for example, Nair and Shetty (Citation2017), Fan and Chang (Citation2009), Qiu et al. (Citation2009), and Bushi and Zaïane (Citation2019). This makes it possible for advertisers to target reviews or blog posts reporting positive experiences related to a product and to avoid posts that take a negative or neutral stance.

Topic identification and semantic analysis are both instances of natural language processing (NLP) and usually focus on textual content, but the techniques are increasingly applied to multimodal content that also includes images, video, and sound (Chen et al. Citation2016; Hou et al. Citation2017). An industry example of sentiment targeting in the audio domain is YouTube Music, which offers targeting of moods such as “feel good,” “chill,” and “romance,” allowing advertisers to target music with an affective tone that resonates with their ad message or brand (Hsieh Nikolic Citation2020).

Detecting content involvement

In the online environment, attention-based metrics such as dwell time and viewport time, the time spent at a given position on a webpage, have been found to be robust metrics for content involvement (Lagun and Lalmas Citation2016; Bilenko and White Citation2008; Yi et al. Citation2014). Lagun and Lalmas (Citation2016) relied on analysis of viewport time data to quantify user engagement in online news reading. The article identified four levels of reading engagement, ranging from bounce (leaving the page quickly) to complete engagement (consuming most of the article and reading or contributing comments, etc.). This allowed the researchers to determine how features such as the length and the presence of images and videos on the page correlated with reader engagement.

Mappings between contextual features and user engagement allow publishers and advertisers to predict which articles and topics will attract user interest. The prediction does not require complex AI models; for example, LDA is used as a popular and relatively simple tool to chart content involvement (Lagun and Lalmas Citation2016). As discussed in “Content involvement,” the challenge lies rather in the effect between context factors possibly differing among content types, for example, between genres such as entertainment and political news (Yi et al. Citation2014) or between media types such as video and text (Pelsmacker, Geuens, and Anckaert Citation2002; Norris and Colman Citation1992). Therefore, a CA system designed for a particular domain is unlikely to perform optimally if directly transferred to a new setting.

Turning to practical examples, TikTok’s contextual targeting is limited to the top 4 percent of the app’s content, a practice motivated by the assumption that, for contextual ads, content engagement correlates with brand engagement (TikTok Citation2022). In addition to classifications of topic and affective tone, The New York Times allows ad targeting based on different types of internal motivation that consumers experience after reading a story. These motivations include “make a healthy change” and “donate to a cause.” The targeting approach thus aims to align high-involvement content with advertisements that resonate on a similar level.

Optimizing ad targeting through reinforcement learning

Reinforcement learning (Sutton and Barto Citation2018) adds an additional layer of automation by allowing the AI to actively explore the interplay between contextual features and advertisement effectiveness. It essentially uses trial and error to find patterns in the data and exploits these to continuously optimize ad placement. illustrates the principle. In brief, the AI maintains an internal model of how different features of an impression (i.e., the combination of ad and context) contribute toward the likelihood of activating the consumer. Every time the AI wins an auction and is allowed to serve an ad, it gains a new piece of information, for example, whether the ad was clicked or not. Over time, this information gradually improves the model’s ability to recognize promising impressions (Cai et al. Citation2017). The most important advantages of reinforcement learning are that it reduces the need for a priori knowledge about the media domain and allows the AI to continuously adjust to changing conditions in the supply and demand of advertising space. As a by-product of the optimization, the AI may also discover valuable priming factors that can be used in future campaigns, for example, that advertisements for camping equipment perform well near articles about work–life balance.

Figure 1. A simple application of reinforcement learning to optimize bids in programmatic advertising.

Figure 1. A simple application of reinforcement learning to optimize bids in programmatic advertising.

Neural networks have proven to be an effective means of realizing reinforcement learning in media and language applications (Kamath, Liu, and Whitaker Citation2019). The downside in the case of advertising is a lack of transparency due to the networks’ inherent complexity: The AI may come to exploit patterns in the data that are discriminatory or otherwise problematic, without the advertiser realizing. Ali et al. (Citation2019) demonstrated how optimization can produce unfair advertising outcomes even when targeting criteria are neutral. Their study on Facebook’s ad delivery system found that ads for employment and housing opportunities were disproportionately shown to certain gender and racial groups depending on the content of the ad. Moreover, AI systems based on reinforcement learning can be sensitive to noisy data in the early stages of the learning process, when it must base its decisions on relatively few observations. However, techniques such as filtering out accidental clicks that contribute to the noise have shown a strong positive impact on the performance of the trained model (Tolomei et al. Citation2019).

Reinforcement learning has also been tried further down the sales tunnel. After the user has clicked an ad and been taken to a site, they are typically desired to take one or more target actions. Liberali and Ferecatu (Citation2022) explore the use of reinforcement learning, more particularly the training of hidden Markov models, to adapt the site layout based on the user’s actions to guide them toward these actions in the best possible way. This practice of adapting the characteristics of a marketing instrument on a page-view level is known as morphing. Banner morphing, adapting the layout or message of an ad to groups of consumers, has also been applied, with promising results (Urban et al. Citation2014). This encourages similar studies into how reinforcement learning can accommodate the layout and message of an ad to its surrounding context. If successful, morphing stands to provide a substantial lift in advertising effectiveness beyond that stemming from direct context matching of static ads.

The agent starts by initiating its bidding strategy, possibly based on historical data. It then waits for an offer to bid on an impression. When this offer arrives, the agent evaluates the impression based on available information about, for example, personal data about the user, or context data, and decides whether to bid. If the agent decides to bid, it also decides on the amount to bid. If the agent does not bid, then it returns to update its strategy, now knowing more about what types of impressions it can expect. Assume then that the agent does bid. Now, if it loses the bid and the bid amount was lower than the agent’s predicted value of the impression, then it learns that it must be more aggressive in future bids. If, however, the bid was equal to the predicted value but the agent still lost, then it receives neither a punishment nor a reward. Finally, if the agent wins the bid, then it receives a punishment that is proportional to its bid but also a large positive reward if the user clicked. After this feedback has been received, the agent updates its strategy again to reflect the new knowledge.

Optimization metrics for AI-driven advertising

For an AI-driven advertising system to function, there needs to be precise metrics to optimize toward. Traditional metrics for display ads include impressions, clicks, and conversions (purchases, leads, and so forth). For video ads, average viewing time can be added to the list. Although the direct feedback of these metrics is well suited for AI optimization, it does not always reflect the diverse marketing objectives of advertisers. An advertiser aiming to prompt immediate sales could benefit from optimizing clicks or purchases per page view, while the success of a brand awareness campaign requires longer feedback loops. For example, a brand might advertise to stay top-of-mind in a target category or to reposition their brand to be perceived as more environmentally friendly or exclusive. Sometimes the individual viewing of an ad has small effect, and it is only through repeated exposure to the message that the consumer is affected (Campbell and Keller Citation2003). The dissonance between impression-based metrics and long-term advertising goals, along with other computational advertising challenges such as click fraud, is further discussed by Yun et al. (Citation2020).

To meet these challenges, the advertising industry has introduced a battery of novel metrics. This includes view-through conversion (VTC) rates, incremental measures, and survey-based metrics. VTC rates measure the percentage of users who do not click on an ad but later take a desired action, such as making a purchase. Incremental measures appraise the additional impact of an advertising campaign beyond organic outcomes (Yun et al. Citation2020). Survey-based metrics sample ad recall and psychological measures such as brand evaluations of both consumers who have been shown an ad and a control group who have not been shown the ad. Through the described metrics, we discern a trend of evaluating ad campaigns using experimental methods used in scientific experiments and A/B testing. This trend is contingent on the availability of data online and AI identifying patterns of data (Huh and Malthouse Citation2020). While these methods promise more insightful ad evaluation, they are not without drawbacks. Surveying consumers is costly, and the results can lack accuracy due to issues such as nonresponse bias and lack of sufficient sample size. Moreover, contextual advertisers who are concerned about data privacy should be aware that many of these metrics rely on the use of cookies to track consumers across platforms and over time.

Leveraging existing knowledge with transfer learning

When launching a new ad campaign, an ad targeting system faces a cold-start problem: how to effectively target ads when there are limited historical data available. Transfer learning is an ML technique in which knowledge from a related task is used to improve the performance of an ML model on a new task (Wang and Chen Citation2023). Transfer learning has proven effective for online behavioral targeting, using response data from other products to inform the targeting of a new ad campaign (Aggarwal, Yadav, and Keerthi Citation2019). In CA, historical ad campaigns a can speed up the training of a reinforcement learning system, allowing it to start from a point where it has already learned general relationships between context factors and ad performance. Historical advertising data thus become a strategic asset for an advertising systems provider, particularly as a dataset lacking representation of the new ad campaign will perform poorly.

Unlocking deeper contextual comprehension with LLM

Recent progress in language modeling has pushed the state of the art in almost every NLP task, including topic identification and sentiment analysis. Simply put, a language model is a probability distribution over a vocabulary of words, conditioned on a sequence of preceding words. For example, given the sequence “ladies and,” the likelihood of the next word being “gentlemen” is high compared to that of “men,” although “gentlemen” is a less common word in general. In NLP, language models are used to generate sentences word-by-word, oftentimes from an initial starting prompt. To see how this can be used for topic identification, assume that we have a document D that we want to classify along with three candidate topics, a, b, and c. We may then query the model with a custom prompt: “Out of the topics a, b, and c, the topic of” + D + “is most likely,” where the plus sign denotes concatenation.

Language models have been realized over the years in countless ways, but in recent years LLMs based on transformers (Vaswani et al. Citation2017)a type of deep neural network with a mechanism called attention—have become popular. The value of attention lies in that it allows the network to filter out irrelevant information when generating its output. Examples of transformer-based LLMs are GPT-3 (Brown et al. Citation2020), its instruction-tuned variant ChatGPT (OpenAI Citation2022), Bard (Google Citation2023), and Llama (Touvron et al. Citation2023).

While unlikely to match human performance, LLMs can, in principle, recognize the relevance between an article and an advertisement as well as discern the affective tone of the former. Even in the absence of information about the reader, LLMs can provide some indication of the likelihood that an article will engage its audience. Two drawbacks of employing LLMs for ad targeting are the added computational cost and time spent in processing. As of now, the expense of categorizing a 750-word news article using a state-of-the-art LLM is in the magnitude of a few cents in USD (OpenAI Citation2023). Depending on server load, the processing time is around 60 s (Menear Citation2023), which is magnitudes longer than the 400-ms time frame of a typical programmatic auction. However, each article is the focus of hundreds if not thousands of auctions, and a complete processing is only needed once. In subsequent requests, the article’s associated categories can simply be retrieved, which can be done within the strict time frame of programmatic auctions. Moreover, the processing of individual articles can be done in parallel, so the computations involved scale well to large data sizes. Taken together, this means that even assuming a daily processing of some 100,000 articles, both cost and latency fall into the acceptable range for many companies operating in the advertising ecosystem. The main challenge with LLMs is instead the copyright associated with news articles, as uploading third-party articles to platforms such as OpenAI or Google is likely to infringe upon copyright law. There is also the sustainability aspect to consider, as LLMs are—as previously mentioned—resource-intensive.

Combining contextual and personalized targeting

Up to this point, we have mainly discussed CA in isolation, but many advertisers will likely combine the use of context with targeting based on personal data. Personalized advertising, due to its effectiveness, is unlikely to disappear completely; the decrease in ad revenue per webpage with disabled cookies has been reported as 50 to 80 percent (CMA Citation2020; Bleier Citation2021). At the same time, the trend away from consumer tracking is clear. Nearly 40 percent of advertisers have reduced their use of third-party cookies, a proportion that is expected to increase further as companies like Google, Apple, and Mozilla make tracking more difficult (Epsilon Citation2020). Interest-based targeting has been presented primarily by Google as a more privacy-friendly version of personalized targeting. The idea is that websites the user visits are categorized into a set of broader topics, for example, “rock music” and “team sports” (Goel Citation2022), without collecting URL-level data.

We contend that advertisers can benefit from combining personalized and contextual methods, for example, using context-adapted messaging to consumers whose interest in a topic is confirmed by more privacy-preserving tracking, such as interest-based targeting. The benefit of such an approach has been demonstrated by Lu, Zhao, and Xue (Citation2016). The study found that consumers identified through browsing history as potential auto buyers were twice as likely to click on a car advertisement on an auto-related website than in an unrelated context. For advertisers, deciding which combinations of data to target can be difficult. Using the experience gained over an ad campaign, methods like transfer and reinforcement learning algorithms can assist in making these decisions, discerning the mix of personal and contextual variables that correlate to effective ad placement.

Research agenda

The literature on CA is substantial, but much of it focuses on offline media channels such as print and TV and many context effects in the online environment have not been explored. The complex interaction of context, advertisement, and user characteristics means that we have barely scratched the surface of the iceberg. The application of AI to CA is comparatively recent. Like in many applications of AI, there is a need for algorithms that can motivate their decisions. In advertising, this will mitigate the risk of unfair ad delivery and facilitate knowledge transfer between marketers and AI models. To this end, we outline a research agenda for future work.

Contextual segmentation and clustering

As AI enables us to create highly specific forms of targeting, we need to establish a set of best practices for defining and leveraging bespoke contextual segments, for instance, how to choose desirable media contexts for a given product category and practices for devising ads that exploit their placement in precisely defined contexts. This, in turn, requires a solid base of knowledge about brand-building and -positioning and insights into the distinctive characteristics of various media channels, publishers, and genres. As an area of research, the development of such guidelines is situated within the intersection of AI, marketing, and consumer behavior.

An alternative to working with predefined categories is to group articles based on similarity (see “Detecting applicability”). This approach has an advantage in that the resulting clusters cover the entire body of published content, including items that fall outside the standard IAB taxonomy. The clusters can be marketed toward potential ad buyers; for example, an inventory of articles about an upcoming solar eclipse could be valuable to tech retailers to sell telescopes. For clustering to be viable, however, it is necessary to find definitions of inter-article similarity that can be assessed automatically and that yield clusters that are natural from a human point of view (Harrando Citation2022). Although clustering is treated extensively in the literature, creating clusters that are simultaneously meaningful, distinct, and internally coherent is still far from a solved problem.

Emerging media contexts

Much of the research on CA is older and concentrates on offline media channels, including print and TV. Additional work is needed on digital media channels such as social media, online news, and video on demand. Online advertising presents new conditions by virtue of how consumers experience and process both content and ads (Jørgensen and Knudsen Citation2022; Bronner and Neijens Citation2006). Compared to analog media, online ads are more permission-based and skippable and content is more personalized and interactive (Truong, Mccoll, and Kitchen Citation2010). Naturally, there is also diversity in context characteristics between online platforms. For example, social media platforms elicit varying levels of engagement, which in turn influences advertising evaluations (Voorveld et al. Citation2018; Aldous, An, and Jansen Citation2022). The social media platform effects of engagement are dependent on the content source, topic, and sentiment (Voorveld et al. Citation2018). Such studies on more intricate relationships of context factors in online settings are too few and far between. We echo the call from Jørgensen and Knudsen (Citation2022) for new research on current and emerging online contexts.

Cultural priming patterns

Exposure to media context can prime cognitive or emotional responses that influence the processing of ads (Yi Citation1990). Research suggests that the association patterns that are activated through priming are often culturally dependent (see, e.g., Weber and Hsee Citation1998; Özgüner Citation2011; Bailey and Lown Citation1993). To give an example, the color white represents innocence and purity in western culture, but it is associated with death and misfortune in China. A study comparing advertising in individualistic and collectivistic cultures found that whereas magazine ads in the United States focused on individual benefits and personal success, ads published in Korea more often appealed to ingroup benefits, harmony, and family (Han and Shavitt Citation1994).

To leverage priming patterns in a culturally informed way, work is needed to develop an advertising-centric model of priming, which can be refined and evaluated using regional data. In the case of colors, initial steps have been taken by Jonauskaite et al. (Citation2019), who employ ML to estimate the specificity of color–emotion associations. The same method is likely to be transferable to family relations, times of year, stages of life, and other universal concepts and experiences. While AI systems could provide valuable insights, there is a clear need for research to validate and contextualize findings, interpreting them to form practical conclusions and theory.

Explainable AI

Reinforcement learning can serve as the basis for autonomous advertising systems, which infer from experience the contexts that are most beneficial for an ad. The same technology can also be used to generate customized versions of an ad for each new placement—so-called banner-morphing (Urban et al. Citation2014)—to maximize the benefit of contextual cues. For reinforcement learning to be used effectively and responsibly, it must be possible for a human operator to interpret and moderate the statistical patterns discovered by the AI. This can avoid outcomes such as an AI targeting ads for STEM jobs to contexts with a majority male audience (Lambrecht and Tucker Citation2019).

Adding explainability to the AI also allows for the distillation of general consumer patterns that can be of lasting value and helps to bootstrap later campaigns. To this end, we need to develop suitable levels of abstraction in the AI’s model of the world, so that it can be understood and instructed also by non-experts in ML. It would be wasteful not to take the human marketer’s expertise into account. Recent works have developed ad-targeting solutions that integrate LLMs to provide rationale for the system’s predictions (Yang et al. Citation2023), allowing for marketers to learn from the AI system’s experience. However, this raises fundamental questions regarding how marketing professionals prefer to receive information from AI systems and how to build trust in the technology. For example, how should AI communicate warnings of potentially unfair ad delivery or findings that go against the intuition of marketers? What is the long-term influence on essential skills and the role evolution of marketers if ad-targeting decisions are increasingly performed by AI?

Brand safety

Brand safety is the practice of protecting brands from the potential damage caused by their ads being associated with inappropriate or harmful content online. This risk has been amplified by the rise of programmatic ad buying, which gives advertisers less control over where their ads are placed. In a survey by IAB, 80 percent of advertisers said that brand safety is a top priority for their organization (IAB Europe Citation2020). At the same time, there is a lack of scientific research showing that offensive content negatively impacts brand perceptions of accompanying ads. Bellman et al. (Citation2018) tested well-known brands’ ads running as pre-rolls to terrorist videos on YouTube and found no effects on advertising evaluations. Some industry research, on the other hand, claims severe detrimental effects on consumers’ brand views (IAB Citation2020b).

While advertisers are undoubtedly right to avoid, for example, hate speech (Olivia Citation2017) and disinformation sites (Joe Citation2023), some argue that brand safety measures are often too broad in their application, which can damage media publishers. During the onset of the COVID-19 pandemic, news publishers lost advertising revenue because brands were refusing to place their ads next to content about the pandemic, fearing negative brand association (Nick Citation2020). Research on media context effects is needed to inform advertisers on brand safety considerations, for example, whether advertising alongside articles about controversial topics on reputable sites is brand-unsafe. This research should also inform the application of AI which, with its nuanced understanding of context, has the potential to apply brand-safety measures in a precise manner, without hurting publishers and unnecessarily forgoing advertising opportunities.

Fairness

A common strategy within CA is to target consumer groups based on their content preferences. In an online environment, it is easier to choose niche content that speaks to specific demographics and to decide at the level of individual impressions when to make an offer of services. This practice has the potential to create unfair outcomes for different communities. Studies on personalized advertising have shown how discrimination based on gender (Lambrecht and Tucker Citation2019), race (Ali et al. Citation2019), and income level (Miller and Hosanagar Citation2019) can occur from algorithmic ad distribution systems that find that targeting demographic groups is beneficial when optimizing for clicks. The targeting of demographic groups indirectly through context makes such discriminatory outcomes even more difficult to identify. An AI without the capacity to explain its choices adds an additional layer of obfuscation. Practical solutions will likely have to be multidisciplinary, combining technology, self-regulation, and legal means. From the perspective of advertising research, establishing a definition of fairness in ad delivery could serve as a guide for both industry and policymakers. This includes identifying the specific circumstances where fairness becomes a crucial consideration and developing methodologies for measuring fairness.

Conclusion

In CA, advertisers exploit characteristics of the medium and surrounding content to increase the effectiveness of their messaging. A favorable ad placement can improve a variety of advertising metrics, including brand perception, ad recall, CTR, and purchase intention. The increasing use of CA can be explained by a shift in data supply. On the one hand, personal data are becoming more difficult to work with, due to the legal regulations instigated to protect consumer privacy and to the major tech platforms canceling their support for third-party cookies. On the other hand, there is an abundance of contextual data readily available that can be mined with increasing efficiency due to ongoing advances in AI, in particular in ML techniques for NLP. Contextual and personalized advertising stand to have an even greater impact when used in combination. In this setting, AI can help find the combination of personal and contextual variables that best predict advertising effectiveness.

In our review of previous work on CA, three factors stood out as being of particular importance: The first is applicability, the topical relevance between content and ad. The second is affective tone, the mood or feeling associated with the content. The third is content involvement, the consumer’s level of engagement with the content. Together and in isolation, they can be used to target groups of consumers, catering to their unique interests and current state of mind. Furthermore, the context can prime the consumer toward different association patterns and emotional responses. However, the effect of the context factors varies between media channels, and our understanding may be complicated by a host of mediating factors. These include ad clutter, ad complexity, the level of arousal the ad produces, as well as the task orientation and product involvement of the consumer. The promise of AI within CA lies in disentangling this complexity to operate safe and effective programmatic campaigns.

Today, the vast majority of digital ad spaces are traded in programmatic auctions. AI can support these transactions in several ways. In real-time bidding, AI can assess the abovementioned context factors associated with an impression: (1) the applicability of the content through topic identification, (2) the affective tone through sentiment analysis, and (3) the involvement of the consumer through correlating factors such as time spent on the website, scroll pace, or similar content previously consumed. Another AI technology is clustering, which allows the grouping of content by similarity to gain a better overview of the media landscape. Finally, reinforcement learning involves the computer using trial and error to optimize advertising campaigns toward some quantifiable objective. The main advantages of reinforcement learning are that it reduces the need for domain expertise and that it automatically adapts to changing market dynamics. Transfer learning can be used to transfer knowledge from previous campaigns to provide an improved starting point for reinforcement learning targeting to build upon. LLMs have advanced the state of the art in topic identification and sentiment analysis, enabling improved context understanding but conferring increased computational costs and copyright challenges.

There are several avenues for future work. A central item is to understand how different types of contexts prime for different associations and how these effects differ and develop with time and among cultures. To ensure transparency, fairness, and brand safety, we must strive for explainable systems and, to achieve this, new types of data clustering are needed that more faithfully reflect our human notions of similarity. In addition to technical solutions, an understanding of how marketers effectively interact with and understand AI systems is required. Furthermore, the ability to target highly specific contexts raises research questions on how brands best can leverage this. To paraphrase a seminal AI paper (Vaswani et al. Citation2017), relevance is all you need.

Disclosure statement

The second author has co-founded a university spin-off that provides contextual data to the programmatic ecosystem. While this affiliation has been beneficial for the project, offering access to industry contacts and insights, it also presents a potential conflict of interest. To mitigate this, the second author has taken care to maintain objectivity in the research underlying the paper. Additionally, she has deferred all potentially sensitive decisions about the manuscript to the first author.

Notes

1 There are alternative definitions of CA in the literature (Bleier Citation2021): Some academics (Banerjee and Dholakia Citation2008; Lian, Cha, and Xu Citation2018), regulators (CMA Citation2020), and prominent actors within the advertising industry (IAB Citation2020a) widen the term “contextual data” to include device location data, first-party data, and user sessions. We limit our treatise of contextual data to include only the content that the online user is currently consuming.

2 See, for example, the conclusions of Furnham, Bergland, and Gunter (Citation2002) versus those of Tavassoli, Shultz, and Fitzsimons (Citation1995) on the impact of content involvement or versus those of Mahdi and Furnham (Citation2016) on applicability between context and ad.

3 Definition wise, AI systems learning from data describes the subset of AI known as machine learning (ML). However, the transformative strides currently seen in the AI field at large as well as in advertising applications primarily stem from ML. ML systems, especially when discussed in industry applications, are often simply denoted as AI systems.

References

  • Aggarwal, C., and C. Zhai. 2012. “A Survey of Text Clustering Algorithms.” In Mining Text Data, edited by Thomas J. Watson, 77–128. Boston, MA: Springer.
  • Aggarwal, K., P. Yadav, and S. S. Keerthi. 2019. “Domain Adaptation in Display Advertising: An Application for Partner Cold-Start.” Proceedings of the 13th ACM Conference on Recommender Systems, Copenhagen, Denmark, 178–186.
  • Aldous, K. K., J. An, and B. J. Jansen. 2022. “What Really Matters?: Characterising and Predicting User Engagement of News Postings Using Multiple Platforms, Sentiments and Topics.” Behaviour & Information Technology 42 (5): 545–568. https://doi.org/10.1080/0144929X.2022.2030798.
  • Ali, M., P. Sapiezynski, M. Bogen, A. Korolova, A. Mislove, and A. Rieke. 2019. “Discrimination through Optimization: How Facebook’s Ad Delivery Can Lead to Biased Outcomes.” Proceedings of the ACM on Human-Computer Interaction 3 (CSCW): 1–30. https://doi.org/10.1145/3359301.
  • Bailey, W. C., and J. M. Lown. 1993. “A Cross-Cultural Examination of the Aetiology of Attitudes towards Money.” Journal of Consumer Studies & Home Economics 17 (4): 391–402. https://doi.org/10.1111/j.1470-6431.1993.tb00181.x.
  • Banerjee, S., and R. Dholakia. 2008. “Mobile Advertising: Does Location Based Advertising Work?” International Journal of Mobile Marketing 3: 68–74.
  • Bates, D. M., and D. G. Watts. 1988. Nonlinear Regression Analysis and Its Applications. Somerset, NJ: John Wiley and Sons
  • Belanche, D., C. Flavián, and A. Pérez-Rueda. 2017. “Understanding Interactive Online Advertising: Congruence and Product Involvement in Highly and Lowly Arousing, Skippable Video Ads.” Journal of Interactive Marketing 37: 75–88. https://doi.org/10.1016/j.intmar.2016.06.004.
  • Bellman, S., Z. H. Abdelmoety, J. Murphy, S. Arismendez, and D. Varan. 2018. “Brand Safety: The Effects of Controversial Video Content on Pre-Roll Advertising.” Heliyon 4 (12): e01041. https://doi.org/10.1016/j.heliyon.2018.e01041.
  • Bilenko, M., and R. W. White. 2008. “Mining the Search Trails of Surfing Crowds: Identifying Relevant Websites from User Activity.” Proceedings of the 17th International Conference on World Wide Web, 51–60. New York, NY: Association for Computing Machinery.
  • Birjali, M., M. Kasri, and A. Beni-Hssane. 2021. “A Comprehensive Survey on Sentiment Analysis: Approaches, Challenges and Trends.” Knowledge-Based Systems 226: 107134. https://doi.org/10.1016/j.knosys.2021.107134.
  • Blei, D. M., A. Y. Ng, and M. I. Jordan. 2003. “Latent Dirichlet Allocation.” Journal of Machine Learning Research 3: 993–1022.
  • Bleier, A. 2021. “On the Viability of Contextual Advertising as a Privacy-Preserving Alternative to Behavioral Advertising on the Web.” SSRN Electronic Journal. https://doi.org/10.2139/ssrn.3980001.
  • Bronner, F., and P. Neijens. 2006. “Audience Experiences of Media Context and Embedded Advertising: A Comparison of Eight Media.” International Journal of Market Research 48 (1): 81–100. https://doi.org/10.1177/147078530604800106.
  • Brown, T., B. Mann, N. Ryder, M. Subbiah, J. D. Kaplan, P. Dhariwal, A. Neelakantan, et al. 2020. “Language Models Are Few-Shot Learners.” Advances in Neural Information Processing Systems 33: 1877–1901.
  • Busch, O. 2016. Programmatic Advertising: The Successful Transformation to Automated, Data-Driven Marketing in Real-Time. 2nd ed. Berlin: Springer Verlag.
  • Bushi, S., and O. Zaïane. 2019. “APNEA: Intelligent Ad-Bidding Using Sentiment Analysis.” IEEE/WIC/ACM International Conference on Web Intelligence, 76–83. New York, NY: Association for Computing Machinery. https://doi.org/10.1145/3350546.3352503.
  • Cai, H., K. Ren, W. Zhang, K. Malialis, J. Wang, Y. Yu, and D. Guo. 2017. “Real-Time Bidding by Reinforcement Learning in Display Advertising.” Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 661–670. New York, NY: Association for Computing Machinery. https://doi.org/10.1145/3018661.3018702.
  • Campbell, M. C., and K. L. Keller. 2003. “Brand Familiarity and Advertising Repetition Effects.” Journal of Consumer Research 30 (2): 292–304. https://doi.org/10.1086/376800.
  • Chakrabarti, D., D. Agarwal, and V. Josifovski. 2008. “Contextual Advertising by Combining Relevance with Click Feedback.” Proceedings of the 17th International Conference on World Wide Web, 417–426. New York, NY: Association for Computing Machinery. https://doi.org/10.1145/1367497.1367554.
  • Chen, J., B. Sun, H. Li, H. Lu, and X.-S. Hua. 2016. “Deep CTR Prediction in Display Advertising.” Proceedings of the 24th ACM International Conference on Multimedia, 811–820. New York, NY: Association for Computing Machinery. https://doi.org/10.1145/2964284.2964325.
  • Cheong, Y., F. de Gregorio, and K. Kim. 2010. “The Power of Reach and Frequency in the Age of Digital Advertising.” Journal of Advertising Research 50 (4): 403–415. https://doi.org/10.2501/S0021849910091555.
  • Choi, J.-A., and K. Lim. 2020. “Identifying Machine Learning Techniques for Classification of Target Advertising.” ICT Express 6 (3): 175–180. https://doi.org/10.1016/j.icte.2020.04.012.
  • CMA. 2020. “Online Platforms and Digital Advertising.” Competition and Markets Authority. www.publishing.service.gov.uk
  • Dahlén, M. 2005. “The Medium as a Contextual Cue: Effects of Creative Media Choice.” Journal of Advertising 34 (3): 89–98. https://doi.org/10.1080/00913367.2005.10639197.
  • Dave, P. 2021. “Google Delays Chrome’s Blocking of Tracking Cookies to Late 2023.” www.reuters.com
  • Epsilon. 2020. “Preparing for a World Without Third-Party Cookies.” www.epsilon.com
  • Fan, T.-K., and C.-H. Chang. 2009. “Sentiment-Oriented Contextual Advertising.” Knowledge and Information Systems 23 (3): 321–344. https://doi.org/10.1007/s10115-009-0222-2.
  • Fernandez, K. V., and D. L. Rosen. 2000. “The Effectiveness of Information and Color in Yellow Pages Advertising.” Journal of Advertising 29 (2): 61–73. https://doi.org/10.1080/00913367.2000.10673609.
  • Fuchs, D. A. 1964. “Two Source Effects in Magazine Advertising.” Journal of Marketing Research 1 (3): 59–62. https://doi.org/10.1177/002224376400100309.
  • Furnham, A., J. Bergland, and B. Gunter. 2002. “Memory for Television Ads as a Function of Advertisement-Programme Congruity.” Applied Cognitive Psychology 16 (5): 525–545. https://doi.org/10.1002/acp.812.
  • Garnett, O. 2018. “Own a Brand? Why You Should Pay Attention to Programmatic in 2019.” www.forbes.com
  • Goel, V. 2022. “Get to Know the New Topics API for Privacy Sandbox.” www.blog.google/produces/chrome/
  • Goldberg, M. E., and G. J. Gorn. 1987. “Happy and Sad TV Programs: How They Affect Reactions to Commercials.” Journal of Consumer Research 14 (3): 387–403. https://doi.org/10.1086/209122.
  • Google. 2023. “Bard: A Conversational AI Tool by Google.” www.bard.google.com
  • Han, S.-P., and S. Shavitt. 1994. “Persuasion and Culture: Advertising Appeals in Individualistic and Collectivistic Societies.” Journal of Experimental Social Psychology 30 (4): 326–350. https://doi.org/10.1006/jesp.1994.1016.
  • Harrando, I. 2022. “Representation, Information Extraction, and Summarization for Automatic Multimedia Understanding.” PhD diss., University of Bergen.
  • Higgins, E. 1996. “Knowledge Activation: Accessibility, Applicability, and Salience.” In Social Psychology: Handbook of Basic Principles, edited by E. T. Higgins and A. W. Kruglanski, 133–168. New York: Guilford Press.
  • Hou, S., L. Chen, D. Tao, S. Zhou, W. Liu, and Y. Zheng. 2017. “Multi-Layer Multi-View Topic Model for Classifying Advertising Video.” Pattern Recognition 68: 66–81. https://doi.org/10.1016/j.patcog.2017.03.003.
  • Hsieh Nikolic, M. 2020. “Audio Ads on YouTube Expand Reach and Grow Brand Awareness.” www.blog.google.com
  • Huang, S.-l. 2014. “The Impact of Context on Display Ad Effectiveness: Automatic Attitude Activation and Applicability.” Electronic Commerce Research and Applications 13 (5): 341–354. https://doi.org/10.1016/j.elerap.2014.06.006.
  • Huang, Y.-T. 2018. “The Female Gaze: Content Composition and Slot Position in Personalized Banner Ads, and How They Influence Visual Attention in Online Shoppers.” Computers in Human Behavior 82: 1–15. https://doi.org/10.1016/j.chb.2017.12.038.
  • Huh, J., and E. C. Malthouse. 2020. “Advancing Computational Advertising: Conceptualization of the Field and Future Directions.” Journal of Advertising 49 (4): 367–376. https://doi.org/10.1080/00913367.2020.1795759.
  • IAB. 2020a. “Internet Advertising Revenue Report.” IAB. www.iab.com
  • IAB. 2020b. “Understanding Brand Safety & Brand Suitability in a Contemporary Media Landscape.” www.iab.com
  • IAB Europe. 2020. “IAB Europe Announces Results of Its Brand Safety Pol.” www.iabeurope.eu
  • IAS. 2020. “Feeling Good about Sentiment Analysis.” www.integralads.com
  • Ipsos. 2021. “IAB State of Data 2021 Quantitative Analysis Assessing Perceived vs. Actual Preparedness for the Post Third-Party Cookie and Identifier Tracking Ecosystem.” www.iab.com
  • Jelodar, H., Y. Wang, C. Yuan, X. Feng, X. Jiang, Y. Li, and L. Zhao. 2019. “Latent Dirichlet Allocation (LDA) and Topic Modeling: Models, Applications, a Survey.” Multimedia Tools and Applications 78 (11): 15169–15211. https://doi.org/10.1007/s11042-018-6894-4.
  • Joe, R. 2023. “One Year into Russia’s Invasion of Ukraine, Disinformation Sites with Ads Powered by Google Have More Than Doubled.” www.businessinsider.com
  • Jonauskaite, D., J. Wicker, C. Mohr, N. Dael, J. Havelka, M. Papadatou-Pastou, M. Zhang, and D. Oberfeld. 2019. “A Machine Learning Approach to Quantify the Specificity of Colour–Emotion Associations and Their Cultural Differences.” Royal Society Open Science 6 (9): 190741. https://doi.org/10.1098/rsos.190741.
  • Jørgensen, R. H., and G. H. Knudsen. 2022. “Media Context: A Literature Review and Research Agenda.” Journal of Marketing Management 38 (17–18): 1937–1957. https://doi.org/10.1080/0267257X.2022.2098365.
  • Kamath, U., J. Liu, and J. Whitaker. 2019. Deep Learning for NLP and Speech Recognition. Vol. 84. Cham, Switzerland: Springer.
  • Kamins, M. A., L. J. Marks, and D. Skinner. 1991. “Television Commercial Evaluation in the Context of Program Induced Mood: Congruency versus Consistency Effects.” Journal of Advertising 20 (2): 1–14. https://doi.org/10.1080/00913367.1991.10673209.
  • Kwon, E. S., K. W. King, G. Nyilasy, and L. N. Reid. 2019. “Impact of Media Context on Advertising Memory: A Meta-Analysis of Advertising Effectiveness.” Journal of Advertising Research 59 (1): 99–128. https://doi.org/10.2501/JAR-2018-016.
  • Kwortnik, R. J., and W. T. Ross. 2007. “The Role of Positive Emotions in Experiential Decisions.” International Journal of Research in Marketing 24 (4): 324–335. https://doi.org/10.1016/j.ijresmar.2007.09.002.
  • Lagun, D., and M. Lalmas. 2016. “Understanding User Attention and Engagement in Online News Reading.” Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 113–122. New York, NY: Association for Computing Machinery. https://doi.org/10.1145/2835776.2835833.
  • Lambrecht, A., and C. Tucker. 2019. “Algorithmic Bias? An Empirical Study of Apparent Gender-Based Discrimination in the Display of STEM Career Ads.” Management Science 65 (7): 2966–2981. https://doi.org/10.1287/mnsc.2018.3093.
  • LaTour, K., and M. Latour. 2009. “Positive Mood and Susceptibility to False Advertising.” Journal of Advertising 38 (3): 127–142. https://doi.org/10.2753/JOA0091-3367380309.
  • Lian, S., T. Cha, and Y. Xu. 2018. “Enhancing Geotargeting with Temporal Targeting, Behavioral Targeting and Promotion for Comprehensive Contextual Targeting.” Decision Support Systems 117: 28–37. https://doi.org/10.1016/j.dss.2018.12.004.
  • Liberali, G., and A. Ferecatu. 2022. “Morphing for Consumer Dynamics: Bandits Meet Hidden Markov Models.” Marketing Science 41 (4): 769–794. https://doi.org/10.1287/mksc.2021.1346.
  • Liu, J., C. Xia, X. Li, H. Yan, and T. Liu. 2020. “A BERT-Based Ensemble Model for Chinese News Topic Prediction.” Proceedings of the 2020 2nd International Conference on Big Data Engineering, Shanghai, China, 18–23. https://doi.org/10.1145/3404512.3404524.
  • Lu, X., X. Zhao, and L. Xue. 2016. “Is Combining Contextual and Behavioral Targeting Strategies Effective in Online Advertising?” ACM Transactions on Management Information Systems 7 (1): 1–20. https://doi.org/10.1145/2883816.
  • Luan, Y., and S. Lin. 2019. “Research on Text Classification Based on CNN and LSTM.” 2019 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA), Dalian, China, 352–355. https://doi.org/10.1109/ICAICA.2019.8873454.
  • Mahdi, S., and A. Furnham. 2016. “The Effects of a Health-Based Context on Memory for Advertisements.” Psychology 07 (08): 1163–1173. https://doi.org/10.4236/psych.2016.78117.
  • Mao, E., and J. Zhang. 2015. “What Drives Consumers to Click on Social Media Ads? The Roles of Content, Media, and Individual Factors.” 2015 48th Hawaii International Conference on System Sciences, Kauai, HI, USA, 3405–3413.
  • Menear, K. 2023. “ChatGPT Turbo vs ChatGPT Plus: Testing the First Priority Feature.” www.medium.com
  • Miller, A. P., and K. Hosanagar. 2019. “How Targeted Ads and Dynamic Pricing Can Perpetuate Bias.” www.hbr.org.
  • Nair, L. R., and S. D. Shetty. 2017. “Streaming Big Data Analysis for Real-Time Sentiment Based Targeted Advertising.” International Journal of Electrical and Computer Engineering 7 (1): 402–407. https://doi.org/10.11591/ijece.v7i1.pp402-407.
  • Nick, M. 2020. “Coronavirus: When Using the c-Word Gets You Blacklisted.” www.dw.com
  • Norris, C. E., and A. M. Colman. 1992. “Context Effects on Recall and Recognition of Magazine Advertisements.” Journal of Advertising 21 (3): 37–46. https://doi.org/10.1080/00913367.1992.10673374.
  • Oh, H.-J., C. Lee, and C.-H. Lee. 2012. “Analysis of the Empirical Effects of Contextual Matching Advertising for Online News.” ETRI Journal 34 (2): 292–295. https://doi.org/10.4218/etrij.12.0211.0171.
  • Olivia, S. 2017. “Google’s Bad Week: YouTube Loses Millions as Advertising Row Reaches US.” www.theguardian.com
  • OpenAI. 2022. “OpenAI: Introducing ChatGPT.” www.openai.com/blog/chatgpt
  • OpenAI. 2023. “Pricing.” Accessed December 10, 2023. www.openai.com/pricing
  • Özgüner, H. 2011. “Cultural Differences in Attitudes towards Urban Parks and Green Spaces.” Landscape Research 36 (5): 599–620. https://doi.org/10.1080/01426397.2011.560474.
  • Pelsmacker, P. D., M. Geuens, and P. Anckaert. 2002. “Media Context and Advertising Effectiveness: The Role of Context Appreciation and Context/Ad Similarity.” Journal of Advertising 31 (2): 49–61. https://doi.org/10.1080/00913367.2002.10673666.
  • Qin, X., and Z. Jiang. 2019. “The Impact of AI on the Advertising Process: The Chinese Experience.” Journal of Advertising 48 (4): 338–346. https://doi.org/10.1080/00913367.2019.1652122.
  • Qiu, G., J. Bu, C. Chen, and F. Zhang. 2009. “Intelligent Advertising for User Generated Content through Sentiment Analysis.” 2009 International Conference on Advances in Social Network Analysis and Mining, Athens, Greece, 330–333.
  • Research and Markets. 2023a. “Contextual Advertising Global Market to Reach $562.1 Billion by 2030: Contextual Advertising and Personalization Evolve into Critical Aspects in Building a Comprehensive Ad Campaign Strategy.” Accessed 2023-09-06. https://www.linkedin.com/pulse
  • Research and Markets. 2023b. “The Contextual Advertising Market.” Accessed 2023-09-06. https://www.researchandmarkets.com/report/contextual-advertising
  • Saraee, E., M. Jalal, and M. Betke. 2020. “Visual Complexity Analysis Using Deep Intermediate-Layer Features.” Computer Vision and Image Understanding 195: 102949. https://doi.org/10.1016/j.cviu.2020.102949.
  • Schulz, H., and S. Behnke. 2012. “Deep Learning.” KI—Künstliche Intelligenz 26 (4): 357–363. https://doi.org/10.1007/s13218-012-0198-z.
  • Seb, J. 2023. “The Guardian Will Be Paid for Permitting the Collection of Contextual Data on Its Site.” www.digiday.com
  • Shapiro, S., D. Macinnis, and C. Park. 2013. “Understanding Program Induced Mood Effects: Decoupling Arousal from Valence.” Journal of Advertising 31 (4): 15–26. https://doi.org/10.1080/00913367.2002.10673682.
  • Shen, F., and Q. Chen. 2007. “Contextual Priming and Applicability: Implications for Ad Attitude and Brand Evaluations.” Journal of Advertising 36 (1): 69–80. https://doi.org/10.2753/JOA0091-3367360105.
  • Sobreanis, L. 2021. “Kargo Reinvents Advertising with AI-Driven Contextual Targeting.” www.ibm.com
  • Song, J. H. 2014. “Are Contextual Advertisements Effective? The Moderating Role of Complexity in Banner Advertising.” International Journal of Advertising 33: 351–371.
  • Sutton, R. S., and A. G. Barto. 2018. Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press.
  • Tagami, Y., S. Ono, K. Yamamoto, K. Tsukamoto, and A. Tajima. 2013. “CTR Prediction for Contextual Advertising: Learning-to-Rank Approach.” Proceedings of the Seventh International Workshop on Data Mining for Online Advertising. New York, NY: Association for Computing Machinery.
  • Tai, H.-T., and S.-S. Chang. 2005. “The Causal Model of Internet Advertising Effectiveness.” Asia Pacific Management Review 10: 78–92.
  • Tavassoli, N. T., C. J. Shultz, and G. J. Fitzsimons. 1995. “Program Involvement: Are Moderate Levels Best for Ad Memory and Attitude toward the Ad?” Journal of Advertising Research 35: 61–72.
  • TikTok. 2022. “TikTok Pulse: Bringing Brands Closer to Community and Entertainment.” www.newsroom.tiktok.com
  • Tolomei, G., M. Lalmas, A. Farahat, and A. Haines. 2019. “You Must Have Clicked on This Ad by Mistake! Data-Driven Identification of Accidental Clicks on Mobile Ads with Applications to Advertiser Cost Discounting and Click-through Rate Prediction.” International Journal of Data Science and Analytics 7 (1): 53–66. https://doi.org/10.1007/s41060-018-0122-1.
  • Touvron, H., T. Lavril, G. Izacard, X. Martinet, M.-A. Lachaux, T. Lacroix, and B. Rozière, et al. 2023. “LLaMA: Open and Efficient Foundation Language Models.” arXiv preprint arXiv:2302.13971.
  • Truong, Y., R. Mccoll, and P. Kitchen. 2010. “Practitioners’ Perceptions of Advertising Strategies for Digital Media.” International Journal of Advertising 29 (5): 709–725. https://doi.org/10.2501/S0265048710201439.
  • Urban, G., E. MacDonald, R. Bordley, J. Hauser, and G. Liberali. 2014. “Morphing Banner Advertising.” Marketing Science 33 (1): 27–46. https://doi.org/10.1287/mksc.2013.0803.
  • Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, … I. Polosukhin. 2017. “Attention Is All You Need.” In Advances in Neural Information Processing Systems, edited by I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, Vol. 30, 6000–6010. Red Hook, NY: Curran Associates, Inc.
  • Venture Beat. 2023. “30 Billion Times a Day, Google Runs an Ad.” www.venturebeat.com/business
  • Voorveld, H. A., G. van Noort, D. G. Muntinga, and F. Bronner. 2018. “Engagement with Social Media and Social Media Advertising: The Differentiating Role of Platform Type.” Journal of Advertising 47 (1): 38–54. https://doi.org/10.1080/00913367.2017.1405754.
  • Wang, J., and Y. Chen. 2023. Introduction to Transfer Learning: Algorithms and Practice. Singapore: Springer.
  • Wang, R., Z. Li, J. Cao, T. Chen, and L. Wang. 2019. “Convolutional Recurrent Neural Networks for Text Classification.” 2019 International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 1–6. https://doi.org/10.1109/IJCNN.2019.8852406.
  • Weber, E. U., and C. Hsee. 1998. “Cross-Cultural Differences in Risk Perception, but Cross-Cultural Similarities in Attitudes towards Perceived Risk.” Management Science 44 (9): 1205–1217. https://doi.org/10.1287/mnsc.44.9.1205.
  • Wen, T. J., C.-H. Chuan, J. Yang, and W. S. Tsai. 2022. “Predicting Advertising Persuasiveness: A Decision Tree Method for Understanding Emotional (In)Congruence of Ad Placement on YouTube.” Journal of Current Issues & Research in Advertising 43 (2): 200–218. https://doi.org/10.1080/10641734.2021.1963356.
  • Wood, L. 2022. “Stringent Data Privacy Regulations Give Impetus to Contextual Targeting.” www.globenewswire.com
  • Xu, G., Y. Meng, Z. Chen, X. Qiu, C. Wang, and H. Yao. 2019. “Research on Topic Detection and Tracking for Online News Texts.” IEEE Access 7: 58407–58418. https://doi.org/10.1109/ACCESS.2019.2914097.
  • Yadati, K., H. Katti, and M. Kankanhalli. 2014. “CAVVA: Computational Affective Video-in-Video Advertising.” IEEE Transactions on Multimedia 16 (1): 15–23. https://doi.org/10.1109/TMM.2013.2282128.
  • Yadav, A., and D. K. Vishwakarma. 2020. “Sentiment Analysis Using Deep Learning Architectures: A Review.” Artificial Intelligence Review 53 (6): 4335–4385. https://doi.org/10.1007/s10462-019-09794-5.
  • Yang, Q., M. Ongpin, S. Nikolenko, A. Huang, and A. Farseev. 2023. “Against Opacity: Explainable AI and Large Language Models for Effective Digital Advertising.” Proceedings of the 31st ACM International Conference on Multimedia, 9299–9305. New York, NY: Association for Computing Machinery. https://doi.org/10.1145/3581783.3612817.
  • Yi, X., L. Hong, E. Zhong, N. Liu, and S. Rajan. 2014. “Beyond Clicks: Dwell Time for Personalization.” Proceedings of the 8th ACM Conference on Recommender Systems (RecSys 2014), Foster City, Silicon Valley, California, USA 113–120.
  • Yi, Y. 1990. “Cognitive and Affective Priming Effects of the Context for Print Advertisements.” Journal of Advertising 19 (2): 40–48. https://doi.org/10.1080/00913367.1990.10673186.
  • Yun, J. T., C. M. Segijn, S. Pearson, E. C. Malthouse, J. A. Konstan, and V. Shankar. 2020. “Challenges and Future Directions of Computational Advertising Measurement Systems.” Journal of Advertising 49 (4): 446–458. https://doi.org/10.1080/00913367.2020.1795757.
  • Zanjani, S., W. Diamond, and K. Chan. 2011. “Does Ad-Context Congruity Help Surfers and Information Seekers Remember Ads in Cluttered E-Magazines?” Journal of Advertising 40 (4): 67–84. https://doi.org/10.2753/JOA0091-3367400405.
  • Zhang, K., and Z. Katona. 2012. “Contextual Advertising.” Marketing Science 31 (6): 980–994. https://doi.org/10.1287/mksc.1120.0740.
  • Zhang, Y., A. Surendran, J. Platt, and M. Narasimhan. 2008. “Learning from Multi-Topic Web Documents for Contextual Advertisement.” Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Las Vegas, Nevada, USA, 1051–1059. https://doi.org/10.1145/1401890.1402015.
  • Zhao, H., D. Phung, V. Huynh, Y. Jin, L. Du, and W. Buntine. 2021. “Topic Modelling Meets Deep Neural Networks: A Survey.” In Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21, Montreal, edited by Z.-H. Zhou, 4713–4720. International Joint Conferences on Artificial Intelligence Organization. https://www.ijcai.org/proceedings/2021/638