12,228
Views
1
CrossRef citations to date
0
Altmetric
Research Article

Digital Transformation of the Automotive Industry: An Integrating Framework to Analyse Technological Novelty and Breadth

&

ABSTRACT

Research demonstrates that digital technologies stimulate industrial transformation by enabling new interdependencies with firms outside and across firm and industry boundaries. However, we know little about the degree of novelty and breadth of digital technologies that have the potential to transform industries. Understanding the degree of novelty (spanning from radical to incremental) and breadth (spanning from one sector to multiple sectors) of digital technologies is important for measuring their impact on industrial transformation. Through a topic modelling research approach on autonomous vehicle technology patents from firms operating in Sweden and a confirmatory survey with the inventors of those patents, this paper reveals 26 digital technology topics that are transforming the automotive industry. The digital technology topics are distributed across four ideal-typical technology categories for transformation: augmenting, spanning, transforming, and disrupting. This study illustrates the value of studying digital technologies’ transformative nature using an integrating framework; it reveals that digital technologies in the automotive industry have mainly incremental characteristics but that these characteristics provide necessary preconditions for the few more radical technologies to achieve their potential in transforming the industry.

1. Introduction

Numerous scholars argue that the emergence of digital and artificial intelligence (AI) technologies and new abilities associated with implementing them are vital for firms’ activities in the digital age (Iansiti and Lakhani Citation2020; Menz et al. Citation2021; Dąbrowska et al. Citation2022; Bailey et al. Citation2022). However, the ‘novelty’ and ‘breadth’ of such technologies remain less explored. Debates are emerging around whether the novelty of digital technologies is more likely to be ‘radical’ by stimulating the creation of new market opportunities by dramatically changing the direction of industries (Nambisan, Wright, and Feldman Citation2019; Yoo et al. Citation2012) or ‘incremental’ by augmenting or automatising the capabilities of technologies (Lebovitz, Lifshitz-Assaf, and Levina Citation2022; Raisch and Krakowski Citation2021). Scholars also discuss whether the breadth of digital technologies is more likely to be for one sector or to span multiple sectors, arguing for potential complementarity to create modifications across several industries and products (Teece Citation2018). One example of digital technologies with a wide array of applications is movement recognition technologies, which can be applied in autonomous vehicles, allow robots to work alongside people, and improve wellbeing at home.

Although there are several well-documented examples of digital technologies transforming industries, including the health (Lebovitz, Lifshitz-Assaf, and Levina Citation2022; Coccia Citation2020), automotive (Svahn, Mathiassen, and Lindgren Citation2017), media (Anand Citation2016), and cosmetic (Lopez-Vega and Lakemond Citation2022) industries, researchers primarily explain the characteristics of single emerging digital technologies and do not study the impact of a whole range of digital technologies, which is required to transform an entire sector or industry (Röder, Both, and Hinneburg Citation2015; Kwon et al. Citation2019). Indeed, Bodrožić and Adler (Citation2022, p. 105) highlight that ‘digital transformation is not simply a basket of individual innovations: it is an interdependent cluster of revolutionary technologies, where developments in each technology affect many others’. Hence, the impact of digital technologies’ novelty and breadth on industrial transformation cannot be accounted for by the characteristics of a single digital technology. Instead, the outcome in terms of an industry digitalisation must be studied in relation to a larger cluster of technologies working with each other (Arthur Citation2009; Coccia Citation2017; Bodrožić and Adler Citation2022; Bailey et al. Citation2022; Röder, Both, and Hinneburg Citation2015). The framework presented in this paper aims to address this limitation.

In this paper, we take a novel approach to study autonomous vehicle technologies by closely investigating their technological breadth (spanning from one sector to multiple sectors) and technological novelty (spanning from incremental to radical) to unravel the interplay of digital technologies across different digital technology categories and their impact on digital transformation in the automotive industry. Specifically, this paper addresses the following research question: What characteristics of digital technologies can be depicted across the novelty and breadth dimensions to explain their impact on industry digitalization?

Empirically, this paper presents an analysis of interdependent digital technologies captured as digital technology groups. We refer to these groups as digital technology topics, and we study the characteristics and impact of these topics in relation to technological breadth and novelty. In an aggregated patent-level analysis, we employed a computational research method for natural language processing (i.e. latent Dirichlet allocation [LDA]) (Blei, Ng, and Jordan Citation2003). Specifically, we employed topic modelling (Hannigan et al. Citation2019) to obtain emerging semantic topics from patents linked to autonomous vehicle technologies in Sweden. The dataset includes 455 patents, which reveal the emerging digital technology groups that automotive companies in Sweden developed between 1970 and 2019. This method allowed us to identify the probability that companies will create knowledge related to 26 unique digital technology topics. At an aggregated level, these 26 topics represent the most relevant digital technology areas for autonomous vehicle technologies. Each topic captures a set of words that recurs across all the patents. The identified topics provide an inductive semantic analysis of all patents included in the dataset. Furthermore, we conducted an expert survey to understand the novelty of the technology topics and the breadth of their application.

Based on our findings, this paper offers two main contributions. First, this paper contributes to empirically untangling the characteristics of digital technology topics of varying technological novelty and breadth and thereby explains the pervasive characteristics associated with digital technologies in the automotive industry. Our work responds to the call to integrate solid findings about technological change and industry digitalisation (Hanelt et al. Citation2021; Warner and Wäger Citation2019; Bodrožić and Adler Citation2022; Coccia and Watts Citation2020). Particularly, our work connects to and extends the notion of creative accumulation (Bergek et al. Citation2013) by arguing that digital transformation is the outcome of introducing new creative processes to embed digital capabilities into objects as well as building new digital capabilities outside the main industry.

Second, this paper proposes an integrating framework to clarify the breadth and novelty of digital technologies and their impact on industrial transformation. The framework presents four digital technology categories: augmenting, spanning, transforming, and disrupting. The digital technologies in these four categories are not mutually exclusive but complementary and, in some instances, partially overlapping. By analytically distinguishing the novelty and breadth dimensions, we highlight the characteristics of and impact on industrial transformation in the four categories and thereby contribute to better understanding of emerging digital technologies (Daniele et al. Citation2015). The integrating framework also provides alternatives for firms to consider development and exploration in new digital technology categories.

Contrary to recent claims suggesting that digitalisation will transform or disrupt established firms (Anand Citation2016), our findings explain that digital technologies renew and extend the innovation capabilities across industries. However, the incremental renewal and improvement of the focal industry and associated technologies may provide crucial preconditions for realising the potential of radical changes. An illustrative example explored in this paper is collision prevention assistance systems, a radically new technology whose successful implementation also requires incremental improvements in other systems, such as navigation, speed control, and road friction analysis systems.

This paper is structured as follows: Section 2 discusses recent research on digital transformation and outlines a framework for addressing digital transformation based on the combined analysis of the novelty and breadth of digital technologies. Section 3 presents our computer-based text-analysis approach and methodological considerations. Section 4 presents our results. Section 5 discusses our integrating framework as well as our conclusions, study limitations, and avenues for further research.

2. Background literature and framework

This section begins with an overview of the recent literature on digital transformation and digital technologies. Section 2.2 integrates the two central dimensions of digital technologies: technological breadth and technological novelty. Finally, Section 2.3 describes four proposed categories of digital technologies based on the combination of these dimensions.

2.1. Digital transformation

The rise of digital technologies is triggering the transformation of several industries in which firms’ digital transformation activities are contingent on both the institutional conditions and firms’ innovation strategies (Vial Citation2019; Bodrožić and Adler Citation2022). So far, scholars have outlined distinct frameworks to design new digital innovation strategies and build digital innovation capabilities and forms of production for digital technologies (Bailey et al. Citation2022; Dąbrowska et al. Citation2022). They have also stressed the need to further identify new ways to comprehend the dimensions and characteristics of industry digitalisation (Menz et al. Citation2021; Nambisan et al. Citation2017; Bodrožić and Adler Citation2022). For example, it is necessary to understand how and what types of digital technologies enable industrial transformation that results in a fundamental renewal of firms’ established technology portfolios and innovation capabilities (Hanelt et al. Citation2021; McAfee and Brynjolfsson Citation2017; Utterback, Pistorius, and Yilmaz Citation2018).

Although extant research focuses on the application and benefits of digital technologies for business and innovation platforms (Anand Citation2016; Gawer Citation2021) as well as their benefits for labour productivity and sales (Cirillo et al. Citation2022), research partially disregards the challenges faced by firms in manufacturing industries and complex product industries that require the integration of digital capabilities in component- and system-related knowledge. In these industries, industrial transformation embodies innovation outside the range of expertise of the focal firm or industry and requires firms to undertake rapid technology development based on existing practices while pursuing distinct technology categories (Bergek et al. Citation2013).

Studying digital technologies at the industry level remains a large challenge because it requires new methods and theoretical lenses to explore the overarching categories and characteristics associated with these technologies. Indeed, only an aggregated meso-level perspective of digital and AI technologies and their characteristics can lead to a full explanation of industry digitalisation (Arthur Citation2009; Bailey et al. Citation2022). On this basis, digital transformation across different industries might depend on the aggregate and overlapping characteristics of individual digital technologies, which in turn form unique building blocks. The need for such meso-level studies may partially be due to the fact that ‘neither firms nor innovations, taken individually, can completely explain technology change. Instead, they must be viewed as parts of a larger system; [in which] various agents interact with each other, and institutions matter’ (Carlsson and Stankiewics Citation1991, p. 23). Bailey et al. (Citation2022) also explain that studying the relationships across technologies and other industrial actors, such as users, equipment manufacturers and vendors, data analytic firms, governments, technology companies, etc., can help uncover the fundamental digital transformation of industries. Moreover, as early-stage digital technologies like the Internet of Things, AI, blockchain, and Metaverse have already matured and are ready for wide deployment, it is less clear how these technologies will be implemented across industries and appeal to a wider set of industries and activities (Bodrožić and Adler Citation2022; Daniele et al. Citation2015).

Although some studies primarily explain the application of digital technologies in single industries, such as the health (Coccia Citation2020; Lebovitz, Lifshitz-Assaf, and Levina Citation2022), media (Anand Citation2016), and automotive industries (Svahn, Mathiassen, and Lindgren Citation2017), less attention has been devoted to the potential impact of such application across industries. Indeed, scholars lament this lack of attention paid to the benefits of the cross-sectorial application of digital technologies (Martynov Citation2020) as well as the mechanisms firms use to cross knowledge boundaries (Dougherty and Danielle Citation2012). Particularly for firms in manufacturing industries (Svahn, Mathiassen, and Lindgren Citation2017), digital technologies require both building capabilities in new technology areas that are outside existing knowledge bases and evolving existing technologies (Hanelt et al. Citation2021). Understanding the characteristics of digital technologies paves the way to study their aggregated effect on digital transformation as similar technologies can concurrently be developed in different industries and by different firms (Nambisan et al. Citation2017; Yoo et al. Citation2012).

2.2. Dimensions of digital technologies

2.2.1. Technological novelty

Scholars have recently begun to claim that the rise of digital and AI technologies, such as algorithms, sensors, and robots, has accelerated the transformation of industries and triggered a new industrial paradigm (c.f. Yoo et al. Citation2012; Bailey et al. Citation2022; Iansiti and Lakhani Citation2020; McAfee and Brynjolfsson Citation2017). However, not all digital technologies are uniformly disruptive across industries. For example, in the service and entertainment industries, digital technologies have had a profound influence on the associated industry structures and capabilities and have enabled the entrance of new players (Anand Citation2016). An example is Netflix, which changed from an online DVD-rental store to a film production company. In other sectors, particularly mature industries, such as the manufacturing and automotive industries, digital technologies can either be characterised by their radical changes or only represent an incremental product change (Benner and Tripsas Citation2012; Nambisan, Wright, and Feldman Citation2019).

Two streams of literature highlight the creation of radical technologies for technological change. One stream suggests it is large corporations with established capabilities that can launch new radical technologies, while the other stream points out the relevance of new industry entrants for radical technology development (Bergek et al. Citation2013). The first perspective primarily focuses on incremental technology evolution and learning from advances in technology – from the development of dominant design technologies to ‘incremental technology improvements which ultimately tap the full potential of the initial invention’ (Brynjolfsson and McAfee Citation2014, p. 77). For example, Dougherty and Danielle (Citation2012) suggest digital technologies complement technology progress (e.g. drug discovery in the pharmaceutical industry), and Coccia (Citation2020) highlights how artificial intelligence can be used for cancer detection. The second perspective proposes that radical innovations can be initiated by new industry entrants that tend to give rise to technological change and disrupt firms pursuing only incremental technological change (Benner and Tripsas Citation2012; Christensen et al. Citation2018). Although these research insights offer possible explanations for when technologies may have the potential to be radical or incremental within and across sectors and product markets (Malerba Citation1992), they have not been applied to settings with unspecified technology characteristics, such as digital technologies.

The literature also remains unclear on how firms develop digital technologies that could hinder or enhance established industry structures characterised by incremental changes and dominant technological architectures (Eggers and Francis Park Citation2018; Bodrožić and Adler Citation2022). One research stream suggests that technological change requires industry incumbents to pursue new business models in tandem with their old business models to maintain their market hegemony in the long term (Eklund and Kapoor Citation2019; Kapoor and Klueter Citation2021). Other streams of research focus on the antecedents (e.g. firm size and experience, complementary assets, managerial cognition, and top management characteristics) of technological change for incumbent firms (Eggers and Francis Park Citation2018; Bergek et al. Citation2013; Coccia and Watts Citation2020). According to this latter perspective, firms apparently compete by launching either radical technologies aiming to disrupt the market or incremental technologies seeking to extend the lifecycle of products (Utterback, Pistorius, and Yilmaz Citation2018; Bergek et al. Citation2013). Eggers and Francis Park (Citation2018), p. 382) suggest that ‘future research needs to identify the nature of technological change and uncover the degree of radicalness of the innovation’ in firms pursuing digital technological change.

Right now, a point of contention is the novelty of digital technologies (Raisch and Krakowski Citation2021). Some researchers conceptualise digital transformation as ‘a process that aims to improve an entity [firm or technologies] by triggering significant changes to its properties through combinations of information, communication, and connectivity technologies’ (Vial Citation2019, p. 121) where the focus is on incorporating ‘digital capabilities into objects that previously had a pure physical materiality’ (Yoo et al. Citation2012, p. 1398). Others outline the need to study the novelty of digital technologies to transform industries’ established capabilities and knowledge bases (McAfee and Brynjolfsson Citation2017; Nambisan, Wright, and Feldman Citation2019) as well as the scope of digital technologies to generate a multi-industry impact for industrial transformation. The latter perspective also stresses that firms seek to build radical technologies to reinforce their existing technology portfolios and benefit from arising market opportunities as well as to cope with changes in the technological landscape and industry structure (Coccia Citation2017, Citation2015; Kwon et al. Citation2019).

2.2.2. Technological breadth

Research underlines the effect of digital technologies on industry structure and firm dynamics as well as firms’ efforts to experiment and spread the benefits of digital technologies across different markets and technological applications with the ultimate goal of defining new trajectories of technical progress (Benner and Tripsas Citation2012). In their pursuit to develop new digital technologies, firms build technologies across one-sector or multi-sector technology areas to position their innovation portfolios in relation to the breadth of industry technologies (Kneeland, Schilling, and Barak Citation2020; Lopez-Vega, Tell, and Vanhaverbeke Citation2016). Hence, because technological expertise is expanding and scattered among firms in different industries (Kwon et al. Citation2019), researchers have tried to provide an aggregated industry framework for digital technologies.

Recent studies provide insights into the pervasive characteristics of digital technologies, such as their openness, convergence, and generativity (Nambisan, Wright, and Feldman Citation2019), and highlight their potential complementarity to create new resource combinations and modifications across multiple industries and products (Teece Citation2018). For example, these characteristics are frequently associated with platform technologies, distributed innovation, and combinatorial innovation, which affect the patterns of technology development (Nambisan et al. Citation2017; Yoo et al. Citation2012). In particular, the system integration/generativity perspective (c.f. Thomas and Tee Citation2022) explains how firms use boundary mechanisms to shape the nature and extent of digital technologies by recombining them across multiple technology domains (Nambisan, Wright, and Feldman Citation2019; Brynjolfsson and McAfee Citation2014). For example, Lee and Berente (Citation2012) explain that due to the increasing complexity of products and the need to integrate and coordinate work on digital technologies between original equipment manufacturers and suppliers, mature ‘digital control system’ technologies with generative capabilities have been developed to enable the transformation of multiple industries, including the manufacturing, automotive, and telecommunications industries, which can gain multiple benefits from a single digital technology.

Also, some studies highlight that digital technologies, such as 3D printers, sensors, and cameras, could have a broad range of applications in areas like robotics and computer vision (Yoo et al. Citation2012). In the automotive industry, Svahn et al. (Citation2017) use the example of Volvo Cars’ Connected Car initiative (i.e. in-car entertainment system) to explain four competitive concerns – innovation capability, innovation focus, innovation collaboration, and innovation governance – that enhance the generative capabilities of digital technologies. This study of Volvo presents both tensions during the development of digital technologies and mechanisms to cope with potential digitalisation pitfalls. Despite the relevance of this particular digital technology, its potential for multi-sector application remains limited to the vehicle industry, and other potential digital technologies are not considered (e.g. speed control systems, cameras, sensors). Combining the technological novelty and breadth dimensions, the following section proposes a framework to understand different technology categories

2.3. An integrating framework for industry digitalization

The integrated study of technological breadth and novelty provides four ideal-typical categories of digital technologies’ impact on industry digitalisation: (1) augmenting, (2) spanning, (3) transforming and (4) disrupting (see ). The purpose of the proposed typology and the respective categories is to disentangle the main characteristics to which those digital technologies contribute. By highlighting the main objectives and characteristics of the respective categories, we arrive at more nuanced insights into their impact on transformation than the well-established one-dimensional distinctions between incremental vs. radical and one- sector vs. multi-sector. In turn, these insights help advance knowledge about digital technologies as drivers of innovation in different applications.

Figure 1. Digital Technology paths.

Figure 1. Digital Technology paths.

First, incremental change and upgrading by implementing new improved technologies within one sector characterise one of the most commonly recognised technology development categories – referred to as the augmenting category in this framework. This category represents what could be described as the digitalisation of previously non-digital technologies (i.e. physical products or analyses) (Raisch and Krakowski Citation2021; Lebovitz, Lifshitz-Assaf, and Levina Citation2022). Historically notable examples include the introduction of digital technologies in applications in the film, camera, and broadcasting industries. At first, this change could be described as the automatisation or augmentation of processes with the aim to improve quality and efficiency. However, gradually, this upgrading leads the focal industry to a tipping point at which those actors who have not embraced the new technology cannot survive in the new competitive landscape arising from the digital path becoming entirely dominant (Benner and Tripsas Citation2012). While we describe the technology breadth of this augmenting path as involving one sector, it should be clarified that the same or similar technologies – here referred to as platform technologies (Gawer Citation2021) – may develop several one-sector applications (e.g. platform applications).

Second, incremental change and upgrading by implementing improved technologies within several sectors characterise the category referred to as the spanning category in our framework. This ideal-typical category may be hard to distinguish from the augmenting category in empirical analyses in the sense that it does not represent a radical leap in technological novelty but rather gradual improvements in what already exists through the infusion of new digital technologies (Lopez-Vega, Tell, and Vanhaverbeke Citation2016). What mainly distinguishes this category from the augmenting category is that the new digital technologies are initially developed and implemented in other sectors and then spill over to new sectors, which, by definition, implies a wider scope of applicability than the technologies characterising the augmenting category (Thomas and Tee Citation2022). A wide range of information technology-based innovations fall into this ideal-typical category (e.g. movement sensors, vision technologies). As opposed to the augmenting category, which mainly entails gradual improvements in already existing applications in well-defined sectors, until the tipping point referred to above arises, the spanning category leads to the creation of new sectors with new markets and new types of factors, such as (de)centralised platforms and ecosystems that did not previously have a role in the respective system (Gawer Citation2021).

Third, the transforming category refers to the development and application of radically new technologies in single sectors or within established technology paradigms, thereby transforming those sectors in a more fundamental manner than the augmenting category. Historically notable examples include the development and application of recombinant DNA, which radically changed parts of the pharmaceutical industry by opening a new path for implementing biotechnology-based drugs. While not necessarily revolutionary beyond the actual application, the development of this technology made it possible to address health challenges previously not within reach. Other examples are the development of biobased energy and speciality chemicals, which made those respective sectors partly abandon the temperature and pressure logic in favour of biological processes while maintaining largely the same range of products and applications and opening up fossil-free raw materials and cleaner production. In contrast to the augmenting category, in which such a shift happens through a gradual process leading up to a tipping point, the shift in this category appears at the very outset. This means that two distinct development logics exists in parallel without one necessarily becoming dominant over time.

Finally, radically new technologies with multi-sector application seem to be very rare and sometimes breakthroughs. In this ideal-typical framework, we refer to this type as the disrupting category. It resembles characteristics of the transforming category with regards to the degree of novelty and thereby impact in terms of change potential. However, the multi-sector scope of these radical technological advancements also implies significant differences from the transforming category. The disrupting category also resembles characteristics of the spanning category, particularly in regards to the simultaneous applicability in a variety of sectors. However, the combination of radical novelty and multi-sector scope makes this category unique. Historically notable examples of advancements that fall into this category include steam and combustion engines, semiconductors, and platform technologies based on bio- and nano- science. Notably, most other technologies sometimes referred to as disruptive due to their revolutionary impact fall into the transforming category according to our framework. It is the disruptive technologies’ ability to dramatically change the directionality of a multitude of sectors that makes them revolutionary, not their degree of novelty.

In the following, we present an analysis of change trajectories in the automotive industry in Sweden. The purpose of the analysis is to apply the framework presented above in an empirical field with the aim to further decompose the respective digital technology categories, thereby advancing our understanding of the contribution of individual firms and their technological advancements to industry digitalisation.

3. Research design and method

3.1. Sample selection and research setting

Our research process started with identifying digital innovations across multiple industries in Sweden. First, we accessed the SWINNO databaseFootnote1, which provides an overview of firms’ innovations in Sweden from 1971 to 2017 (Sjöö et al. Citation2014). Although this dataset allows researchers to identify the market sector, collaborating actors, technological complexity, and novelty of the technology for the firm and market for multiple companies, we were mostly interested in the name and description of digital innovations. After reading the description of the innovations from the SWINNO database which is written in Swedish and collected from national magazines (i.e. Ny Teknik, Verkstäderna, Elektroniktidningen), we categorised all 4,507 innovations and identified 1,205 digital innovations for the 1971 to 2017 period. From the 1,205 digital innovations, we inductively distinguished six areas: (1) robotics, (2) cloud- based and connectivity, (3) Internet-of-things, (4) computers, (5) electrification and (6) autonomous vehicles.

We further categorised these innovations across the following industries: automotive, manufacturing, telecommunications, transportation and logistics, agriculture, forest, steel and metal, textile and confection, minerals, chemicals, plastics, food industry, and pharmaceuticals. These actions allowed us to identify numerous digital innovations from the automotive industry (e.g. vehicle connectivity, vehicle-to-vehicle infrastructure) as well as from other industries connected to the automotive industry (e.g. computer vision, machine learning, robotics). We also observed digital innovations in the telecommunications and manufacturing industries that have the potential to be used into the automotive industry, and vice-versa. This analysis provided an initial set of keywords to understand the major changes in the automotive industry in Sweden such as vehicle connectivity, driverless vehicles, and computer vehicles. See for more details.

Table 1. Summary of Digital Technologies Reviewed from the SWINNO Database.

Next, we reviewed publicly available consultancy reports and industry magazines to obtain a thorough understanding of the development and complexity of digital technologies and digital transformation in the automotive industry globally. We identified four industry trends: (1) autonomous vehicles, (2) electrification, (3) car sharing and connected vehicles, and (4) power plans. Finally, we concluded that autonomous vehicle technologies (see ) represent the technologies with the largest number of inventions, disruption potential, and dynamism in Sweden, which is why we selected this area to focus on.

Table 2. Main Trends in Autonomous Vehicle Technologies.

The findings from our desktop research and the SWINNO database resulted in a set of keywords that would enable us to further search and identify digital technologies at the European Patent Office. In total, we identified 1,452 autonomous vehicle patents in Sweden. After reading the abstracts of all the patents, we eliminated patents that did not relate to autonomous vehicle technologies, duplicates (patents that are registered in several countries with different codes), and patents missing a detailed invention description (i.e. only the abstract was available). To study a homogeneous group of digital technologies (i.e. same institutional setting, requirements), we only included patents with at least one inventor working in Sweden for a Swedish or foreign company. This resulted in a total set of 455 patents for further analysis. Patent data were used to identify technology domains, company names, inventor names, (co-) citations, and – most importantly – the patent description.

3.2. Analytical approach

The data analysis for this research comprised using a computational research method – topic modelling (Hannigan et al. Citation2019) – and a survey to determine the technological novelty and breadth of the identified digital technologies.

3.2.1. Data analysis and content coding with topic modelling

Recently, the social sciences have benefited from advancements in data science methods (Salganik Citation2018; Grimmer, Roberts, and Stewart Citation2022), which, through the use of a large data scope and data granularity, have offered management scholars new ways to find answers to ‘new questions’ (George et al. Citation2016; Antons et al. Citation2020; Coccia Citation2020). Specifically, machine learning is employed for supervised learning and unsupervised learning. Unsupervised learning is used for inductive analysis when data categories are unknown. In this study, we used unsupervised machine learning to analyse the text of the 455 autonomous vehicle patents. To analyse the data collected, we opted for a topic-modelling procedure that has captured the interest of management scholars because it allows text to be analysed as data (Antons et al. Citation2020; George et al. Citation2016). Compared to other established qualitative methods, this method allows researchers to use machine learning to identify themes that are latent across a large set of documents and to identify the composition themes in each document (Hannigan et al. Citation2019).

Topic modelling was carried out in two steps: first, we created the text corpus based on the text files of the selected patents, and second, we generated the topic model and derived analytical results from it. These two main steps have several sub-steps that had to be performed to complete the topic modelling, resulting in meaningful topics and thus a good basis for subsequent analysis (see Appendix 1 for the topic model phases). All steps for creating the corpus and building the topic model, including some parts of the analysis, were performed using Python 3.7 and pre-developed public libraries, such as nltk and genism, which are commonly used to perform text-analysis tasks and are described in detail in the respective steps. For a detailed explanation about the dataframe, data corpus, and topic modelling, read Appendix 2.

3.2.2. Survey analysis: topic novelty and breadth

We conducted a survey with the inventors of the patents to understand their perceptions of the technological novelty and breadth of the 26 identified topics. First, we used the library WordCloudFootnote2 to create a wordcloud for each topic, which is a cloud of words wherein the size of each word corresponds to the probability of the word occurring in the topic (See Appendix 3). We generated the wordclouds for the 26 topics to facilitate the inventors’ validation of the topics. Second, we designed a survey asking the inventors to assess the technological novelty and breadth of two wordclouds and respond to the following: (1) ‘Does the wordcloud capture (a) an incremental technology or (b) a radical technology?’ and (2) ‘Does the wordcloud capture a technology applicable (a) to a single technology sector or (b) to two or more technology sectors?’ For the topic assessments, each inventor received a unique link to two wordclouds in which her or his patents had the highest probability of appearing. No indication of the content of the topics or suggested names were provided to the inventors. This method allowed us to analyse experts’ assessments of the novelty and breadth of each topic. In total, we received 192 responses for the 26 topics.

4. Findings

We used the library pyLDAvisFootnote3 to visualise an inter-topic distance map and an interactive and multidimensional scale representation of all topics. This web-based interactive visualisation tool also allowed us to capture the most relevant terms associated with each individual topic as well as understand the topic-term relationships in a fitted LDA model, including their proximity to each other as well as how they differ from each other. provides a global view of the topic model. plots the topics as circles in the two-dimensional plane whose centres are determined by computing the distance between topics. In , each topic’s overall prevalence (i.e. marginal topic distribution) is calculated using the areas of the circles (i.e. the size of the bubble) which is presented in decreasing order of prevalence from 10% for topics 1 and 2 to 1% for topic 26. The pyLDAvis library allowed us to make each topic human interpretable as they enabled us to see which technology characteristics made up a certain topic (see Appendix 3) as well as their relation, proximity to other topics and clusters of topics.

Figure 2. Clusters of Autonomous Vehicle Technologies.

Figure 2. Clusters of Autonomous Vehicle Technologies.

We analysed the 26 most significant topics covering all patents as well as calculated the development and relevance of the topics over time to understand their evolution (see Appendix 4). We noticed that 26 topics captures the highest distribution of the patents (read Appendix 2 for more details). Next, we asked three independent practitioners with experience in the automotive industry to help name the 26 topics (see and ). Thereafter, the practitioners also named the recognisable digital technology clusters – driving assistance systems; routing, positioning, and environmental monitoring; and data, connectivity, and in- car technology (see ).

Figure 3. Topics of the autonomous vehicle technologies.

Figure 3. Topics of the autonomous vehicle technologies.

Table 3. Novelty and Breadth of Autonomous Vehicle Technologies.

As computer-based research methods require careful interpretation and validation of the results (McKenny et al. Citation2018), our survey results helped to further validate and categorise the independent practitioners’ analysis. The survey results confirm that most digital technology topics are incremental and equally applicable to one sector (i.e. the automotive) or multiple sectors. Our respondents categorised only Topic 5 (new collision prevention technology for autonomous driving), which is part of the driving assistance systems cluster, as radical and applicable to multiple sectors. Topics 18 and 24 (device-connecting function for communication and device to recognise light sources) were equally radical and incremental and applicable to multiple sectors. Finally, surprisingly, our survey results indicate that none of the topics were radical and applicable only to the automotive industry. See for the typology with the results.

Figure 4. Digital Technology Categories for Autonomous Vehicle Technologies.

Figure 4. Digital Technology Categories for Autonomous Vehicle Technologies.

To further understand the technology category of each topic in relation to the potential breadth and novelty of application, we integrated the patent-level data with the 26 topics. As part of our analysis, in Appendix 5., we calculated the average of the total number of Cooperative Patent Classification (CPC) codes from all the patents for each topic, the average of the total unique CPC codes (maximum of seven CPC categories and minimum of one CPC category), the average for each CPC category, and the average number of backwards citations for each topic. CPC Code B is most related to the automotive industry. Appendix 6 shows the percentages for all the unique CPC codes.

For incremental topics, our results highlight a convincing difference between topics with one- sector application and multi-sector application. In the augmenting category (incremental and one-sector application), the topics have a low average number of total unique CPC codes, which confirms that these topics are concentrated among a few distinct technology sectors (i.e. the transportation sector and one other sector), as well as a low average number of backwards citations, indicating that the respective patent might not be based on knowledge from multiple domain areas. For example, Topics 1, 11, 21, and 26 have a low average number of unique CPC codes, which hints at the narrow range of application for these topics compared to other topics. Moreover, for the operations and transportation technology domain (i.e. CPC Code B), the aforementioned topics have large technology-domain depth in the transportation sector compared to topics in other paths. These topics correspond to different topic clusters – driving assistance systems and routing, positioning, and environmental monitoring – which confirms the validity of our results.

The topics in the spanning category (incremental and multi-sector application) have a higher average number of total unique CPC codes overall, which confirms that they cover multiple technology domains. The technology domains correspond to all the technology classifications (i.e. seven CPC codes but a lesser degree to CPC Code A, human necessities, and CPC Code E, fixed construction) and do not exclusively emerge from the transportation sector (i.e. CPC Code B) as the main technology domain. Our results also highlight that in most cases, the CPC code for transportation has a lower value than those of the other technology codes (e.g. electricity, physics, mechanical engineering). These incremental technology domains appear in all the topic clusters.

The disruptive category (radical and multi-sector application) shows the highest average number of unique CPC codes (2.09), indicating at least two technology classifications are relevant for patents in this path. We calculated the average number of the emerging technology classification for all the topics which is a novelty category assigned to patents that are expected to have a significant economic social and impact. For this paper, Topic 5 (collision prevention assistance) had the highest number (0.27), which indicates that this topic contains patents with the highest potential for disruption. Finally, Topics 6, 10, 18, and 24 are distributed between two digital technology categories, providing more nuance about the characteristics of different technology categories. These topics have the characteristic of being primarily distributed between two distinct technology classifications and a low presence across different technology sectors (lower average number of unique CPC codes).

5. Discussion and conclusions

The present study aimed to respond to the need for conceptual developments in relation to the novelty and breadth dimensions of digital technologies and how they influence industry digitalisation (Dąbrowska et al. Citation2022; Lanzolla et al. Citation2020; Lebovitz, Lifshitz-Assaf, and Levina Citation2022). Our findings broaden the way we think about single emerging technologies and digital technologies in particular (Röder, Both, and Hinneburg Citation2015; Kapoor and Klueter Citation2021; Bailey et al. Citation2022; Cirillo et al. Citation2022) because it demonstrates that industry digitalisation is the outcome of introducing new creative processes to embed digital capabilities into physical objects as well as building new digital capabilities outside a firm’s main industry. This paper offers two distinct contributions: namely, we propose an integrating framework to clarify the novelty and breadth of digital technologies and their impact on industry digitalisation of the automotive industry and provide a depiction of digital technologies at the meso-level.

The first contribution of this study shows that digital technologies (referred to as topics and categories of clustered topics in this paper) are primarily incremental and emerge from technologies beyond the automotive industry, including telecommunications, consumer products, defence, and machine manufacturing. While this finding is not surprising in itself, it is an important insight that calls for further and more comprehensive analyses to fully capture the interindustry dynamics of digital transformation even in cases where the main focus is on particular applications or industries (e.g. Brynjolfsson and McAfee Citation2014; Iansiti and Lakhani Citation2020). The proposed integrating framework of digital technology categories helps explain the impact of such technologies on industry digitalisation based on a combined analysis of novelty and breadth. The analysis brings attention to the importance of incrementally improving applications with digital technologies, referred to as the augmenting category in the framework, and introduces two additional functions of digital technologies, referred to as the spanning and disrupting categories. Understanding these categories more fully, especially their combinations, fulfils the need for such cross-sector analyses and enriches scholarship on technological change in general (Dosi Citation1984) and digital transformation in particular (Bodrožić and Adler Citation2022; Coccia and Watts Citation2020; Dąbrowska et al. Citation2022; Lanzolla et al. Citation2020).

Our second contribution depicts the characteristics of four digital technology categories across the novelty and breadth dimensions. The augmenting category involves a range of digital technologies primarily concerned with incremental changes for the automotive industry, which may be improved through automation or augmentation strategies (Lebovitz, Lifshitz-Assaf, and Levina Citation2022; Raisch and Krakowski Citation2021). Indeed, this study highlights several digital technologies that have renewed the established technological path for the automotive industry. Incumbents like Scania and Volvo have become integrators of new digital technologies (i.e. vehicle handling, perception sensors) initiated by suppliers from the same sector (i.e. Autoliv, Veoneer) and new industry entrants (e.g. Ericsson, Husqvarna). Thus, this study also reveals that digital transformation within the automotive industry mainly arises via an incremental creative accumulation process (Bergek et al. Citation2013) enabled by the development of new digital capabilities and the extension of component-related knowledge.

Moreover, our analysis of the spanning category reveals that digital technology topics with incremental characteristics emerge from multiple sectors, such as the electricity, construction, and mechanical engineering sectors, confirming that most digital technology topics are based on combinations of inventions from different sectors (Brynjolfsson and McAfee Citation2014; Teece Citation2018). This observation adds to the literature on technology recombination, which suggests that digital technologies have the potential to renew multiple industries because of their generative characteristics (Nambisan, Wright, and Feldman Citation2019; Kallinikos, Aaltonen, and Marton Citation2013; Thomas and Tee Citation2022). While our findings confirm previous research suggesting that digital technologies have integrative capabilities that enable the transformation of multiple industries (Lee and Berente Citation2012), it diverges from previous research investigating digital technologies as purely complementary to products or services in other industries, such as drug discovery (Dougherty and Danielle Citation2012) and navigation systems (Svahn, Mathiassen, and Lindgren Citation2017). Our findings highlight that digital technologies in the spanning category lead to the emergence of new actors and industry segments and potentially have a significant economic impact on industry digitalisation.

While the above categories for industry digitalisation appeared the most in the data upon which this study draws, unexpectedly, only one digital technology topic was identified as radical and implementable across multiple sectors – namely, the disrupting category. This category has the potential to transform both the automotive industry and several other industries, such as the telecommunications and manufacturing industries as well as have impact on city planning, transport regulation and policy and user acceptance. Until now, the established view from earlier studies shows that technological change is rare and emerges from radical innovations, such as digital imaging (Benner and Tripsas Citation2012) or semiconductors (Dosi Citation1984). Our finding extends the innovation literature and the assumption that digital transformation involves industrial disruption by definition (Kwon et al. Citation2019; Bodrožić and Adler Citation2022). We show that these kinds of digital technologies have the potential to generate a significant social and economic impact without disrupting an entire industry but instead transforming it. Moreover, we suggest that digital transformation may emerge from technologies spanning several sectors, enabling industry digitalisation across multiple technology domains.

The most notable conclusion from this study calls for slowing down our expectations on disruption emerging from radical digital technologies unless these are combined with corresponding but less radical technologies that enable full implementation across a range of applications. A concrete illustration of such a combinatorial application in the automotive industry is collision prevention assistance technology, which has the potential to change the entire automotive industry and also generate disruption in several related industries – basically everything related to mobility transformation. Realising such potential, however, requires a wide range of incremental improvements in related technologies, such as navigation, speed control, and road friction analysis. This insight illustrates the value of moving beyond studying digital transformation as the result of the successful implementation of revolutionary technologies to instead studying it as the result of interdependent clusters of both radical and incremental technological advancements (Bailey et al. Citation2022).

In addition to calibrating our expectations on radical innovations emerging from the infusion of digital technologies in mature industries, such as the automotive industry, this study also provides insights applicable to innovation policy. Increased awareness of the required combinations of technologies of varying novelty and breadth that feed into related and interdependent applications and sectors calls for broader innovation policy. Instead of focusing resources on directly promoting specific technologies or industries (Grillitsch et al. Citation2019), which has been a trend in innovation policy for several decades, it is worth considering more general policy measures, not directly targeting predefined sectors. The rationale for such a call is the unpredictable nature of digital innovation and link between technology progress and industry application, and thereby the highly limited ability for policy to predict what application will eventually generate innovative outcome. Such a more general approach would thus enable both identified technologies and applications and not yet identified technologies and applications to take advantage, thereby avoiding the risk of dominant regimes or actors capturing policy measures which would generate better value when targeting alternative actors.

Finally, we also want to emphasise the ample opportunities provided by digital social research methods for management scholars (George et al. Citation2016; Hannigan et al. Citation2019). For example, to further develop and validate our integrating digital transformation framework, scholars could extend the findings presented in this paper to perform multi-industry analyses that explain the types of knowledge combinations that lead to radical digital technologies (disruptive category) or the digital technologies that have a wide scope of application (spanning category). Furthermore, intuitively, a multi-industry analysis that includes other sources of information, such as industry reports and other forms of text (Grimmer, Roberts, and Stewart Citation2022), might help provide more granularity and variety across digital technology topics. This research design strategy is particularly important for understanding the characteristics of the transformative and disruptive categories. As suggested by Salganik (Citation2018), only further application of computational research methods and information can help improve understanding of phenomena (i.e. digital transformation and industry digitalisation in our case).

Further research using patent data could also enable researchers to study what personal characteristics of inventors lead to digital technologies of different novelty and breadth. Such a study could untangle the breadth and depth of knowledge expertise needed to create, for example, multi-industry digital technologies. This type of analysis could help respond to questions about industry path dependency (Benner and Tripsas Citation2012) and knowledge search heuristics (Lopez-Vega, Tell, and Vanhaverbeke Citation2016). Another promising research area is studying how digital technologies and what innovation strategies trigger firms, universities, government institutions, and society at large to work towards tackling societal development goals and grand challenges and ultimately shape industry transformation in developed and emerging markets (Lopez-Vega and Lakemond Citation2022). Only by doing so will we be able to explain the benefits of emerging digital technologies and the opportunities provided by the digital age.

Acknowledgement

The authors would like to thank the special issue co-editor Prof. Riccardo Leoncini and the anonymous reviewers for their guidance and insightful comments. Also, the authors would like to express their gratitude to the survey respondents and research assistants of the www.organizationalrenewal.com project. Responsibility for the information and views expressed in the article lies entirely with the authors.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This study was supported by Marianne and Marcus Wallenberg Foundation. MMW 2016.0014

Notes

References

  • Anand, B. 2016. The Content Trap: A Strategist’s Guide to Digital Change. New York: United States Random House Publishers.
  • Antons, D., E. Grünwald, P. Cichy, and T. Oliver Salge. 2020. “The Application of Text Mining Methods in Innovation Research: Current State, Evolution Patterns, and Development Priorities.” R&D Management 50 (3): 329–351. doi:10.1111/radm.12408.
  • Arthur, W. B. 2009. The Nature of Technology: What It is and How It Evolves. London, England: Penguin Books Limited.
  • Bailey, D. E., S. Faraj, P. J. Hinds, P. M. Leonardi, and G. von Krogh. 2022. “We are All Theorists of Technology Now: A Relational Perspective on Emerging Technology and Organizing.” Organization Science 33 (1): 1–18. doi:10.1287/orsc.2021.1562.
  • Benner, M. J., and M. Tripsas. 2012. “The Influence of Prior Industry Affiliation on Framing in Nascent Industries: The Evolution of Digital Cameras.” Strategic Management Journal 33 (3): 277–302. doi:10.1002/smj.950.
  • Bergek, A., C. Berggren, T. Magnusson, and M. Hobday. 2013. “Technological Discontinuities and the Challenge for Incumbent Firms: Destruction, Disruption or Creative Accumulation?” Research Policy 42 (6): 1210–1224.
  • Blei, D. M., A. Y. Ng, and M. I. Jordan. 2003. “Latent Dirichlet Allocation.” Journal of Machine Learning Research : JMLR 3: 993–1022.
  • Bodrožić, Z., and P. S. Adler. 2022. “Alternative Futures for the Digital Transformation: A Macro-Level Schumpeterian Perspective.” Organization Science 33 (1): 105–125. doi:10.1287/orsc.2021.1558.
  • Brynjolfsson, E., and A. McAfee. 2014. The Second Machine Age: Work, Progress, and Prosperity in a Time of Brilliant Technologies. New York: W. W. Norton.
  • Carlsson, B., and R. Stankiewics. 1991. “On the Nature, Function and Composition of Technological Systems.” Journal of Evolutionary Economics 1 (2): 93–118. doi:10.1007/BF01224915.
  • Christensen, C. M., R. McDonald, E. J. Altman, and J. E. Palmer. 2018. “Disruptive Innovation: An Intellectual History and Directions for Future Research.” Journal of Management Studies 55 (7): 1043–1078. doi:10.1111/joms.12349.
  • Cirillo, V., L. Fanti, A. Mina, and A. Ricci. 2022. “New Digital Technologies and Firm Performance in the Italian Economy.” Industry and Innovation 1–30. doi:10.1080/13662716.2022.2055999.
  • Coccia, M. 2015. “General Sources of General Purpose Technologies in Complex Societies: Theory of Global Leadership-Driven Innovation, Warfare and Human Development.” Technology in Society 42: 199–226. doi:10.1016/j.techsoc.2015.05.008.
  • Coccia, M. 2017. “Sources of Disruptive Technologies for Industrial Change.” L’industria 38 (1): 97–120.
  • Coccia, M. 2020. “Deep Learning Technology for Improving Cancer Care in Society: New Directions in Cancer Imaging Driven by Artificial Intelligence.” Technology in Society 60: 101198. doi:10.1016/j.techsoc.2019.101198.
  • Coccia, M., and J. Watts. 2020. “A Theory of the Evolution of Technology: Technological Parasitism and the Implications for Innovation Magement.” Journal of Engineering and Technology Management 55: 101552. doi:10.1016/j.jengtecman.2019.11.003.
  • Dąbrowska, J., A. Almpanopoulou, A. Brem, H. Chesbrough, V. Cucino, A. Di Minin, F. Giones, et al. 2022. ”Digital Transformation, for Better or Worse: A Critical Multi-Level Research Agenda.” R&D Management 52 (5): 930–954. doi:10.1111/radm.12531.
  • Daniele, R., D. Hicks, and B. R. Martin. 2015. “What is an Emerging Technology?” Research Policy 44 (10): 1827–1843.
  • Dosi, G. 1984. Technical Change and Industrial Transformation: The Theory and an Application to the Semiconductor Industry. London: Palgrave Macmillan UK.
  • Dougherty, D., and D. D. Danielle. 2012. “Digital Science and Knowledge Boundaries in Complex Innovation.” Organization Science 23 (5): 1467–1484. doi:10.1287/orsc.1110.0700.
  • Eggers, J. P., and K. Francis Park. 2018. “Incumbent Adaptation to Technological Change: The Past, Present, and Future of Research on Heterogeneous Incumbent Response.” The Academy of Management Annals 12 (1): 357–389. doi:10.5465/annals.2016.0051.
  • Eklund, J., and R. Kapoor. 2019. “Pursuing the New While Sustaining the Current: Incumbent Strategies and Firm Value During the Nascent Period of Industry Change.” Organization Science 30 (2): 383–404. doi:10.1287/orsc.2018.1229.
  • Gawer, A. 2021. “Digital Platforms’ Boundaries: The Interplay of Firm Scope, Platform Sides, and Digital Interfaces.” Long range planning 54 (5): 102045. doi:10.1016/j.lrp.2020.102045.
  • George, G., E. C. Osinga, D. Lavie, and B. A. Scott. 2016. “Big Data and Data Science Methods for Management Research.” Academy of Management Journal 59 (5): 1493–1507. doi:10.5465/amj.2016.4005.
  • Grillitsch, M., T. Hansen, L. Coenen, J. Miörner, and J. Moodysson. 2019. “Innovation Policy for System-Wide Transformation: The Case of Strategic Innovation Programmes (SIPs) in Sweden.” Research Policy 48 (4): 1048–1061. doi:10.1016/j.respol.2018.10.004.
  • Grimmer, J., M. E. Roberts, and B. M. Stewart. 2022. Text as Data: A New Framework for Machine Learning and the Social Sciences. New Jersey: Princeton University Press.
  • Hanelt, A., R. Bohnsack, D. Marz, and C. Antunes Marante. 2021. “A Systematic Review of the Literature on Digital Transformation: Insights and Implications for Strategy and Organizational Change.” Journal of Management Studies 58 (5): 1159–1197. doi:10.1111/joms.12639.
  • Hannigan, T. R., R. F. J. Haans, K. Vakili, H. Tchalian, V. L. Glaser, M. Shaoqing Wang, S. Kaplan, and P. Devereaux Jennings. 2019. “Topic Modeling in Management Research: Rendering New Theory from Textual Data.” The Academy of Management Annals 13 (2): 586–632. doi:10.5465/annals.2017.0099.
  • Iansiti, M., and K. R. Lakhani. 2020. Competing in the Age of AI: Strategy and Leadership When Algorithms and Networks Run the World. Boston, MA: Harvard Business Review Press.
  • Kallinikos, J., A. Aaltonen, and A. Marton. 2013. “The Ambivalent Ontology of Digital Artifacts.” MIS Quarterly 37 (2): 357–370. doi:10.25300/MISQ/2013/37.2.02.
  • Kapoor, R., and T. Klueter. 2021. “Unbundling and Managing Uncertainty Surrounding Emerging Technologies.” Strategy Science 6 (1): 62–74. doi:10.1287/stsc.2020.0118.
  • Kneeland, M., M. Schilling, and S. A. Barak. 2020. “Exploring Uncharted Territory: Knowledge Search Processes in the Origination of Outlier Innovation.” Organization Science 31 (3): 535–557. doi:10.1287/orsc.2019.1328.
  • Kwon, Seokbeom, X. Liu, A. L. Porter, and J. Youtie. 2019. “Research Addressing Emerging Technological Ideas Has Greater Scientific Impact.” Research Policy 48 (9): 103834. doi:10.1016/j.respol.2019.103834.
  • Lanzolla, Gianvito, A. Lorenz, E. Miron-Spektor, M. Schilling, G. Solinas, Tucci, C. L. 2020. “Digital Transformation: What is New if Anything? Emerging Patterns and Management Research.” Academy of Management Discoveries 6 (3): 341–350.
  • Lebovitz, S., H. Lifshitz-Assaf, and N. Levina. 2022. “To Engage or Not to Engage with AI for Critical Judgments: How Professionals Deal with Opacity When Using AI for Medical Diagnosis.” Organization Science 33 (1): 126–148. doi:10.1287/orsc.2021.1549.
  • Lee, J., and N. Berente. 2012. “Digital Innovation and the Division of Innovative Labor: Digital Controls in the Automotive Industry.” Organization Science 23 (5): 1428–1447. doi:10.1287/orsc.1110.0707.
  • Lopez-Vega, H., and N. Lakemond. 2022. “Tapping into Emerging Markets: EMNEs’ Strategies for Innovation Capability Building.” Global Strategy Journal 12 (2): 394–417. doi:10.1002/gsj.1429.
  • Lopez-Vega, H., F. Tell, and W. Vanhaverbeke. 2016. “Where and How to Search? Search Paths in Open Innovation.” Research Policy 45 (1): 125–136.
  • Malerba, F. 1992. “Learning by Firms and Incremental Technical Change.” The Economic Journal 102 (413): 845–859. doi:10.2307/2234581.
  • Martynov, A. 2020. “Technological Footprint Similarity and Firm Performance.” Strategic Organization:1476127019898191.
  • McAfee, A., and E. Brynjolfsson. 2017. Machine, Platform, Crowd: Harnessing Our Digital Future. W. W. Norton.
  • McKenny, A. F., H. Aguinis, J. C. Short, and A. H. Anglin. 2018. “What Doesn’t Get Measured Does Exist: Improving the Accuracy of Computer-Aided Text Analysis.” Journal of Management 44 (7): 2909–2933. doi:10.1177/0149206316657594.
  • Menz, M., S. Kunisch, J. Birkinshaw, D. J. Collis, N. J. Foss, R. E. Hoskisson, and J. E. Prescott. 2021. “Corporate Strategy and the Theory of the Firm in the Digital Age.” Journal of Management Studies 58 (7): 1695–1720. doi:10.1111/joms.12760.
  • Nambisan, S., K. Lyytinen, A. Majchrzak, and M. Song. 2017. “Digital Innovation Management: Reinventing Innovation Management Research in a Digital World.” MIS Quarterly 41 (1): 223–238. doi:10.25300/MISQ/2017/41:1.03.
  • Nambisan, S., M. Wright, and M. Feldman. 2019. “The Digital Transformation of Innovation and Entrepreneurship: Progress, Challenges and Key Themes.” Research Policy 48 (8): 103773. doi:10.1016/j.respol.2019.03.018.
  • Raisch, S., and S. Krakowski. 2021. “Artificial Intelligence and Management: The Automation–Augmentation Paradox.” Academy of Management Review 46 (1): 192–210. doi:10.5465/amr.2018.0072.
  • Röder, M., A. Both, and A. Hinneburg. 2015. “Exploring the Space of Topic Coherence Measures.” In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, 399–408. Shanghai, China: Association for Computing Machinery.
  • Salganik, M. J. 2018. Bit by Bit: Social Research in the Digital Age. New Jersey: Princeton University Press.
  • Sjöö, K., J. Taalbi, A. Kander, and J. Ljungberg. 2014. “SWINNO: A Database of Swedish Innovations, 1970-2007.” In edited by no. 133 Lund Papers in Economic History. General Issues. Department of Economic History, Lund University.
  • Stevens, K., P. Kegelmeyer, D. Andrzejewski, and D. Buttler. 2012. Exploring Topic Coherence Over Many Models and Many Topics. Paper presented at the Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning. Jeju Island, Korea. Association for Computational Linguistics.
  • Svahn, F., L. Mathiassen, and R. Lindgren. 2017. “Embracing Digital Innovation in Incumbent Firms: How Volvo Cars Managed Competing Concerns.” MIS Quarterly 41 (1): 239–254. doi:10.25300/MISQ/2017/41.1.12.
  • Teece, D. J. 2018. “Profiting from Innovation in the Digital Economy: Enabling Technologies, Standards, and Licensing Models in the Wireless World.” Research Policy 47 (8): 1367–1387. doi:10.1016/j.respol.2017.01.015.
  • Thomas, L. D. W., and R. Tee. 2022. “Generativity: A Systematic Review and Conceptual Framework.” International Journal of Management Reviews 24 (2): 255–278. doi:10.1111/ijmr.12277.
  • Utterback, J. M., C. Pistorius, and E. Yilmaz. 2018. “The Dynamics of Competition and of the Diffusion of Innovations.“ MIT Sloan School Working Paper 5519: 18. http://hdl.handle.net/1721.1/117544
  • Vial, G. 2019. “Understanding Digital Transformation: A Review and a Research Agenda.” The Journal of Strategic Information Systems 28 (2): 118–144. doi:10.1016/j.jsis.2019.01.003.
  • von Krogh, G. 2018. “Artificial Intelligence in Organizations: New Opportunities for Phenomenon-Based Theorizing.” Academy of Management Discoveries 4 (4): 404–409. doi:10.5465/amd.2018.0084.
  • Warner, K. S. R., and M. Wäger. 2019. “Building Dynamic Capabilities for Digital Transformation: An Ongoing Process of Strategic Renewal.” Long range planning 52 (3): 326–349. doi:10.1016/j.lrp.2018.12.001.
  • Yoo, Y. J., J. B. Richard, L. Kalle, and M. Ann. 2012. “Organizing for Innovation in the Digitized World.” Organization Science 23 (5): 1398–1408. doi:10.1287/orsc.1120.0771.

Appendices Appendix 1.

Topic Modelling Procedure

Appendix 2.

Dataframe, Corpus, and Topic Modelling

The first part of the data analysis required the patents to be in text form (i.e. to be in English and transferred from a PDF to a text file [.txt]). That means that the starting point or prerequisite for creating our corpus of all 455 relevant patents was that they had to be in a.txt file format, which is computer readable and thus easy to import into Python. We created these.txt files and included all information from the patents from the abstracts to the claims. To work with the patent data, we imported each text file into Python by transferring the data and storing it in a pandas dataframe. PandasFootnote4 is a common open-source Python library that offers easy-to-use high-performance data structures and data analysis tools for the Python programming language. Within the dataframe, which one can imagine as a matrix, we had 455 rows and two columns— one containing the patent IDs and one containing the respective text content of each patent. The text content was stored as a so-called string, which is the text representation within the Python environment. Once all of the patents were imported into Python and stored in a computer-readable format, pandas allowed us to easily work with the data.

To serve as a good input in the subsequent topic modelling and to enable the generation of expressive topics, the text data needed to be processed. This procedure started with converting the text of each patent into lowercase, ensuring that we treated all words the same and avoided subsequent specific factor errors (McKenny et al. Citation2018). Afterwards, we used regular expressionsFootnote5 (RE) to specify a set of strings that matched the REs, allowing us to coarsely delete strings without any meaning for our purpose. In our case, the specification was that the strings should only contain alphabetic letters from A to Z and be at least two characters long, which meant that all unnecessary numbering, brackets, and fill words, which are numerous in patent text, were deleted.

Having pre-cleaned the patent text, we proceeded by tokenising the strings, which, until that moment, represented one patent each. Tokenising means that we broke down large strings containing multiple sentences into individual words separated by commas, which we then stored individually as strings. This step represented a crucial prerequisite for the actual topic modelling described below and allowed us to further clean the text. The next step was to eliminate all words that had no inherent meaning, also called stopwords, which was done by setting up a stopword list and then deleting all words in this list from the patent text. To find relevant stopwords, we started by using the pre-defined stopword list provided by nltk to cover general stopwords like ‘the’, ‘and’, or ‘of’. We then manually extended this list with other stopwords that appeared to have no direct meaning or were too general like ‘figure’, ‘claim’, or ‘illustrate’ as almost all patents contained the terms ‘patent claim’ and ‘figure’, but these terms did not convey any meaning related to autonomous vehicle technologies.

The final step in creating the corpus was to lemmatise the words. This means that the words were transformed to their most basic form, which in turn reduced the number of words with the same meaning. In other words, we tried to reduce the number of distinct words with the same meaning – for example, plurals (reduce ‘vehicles’ to ‘vehicle’). We began this procedure by tagging every word in the corpus according to its position in the text, which was computed using the nltk.pos_tag() functionFootnote6 This function took in every word and added a part-of-speech tag (pos_tag) to it, which specified some property of the word or token in the text, such as noun, verb, adjective, etc. This extra information about the context of the words allowed for better lemmatisation, which we performed using another part of the nltk library, the WordNetLemmatizerFootnote7 This module finally executed the lemmatisation by taking into account the pos_tag of the words and thus performed the last step in creating the corpus. The resulting text data was still stored in the pandas dataframe, which was then saved on the computer’s hard drive. This step was done for two reasons: first, to optimise computing power and time by decoupling the time-intense process of generating the text corpus from the topic modelling and data analysis and, second, to have a version history of the dataframe, which in turn assisted in assessing what corpus was better suited for our purpose by allowing a comparison of two corpuses (one for training and one for validation).

The second part of the topic modelling involved generating the topics using gensimFootnote8, an open- source library that offers text-analysis tools. We employed LDA, which is the most commonly used topic modelling method and represents a generative probabilistic model for unsupervised text-corpus analysis (Blei, Ng, and Jordan Citation2003). This method has also been employed by different innovation management scholars (von Krogh Citation2018; Hannigan et al. Citation2019). First, we loaded the saved dataframe into Python and prepared it for the topic-modelling algorithm. This step was necessary as the algorithm had certain input parameters that needed to be matched exactly to be functionable. First, we created a dictionary and a bag of words (BoW) out of the loaded dataframe using gensim. The dictionary was created using the gensim.corpora.Dictionary object, which, in essence, provided a representation of the distinct words that made up the whole corpus and further contained the word frequencies, or the number of occurrences of a certain word within the corpus. The raw dictionary contained 16,741 distinct words, which were represented through a number (the token_id) and ultimately made up the corpus of 1,579,447 words. These distinct words comprised words that occurred in only one document, such as ‘nutrient’, as well as words that occurred in almost all documents, such as ‘vehicle’ or ‘unit’. As the words that occurred in only very few documents either contributed nothing to cross-document topics or were corrupted, we filtered the dictionary, excluding all words that occurred in fewer than five documents, eventually leaving the dictionary with 4,651 distinct entries. We manually checked that the excluded words did not convey any relevant meaning about autonomous vehicle technologies. The BoW was created using gensim’s doc2bow function, which contained the information about the words in each document with their individual word frequencies within this document.

To derive to our final topic model, we iteratively generated the corpus and the topic model and assessed the resulting topics, which, in each iteration, resulted in either new stopwords that then had to be deleted from the corpus or adapted parameters for the topic-modelling algorithm. This procedure, especially the assessment of the resulting topics, was supplemented by two intrinsic performance indicators: the perplexity and coherence score of the model. The perplexity, measured as the normalised log probability of a retained test set, describes how surprised a model is by new data it has never seen before or, in other words, how well the model can represent or reproduce the statistics of the retained data. The coherence score measures the semantic similarity of the most probable words within one topic and thus represents a measure for human interpretability of the topics (Röder, Both, and Hinneburg Citation2015). In our case, the perplexity is -7.08, and the coherence score is 0.43, which represent acceptable values that are on the lower end of what is assumed to be ‘good’ according to (Stevens et al. Citation2012).

The dictionary and BoW were then the prerequisites and basic inputs for the topic-modelling algorithm. The algorithm had a multitude of parameters to specify exactly how it performed the modelling, which were largely kept at their default values, with changes only made to the following: num_topics = 30, random_state = 100, chunksize = 60, passes = 10, iterations = 500, per_word_topics = True. The most important and influential change was the number of topics (num_topics) that the algorithm should generate, which in our case was 30. The practitioners noticed that Topics 27 to 30 did not provide any meaningful insights, so these topics were excluded from our analysis.

Appendix 3:

Wordclouds for Topic 1 to Topic 26

Appendix 4:

Topic and cluster development over time

Appendix 5:

Technology Classification of Topics

Appendix 6:

Patent Distribution according to CPC codes