2,810
Views
1
CrossRef citations to date
0
Altmetric
Articles

Expendable to essential? Changing perceptions of gig workers on Twitter in the onset of COVID-19

ORCID Icon, ORCID Icon, , ORCID Icon & ORCID Icon
Pages 634-653 | Received 01 Oct 2021, Accepted 12 Dec 2021, Published online: 31 Dec 2021

ABSTRACT

While ride-hailing ridership declined in 2020 due to COVID-19 induced restrictions like stay-at-home orders, food/grocery delivery services became quasi-essential. This study investigates if and how public perceptions of gig work related to platform-based ride-hailing and food/grocery delivery services changed during the early stages of the pandemic. We collected a sample of 23,845 Twitter posts (‘tweets’) related to these platform-based services within two-week periods before and after the US COVID-19 emergency declaration. Sentiment analysis on tweets was conducted to investigate changes in public perception of gig work. Tweet content was analyzed by descriptively coding about 10% of the sample of tweets manually along ten different dimensions (e.g., personal experience, informative, and about driver); then we used thematic analysis to gain an understanding about the public’s views towards gig work/workers. We tested supervised machine learning methods to explore their potential to classify the rest of the sample along the ten descriptive dimensions. The number of tweets increased by approximately 150% after the emergency declaration and became more positive in sentiment. Qualitative results indicate that tweets about negative personal experiences with drivers/companies decreased during COVID-19, while tweets exhibiting a sense of community (e.g., sharing information) and concern towards gig workers increased. Findings can inform policy and workforce changes regarding platform-based service companies.

Introduction

COVID-19 has resulted in a range of impacts across most US workforce segments. Understanding early pandemic impacts may yield insights that can help companies and workers to successfully navigate the continuing pandemic and future work. This study utilizes an exploratory approach to examine one segment of the workforce – gig workers – to determine whether, and if so how, public perceptions of gig work related to platform-based ride-hailing and food/grocery delivery services changed during the early stages of the pandemic. After a brief review of gig work, COVID-19 impacts on public perception of gig work, the role of public opinion on policymaking, and use of social media to study opinion, we discuss the data collection and analysis methods. Study results are presented. We conclude with a detailed discussion of our methods, findings, and study contributions.

Gig work

The flexibility of ‘gig work’ has attracted a large number of workers in recent years. Gig workers operate in less structured labor arrangements typically characterized by flexible working conditions (e.g., choice of working hours) and lack of job security and other employment benefits (e.g., paid sick leave) as a result of their lack of formal employer-employee relationship (Abraham et al., Citation2018). There is no universally accepted definition of gig work among researchers; however, it is most commonly used to refer to emerging ‘electronically mediated work’ or ‘online platform work’ (Current Population Survey staff, Citation2018). Gig workers use a company’s electronic platform (e.g., smartphone app) to connect to customers and obtain short tasks for which they get paid. This arrangement provides them with the flexibility to choose when and whether they want to work.

About 10.1% of US workers had ‘alternative employment arrangements’ in 2017, with about 1.0% making income through online platform work (Current Population Survey staff, Citation2018). The US transportation and utilities industry saw approximately a 50% increase in independent contractors between 2005 and 2017 (Abraham & Houseman, Citation2020), with this increase primarily attributed to the emergence of platform-based ride-hailing companies like Uber and Lyft (Paquette & Long, Citation2018).

Ride-hailing services are a form of shared mobility that features passengers connecting with independently working drivers (‘gig drivers’) using online platforms, typically smartphone apps, provided by platform-based ride-hailing companies, also known as transportation network companies (TNCs) (Shaheen et al., Citation2015). Gig drivers use their personal vehicle or arrange for a vehicle themselves to provide services. Similar to ride-hailing, platform-based food/grocery delivery services offered by delivery network companies (DNCs) like Grubhub and Instacart connect customers with gig drivers to get restaurant food or grocery items delivered to a prespecified location (Shaheen et al., Citation2015). In some cases, gig drivers also shop for grocery items in-store on behalf of customers. Some gig drivers hold both ride-hailing and food/grocery delivery jobs (Farrell et al., Citation2018). The present study focuses on these transportation sector platform-based gig workers – gig drivers using online platforms provided by TNCs and/or DNCs (T/DNCs) to connect with paying customers.

Lower and unstable income and lack of employment benefits are concerns for T/DNC drivers (Waheed et al., Citation2018). In 2014, Uber estimated that the median annual wage for drivers who work at least 40 hours a week is more than $90,000 in New York City and $74,000 in San Francisco (McFarland, Citation2014). However, most drivers don’t earn this amount (Farrell et al., Citation2018). Gig drivers bear fuel, insurance, maintenance, and depreciation vehicle costs (Henao & Marshall, Citation2019). Mean net earnings for TNC drivers after including expenses ranges from $5.72 to $10.46 per hour (Henao & Marshall, Citation2019). Average monthly earnings for T/DNC drivers fell by more than 50% and the portion of total income from driving decreased from 80% to 60% between 2013 ($1,469) and 2017 ($783) (Farrell et al., Citation2018). This decline is likely due to a reduction in driving hours caused by increasing driver competition. Several factors, such as consumer tips and surge pricing during peak hours, contribute to varying hourly wages for gig drivers.

Gig drivers, as independent contractors, are not entitled to employment benefits like health and unemployment insurance (Waheed et al., Citation2018). Labor advocates are pressuring platform-based companies to provide basic minimum wages and healthcare (Beckman et al., Citation2021). However, gig economy executives contend that this would put a significant financial strain on companies and decrease job flexibility (Khosrowshahi, Citation2020b).

Several T/DNCs have formed corporate alliances and lobbying groups to influence regulations in favor of maintaining the current employment status of gig drivers, raising questions from labor rights groups on the implications of such lobbying for US contingent workers (Pinto et al., Citation2019). The State of California’s Assembly Bill No. 5 that reclassified several gig workers from independent contractors to employees (California Assembly Bill No. 5, Citation2019) and the subsequent Proposition 22 that reverted their employment status to independent contractors is a prime example of TNC lobbying (Ballotpedia, Citationn.d.).

Impacts of COVID-19

The onset of the COVID-19 pandemic prompted federal and private changes in gig workers’ conditions. The US President declared COVID-19 a national emergency on 13 March 2020 (Proclamation No. 9994, Citation2020). Gig drivers faced exacerbated risks during COVID-19 due to COVID-19 exposure, as well as lack of benefits and lack of bargaining power to negotiate (Purkayastha et al., Citation2021). Some platform-based companies like Uber appealed to the government to provide their independent workers with economic aid and other benefits (Khosrowshahi, Citation2020a). On 27 March 2020, federal unemployment benefits were expanded to independent contractors through the Coronavirus Aid, Relief, and Economic Security Act (H.R.748 - CARES Act, Citation2020). State governments adapted unemployment policies relevant to gig workers in a disjointed fashion (Smith, Citation2020), resulting in no consistent national level gig worker aid eligibility (Lobel, Citation2020). Companies like Uber and Lyft offered paid sick leave to drivers during the 2020 US first shutdown after pressure from the public and policymakers (Katta et al., Citation2020; Lobel, Citation2020). Some companies also provided drivers with personal protective equipment (e.g., masks or hand sanitizer) for free or through company storefronts (Tripathi, Citation2020).

While TNCs attempted to persuade potential riders to use their services by mandating sanitation requirements (e.g., masks and hand sanitizer), this sometimes came at the expense of the drivers (Lobel, Citation2020). This may have worsened the pandemic economic situation of some drivers, as about 21% of drivers had no health insurance, nearly 15% relied on public assistance programs, and about 53% relied entirely on gig work for their income in the previous month (Benner et al., Citation2020).

Ride-hailing drivers (included under the ‘Taxi and Limousine Service’ industry) were listed as essential workers under recommended Vaccination Phase 1c (US Centers for Disease Control and Prevention, Citation2021). However, Phase 1c began two to three months after Phase 1a in most US states (Ballotpedia, Citation2021). Since gig drivers, like other transportation industry workers, have low telework ability (Dingel & Neiman, Citation2020) and they receive no or limited employment benefits, the delay in vaccine rollout may have prolonged their economic- and health-risks (Baker, Citation2020).

From an industry perspective, TNCs and DNCs experienced different impacts during the COVID-19 pandemic. Demand for ride-hailing services plummeted by almost 60-70% globally during early stages of the pandemic (Heineke et al., Citation2020). Conversely, food and grocery delivery services became quasi-essential, leading to unprecedented spikes in demand. Instacart added 300,000 new gig workers in late March of 2020 (Instacart, Citation2020). More than 55% of consumers that used online delivery services in one area of Portland, Washington metropolitan area reported ordering more deliveries during the pandemic (Unnikrishnan & Figliozzi, Citation2020). The increasing food/grocery delivery demand attracted a large number of people to work as gig drivers for DNCs (AppJobs Institute, Citation2020); many people turned to DNC gig work to address their immediate financial needs rather than waiting for government assistance (Ravenelle et al., Citation2021).

DNC gig drivers entered a unique position where their services provided users additional protection against COVID-19 exposure. However, drivers subsumed those exposure risks as they interacted with restaurant and grocery staff, as well as customers, until ‘no-contact deliveries’ became widely adopted (Tripathi, Citation2020). Part of what this study seeks to address is if the public noticed such changes in the working conditions of gig drivers and whether it changed their perception of gig drivers.

Public opinion and policy

Public perception and opinion can impact public policy (Page & Shapiro, Citation1983) as well as industry policy (Spears, Citation2021). Burstein (Citation2003) found that public opinion affected public policy nearly three-quarters of the time and that higher issue salience has an amplifying effect of public opinion on policy. Opinion-policy congruence exists to some degree in the US and is especially strong for high-salience issues (Manza et al., Citation2002). A study on the 2014 Ebola epidemic reported significant linkages between higher issue salience and altruistic behavioral intentions during a global health crisis (Yang, Citation2016). COVID-19 has led to heightened salience among the public of increased risk and altered working conditions across labor types, especially essential workers such as gig drivers (Mejia et al., Citation2021; Tripathi, Citation2020).

Legislators are responsive to issues discussed by the public on platforms like Twitter as it enables them to track public opinion and priorities (Barberá et al., Citation2019). For platform-based gig work, the case of California Proposition 22 demonstrates the importance of public opinion as well as corporate lobbying in influencing public policy. Further, consumers’ perceptions of company policies (e.g., labor practices) can affect their spending behavior and have economic ramifications for companies, as observed in T/DNCs (Belanche et al., Citation2021).

Social media

Social media websites generate immense amounts of diverse content pertaining to perception and opinions from a large population (Webb & Wang, Citation2013). While the efficacy of social media data in understanding public opinion is still open for debate (Edwards et al., Citation2013; Schober et al., Citation2016), it has changed how opinions are formed as the proportion of the US population seeking news on social media increased from 27% in 2013 to 48% in 2020, while TV and print news steadily declined (Newman et al., Citation2020). Accordingly, politicians strategically address their policy preferences to their constituents through social media platforms like Twitter (Hemphill et al., Citation2021). These platforms can provide an economical and efficient tool to gauge public opinion and its potential impact on public policy (Ceron & Negri, Citation2016); and social media data provide flexibility to analyze opinions at one point in time as well as their temporal evolution. Using emergent computational methods, such as machine learning (ML), allows researchers to draw insights from large volumes of social media data (Su et al., Citation2017). Conclusions drawn from social media analyses can align with those from survey data in terms of topic coverage, despite not having population coverage (Schober et al., Citation2016).

Twitter, a microblogging social media platform, is being increasingly used to assess public sentiment and perception of social issues, especially using content analysis methods (Oscar et al., Citation2017; Xiang et al., Citation2021). As of July 2021, Twitter had an estimated 77 million active users in the US (DataReportal, Citation2021). Twitter users are younger, more educated, wealthier, and more likely to be liberal – thus, not representative of the general US population (Wojcik & Hughes, Citation2019). However, opinion insights from social media can align with results from representative surveys of opinions through topic coverage that includes breadth and depth of issues (Schober et al., Citation2016).

COVID-19 research has utilized Twitter data to explore the nature of discussions of public health (Xiang et al., Citation2021), policy responses (Tsai & Wang, Citation2021), information-sharing (Saud et al., Citation2020), and labor conditions (Dubey & Tripathi, Citation2020). However, the coalescence of these topics on Twitter has not been explored for gig drivers. A recent study examined survey data measuring ride-hailing service usage before and during the COVID-19 pandemic with changes in public sentiments, emotions, and remarks towards ride-hailing services on Twitter (Morshed et al., Citation2021). However, they did not analyze tweet content to gain deeper insights into underlying themes of public discourse on Twitter.

The current study presents an exploratory analysis to identify emergent patterns in public discourse surrounding platform-based gig drivers before COVID-19 was declared a national emergency in the US and during the early stages of the pandemic using Twitter data.

Materials and methods

Data collection

We analyzed social media posts from Twitter (tweets) within two-week time windows before and after COVID-19 was declared as a US national emergency on 13 March 2020 (pre-COVID: 28 February 2020–12 March 2020; during-COVID: 13 March 2020–26 March 2020). We acquired publicly available tweets originating in the US and containing at least one of the keywords listed in between 28 February 2020 and 26 March 2020 using Salesforce’s Social Studio REST API v51.0 (Salesforce, Citationn.d.-a).

Table 1. Keywords and suggestive emojis used for sampling Twitter data.

Social Studio flagged 1,091,428 tweets within our search parameters, distributed unequally between pre- and during-COVID time periods (see the Results section for more details). Previous literature on thematic analysis of Twitter data used an approximate temporally homogeneous distribution of tweets (Ahmed et al., Citation2017). Hence, we used the REST API to sample 450 tweets per hour from the flagged tweets (Salesforce, Citationn.d.-b) and acquired a total of 264,171 tweets, distributed approximately equally between pre- and during-COVID time periods. Tweet text and metadata were retrieved from their tweet ID using Twitter Standard API v1.1 (Twitter API, Citationn.d.).

Tweets were removed that: (1) were duplicates from retweets (i.e., users reposting identical content from another user), (2) contained the searched keywords only in the Twitter handle (username) of the user who posted the tweet, (3) were non-English language based on the identified language in the metadata provided by Twitter API, (4) contained any of the suggestive emojis (i.e., digital icons with a sexual connotation; Nexø & Strandell, Citation2020) illustrated in , and (5) were from outside the US or from unknown location (based on user-specified location in the metadata). Duplicates from retweets were removed to avoid a biased sample as we did not have data on other mechanisms (i.e., ‘likes’) through which Twitter users can voice their opinions. User-specified locations were parsed to identify the country using OpenStreetMap data implemented in Python’s geocoder v1.38.1 library (Carriere, Citationn.d.). illustrates the number and percentage of remaining tweets after each step of the data sampling and cleaning processes. The final sample included 23,845 tweets (about 9.0% of the temporally homogenous sample) within the time period of interest, with 11,312 tweets pre-COVID and 12,533 tweets during-COVID.

Figure 1. Twitter data sampling, cleaning, and subsetting.

Figure 1. Twitter data sampling, cleaning, and subsetting.

The present study utilizes both manual and automated (based on ML) content analysis of the textual components of tweets. We retained non-suggestive emojis in our final sample as they are considered a textual component of tweets. However, we did not utilize website links or multimedia data (e.g., images and videos) embedded in tweets due to software and model limitations as discussed in the following sections.

Data analysis

Sentiment analysis

Sentiment analysis is a form of content analysis that implements supervised ML methods to determine the underlying sentiment or opinion of textual data (Desai & Mehta, Citation2017). We use Salesforce’s proprietary Sentiment Model to classify tweets as either positive, negative, or neutral in sentiment (Salesforce, Citationn.d.-c). The Sentiment Model is tuned to incorporate the effect of emojis embedded in the tweet, but not multimedia, in its sentiment coding.

Manual coding

Manual coding occurred in two phases using a random sample of about 10% of the tweets from the main dataset (N = 2,286). Descriptive coding of the content of the tweets occurred in phase one (Wolcott, Citation1994). The randomly sampled tweets were coded across seven dimensions adapted from McNeil et al. (Citation2012) and Oscar et al. (Citation2017) to understand stigma and overall public perception surrounding gig work: personal experience, informative, advice seeking, opinion, joke, ridicule, and miscellaneous. We coded three additional dimensions: about drivers, about company, and not relevant. About drivers and about company assessed whether a tweet was directed towards gig drivers or towards T/DNCs. The not relevant dimension accounted for tweets not related to platform-based gig work (e.g., tweets using the keyword ‘uber’ in its semantic context). details these dimensions. A tweet could be classified into multiple dimensions, except for miscellaneous and not relevant which were exclusive dimensions. To establish inter-rater reliability, three researchers initially coded 125 tweets together to achieve a kappa value greater than 0.80. The coding from this round was discarded, and these tweets were shuffled back into the data. Each researcher then independently coded 806 tweets, with 66 common tweets among them used to verify a kappa value of greater than 0.80 (Cohen, Citation1960; Stemler & Tsai, Citation2008).

Table 2. Descriptions of topical coding dimensions.

During the second phase of manual coding (i.e., thematic analysis), a subsample of coded tweet data, excluding the tweets previously coded as not relevant, were analyzed. A single researcher reviewed the tweets multiple times to inductively develop larger themes related to public perception of platform-based gig drivers (Nowell et al., Citation2017).

Machine learning model

This study explores the efficacy of using ML methods to descriptively code the remainder of the data (N = 21,559) along the ten dimensions. We developed a supervised ML model for each of our ten topical dimensions by adapting Oscar et al.’s (Citation2017) model. We used Oscar et al. (Citation2017)’s model due to similarities in data source (i.e., Twitter) and the coded topical dimensions. Their results serve as a point of comparison for us to determine the performance of ML methods to expand manual coding results using our data. The supervised ML model was trained using 90% of 2,220 independently manually coded tweets (i.e., 2,286 manually coded tweets minus 66 commonly coded tweets). The remaining 10% of the tweets were used to test model performance. illustrates the data subsetting for the ML model. The ML model analyzes the textual component, ignoring associated multimedia, of a tweet to determine whether it belongs to a particular topical coding dimension independent of other dimensions. We enforced two additional rules to the model output to remain consistent with the dimension definitions (). First, tweets classified as not relevant by the model were exclusively considered not relevant even if they were assigned in other dimensions by the model. Second, to comply with the dimension description (see ), tweets classified as miscellaneous by the model were not considered miscellaneous if the model also assigned them any other dimension.

To determine the effectiveness and usability of the ML model, testing accuracy (i.e., the percentage of ML predictions matching the manual coding classification) for each dimension is compared with the corresponding simple majority classifier (SMC) (Oscar et al., Citation2017). SMC is the testing accuracy for a base model that predicts all instances as the classification with most instances. A higher testing accuracy compared to SMC is indicative of a more accurate classifier. We also calculated Cohen’s kappa for each dimension to assess the inter-rater reliability for providing a standardized machine performance assessment (Zhai et al., Citation2021).

Results

Descriptive analysis

The volume trend of flagged tweets (see ) shows a significant spike in the number of tweets following the COVID-19 national emergency declaration on 13 March 2020. On average, 22,896 and 56,716 tweets per day were flagged for pre-COVID and during-COVID periods, respectively, which corresponds to an almost 150% increase in volume. Platform-based gig work received increased public attention on Twitter during the early days of COVID-19.

Figure 2. Volume trend of tweets between 28 February 2020 and 26 March 2020 (N = 1,091,428).

Figure 2. Volume trend of tweets between 28 February 2020 and 26 March 2020 (N = 1,091,428).

Sentiment analysis

Sentiment was calculated for the entire 23,845 tweet sample. The sentiment of pre-COVID tweets was about equally distributed (see ), with a slightly higher percentage of positive tweets (4,016 tweets, or 36%) compared to negative (3,853 tweets, or 34%) and neutral (3,443 tweets, or 30%). There was an increase in positive tweets (+10%) during COVID-19 (5,797 tweets, or 46%), while both neutral (3,094 tweets, or 25%) and negative (3,642 tweets, or 29%) tweets saw similar declines. Differences in the sentiment between pre- and during-COVID tweets were statistically significant (χ2 = 286.0, df = 2, p < 0.001).

Figure 3. Distribution of tweet sentiment pre- and during-COVID (N = 23,845).

Figure 3. Distribution of tweet sentiment pre- and during-COVID (N = 23,845).

We also investigated the change in sentiment by users. Our sample contained 9,775 and 11,297 unique users who tweeted pre- and during-COVD, respectively. Among them, 506 ‘common’ users tweeted within both time periods, with a total of 1,140 tweets pre-COVID and 863 tweets during-COVID. The proportion of positive and negative sentiment tweets from these common users were 30.8% (351 tweets) and 40.7% (464 tweets), respectively, pre-COVID and 35.1% (303 tweets) and 36.5% (315 tweets), respectively, during-COVID. Although the absolute sentiment proportions for these common users differ from that of the entire sample, they exhibit a similar general trend with an increase in positive sentiment during-COVID.

Manual coding

Descriptive coding

details the distribution of topical dimensions of manually coded tweets. Most tweets were classified as opinion and personal experience. However, the percentage of personal experience decreased (−12.4%) during-COVID while the percentage of opinion slightly increased (+2.5%). After the COVID-19 emergency declaration, there was an increase in the informative tweets (+12.1%). While the percentage of tweets coded as about drivers (−5.6%), about companies (−3.5%), advice seeking (−1.1%), and miscellaneous (−2.4%) decreased slightly after the declaration. Although the reduction in percentages of joke and ridicule tweets during-COVID in absolute terms is lower, their proportion during-COVID decreased relative to pre-COVID.

Figure 4. Distribution of dimensions for manual topical coding (N = 2,220) and topical coding from machine learning output (N = 21,559).

Figure 4. Distribution of dimensions for manual topical coding (N = 2,220) and topical coding from machine learning output (N = 21,559).

Thematic analysis

Three themes emerged from the thematic analysis of the manually coded tweets (N = 2,286): individual experiences, unity, and concern for gig drivers. See for detailed theme descriptions.

Table 3. Descriptions of themes identified in the data.

Individual experiences

Prior to COVID-19, tweets were predominately about individual experiences with gig workers. Most of these experiences were negative and focused on issues with the gig workers personality (e.g., silent, antisocial, rude), quality of service (e.g., canceled ride, bad/no music playing in car, long wait time, missing items, assaulted passenger, car odor), or driving capabilities (e.g., too fast/slow, almost hit someone, blocked crosswalk). For example:

@DoorDash_Help have used your service 4 times and still can’t get a seamless delivery.

Missing ingredients in food and a rude driver. Not sure how you still exist – tweeted on 20 February 2020

Negative experience tweets also discussed frustrations with gig work companies, such as expensive services, bad customer support, and malfunctioning apps. For example:

Lyft and uber are so expensive now, they aren’t even worth it – tweeted on 9 March 2020

To a lesser degree, tweets shared positive experiences with gig workers (e.g., enjoyable conversation with driver, good looking driver, prompt service, playing good music in the car) and gig companies (e.g., Lyft offered free rides to vote or to job interviews). For example:

my uber drivers and i always end up becoming friends by the time i reach my destination hahaha extrovert lifeee – tweeted on 4 March 2020

Unity

During COVID-19, the tweets evolved from an undertone of ‘I’ to an undertone of ‘we.’ Tweets centered around the importance of gig workers and being appreciative of their services during COVID-19. Tweeters expressed gratitude toward gig workers and/or encouraged others to tip gig workers well.

If you’re getting things delivered- food, groceries, etc. - make sure you’re tipping your delivery person, and tipping them well. This may be all they’ve got right now, and as much as they want to be home, quarantined … they’re out helping everyone stay safer. – tweeted on 19 March 2020

There was also a sense of community among the tweeters during COVID-19, with tweeters sharing information about store product availability, time slots for services, how to be ‘COVID’ safe, how their gig work driver practiced ‘COVID safety,’ and restaurant information.

If you're looking online, keep cart full in instacart. Sometimes it'll say no slots until weekend, but both times I ordered & later, had openings today as they caught up. Mine said delivery in 5 hrs, but she shopped/delivered right away. – tweeted on 25 March 2020

Concern for gig drivers

Present in both the pre-and during-COVID tweets, concern for gig drivers was communicated by tweeters who were frustrated that gig workers had a high risk for COVID-19 exposure but at the same time could not afford to miss work because they do not get paid enough.

If you order groceries today, realize that the delivery workers are likely ‘freelance’ workers with NO health benefits, paid payroll taxes, unemployment or worker’s comp. Tip them in sanitized CASH so the app company can’t steal it. #PSA #essentialworkers #instacart #labor – tweeted on 20 March 2020

Tweeters noted that most gig drivers do not get healthcare benefits, paid sick leave, and other employment benefits due to their employment status.

Are you deaf- misclassified U\L drivers forced into a sharecropper arrangement and are forced to work while ill b\c they owe their souls to the company - endangering us all. If they were employees they’d have - PSD, health insurance, unemployment benefits. – tweeted on 12 March 2020

Tweeters also appealed to T/DNCs and the government to improve gig drivers working conditions and provide better benefits for their work during the pandemic.

@Uber @lyft how about Lyft and Uber up their fares slightly and give their employees health benefits? – tweeted on 13 March 2020

Tweets about gig work and gig workers transitioned from mainly personal experiences to a larger community focus. Tweeters wanted to ensure that gig workers were supported, and their community had up-to-date information after COVID-19 induced lockdowns. While tweets regarding lack of employee benefits and employee value by gig work companies remained a concern prior to and after the national emergency declaration, new subthemes related to concerns for gig workers’ safety due to potential COVID-19 exposure emerged during COVID-19.

if you are planning on eating out or getting delivery i urge you to tip and tip WELL. 20% MINIMUM. if you think corona is scary, just think of all the people who are risking their health and safety to get your food to you – tweeted on 18 March 2020

Machine learning model

Our ML model did not perform well on several dimensions. The testing accuracies for the ten topical dimensions ranged from 63% to 97% and the differences between testing accuracies and SMCs (‘Diff’) ranged from −12.61% to 13.06%, indicating that SMC worked better than our model for some dimensions. To compare, Oscar et al. (Citation2017) reported testing accuracies between 86% and 95% and Diff between 5% and 38%. The distribution of these topical dimensions for the tweets classified using ML (N = 21,559) is representative of the smaller manually coded sample, as illustrated in .

Unlike testing accuracy and SMC, Cohen’s kappa is able to provide insights on inter-rater reliability between the ML model and human coders. For all dimensions except personal experience and about drivers, the kappa values were below 0.50 (ranging from 0.00–0.63), indicating an overall poor performance of the ML model. presents machine model performance measures for each dimension.

Table 4. Machine learning model performance measures (N = 222).

Discussion

Not only did the COVID-19 pandemic reveal critical shortcomings of the US labor market, such as the lack of protection for gig workers, but it also directed public attention towards such issues (Katta et al., Citation2020; Mejia et al., Citation2021). In this exploratory study, we examined how the public perception of gig drivers evolved during the early stages of the COVID-19 pandemic on Twitter.

Our results illustrate several changes in different aspects of public perception of platform-based gig drivers. The volume trend of tweets indicates that platform-based gig work received more attention on Twitter during the onset of the pandemic emergency mandate as opposed to two weeks prior to the mandate. Further, overall sentiment of sampled tweets became more positive during the pandemic. There was also a shift in the focus of public discourse from ride-hailing services to delivery services during COVID-19, which is consistent with the changing demands for platform-based services and labor shifts during COVID-19 (Heineke et al., Citation2020).

Prior to the national emergency declaration, most of the tweets were negative towards gig drivers and T/DNCs. However, tweets referenced fewer personal experiences related to T/DNCs during the pandemic, most likely because of fewer opportunities to interact with drivers, especially in ride-hailing services. Supporting prior research (Saud et al., Citation2020), we found that informative tweets surged after the emergency declaration as the Twitter community shared information about safety and service/goods availability to help each other in the wake of the pandemic. Although the proportion of opinions posted on Twitter did not change much after the COVID-19 emergency, they exhibited more concern for gig drivers’ safety and working conditions and urged other tweeters, T/DNCs, and the government to support gig drivers.

Although our ML model did not perform well, our results reflect a similar trend found in Oscar et al.’s (Citation2017) study, where ML classifications were more effective than that of simple majority classifier for topical dimensions with lower SMC values. For our data, this is shown by the improved performance compared to SMC in personal experience, opinion, and about drivers dimensions. Our ML model also had low kappa values (less than 0.63), which is not uncommon for ML models (Ullmann, Citation2019). Our exploratory approach of using ML model demonstrates some potential benefits of using such computational methods in topical coding of tweets for topics that are more prevalent in the data.

Lack of bargaining power and unionization have been identified as barriers to gig workers securing health protections and benefits from T/DNCs companies (Purkayastha et al., Citation2021), but several protests organized by gig workers early in the pandemic received national media attention (Lobel, Citation2020). The increased discussion, positive perception, and newfound empathy towards gig drivers on Twitter during the pandemic, as illustrated by our results, may indicate broader demand for better working conditions and employment benefits from T/DNC companies and support of these efforts by workers. Perhaps labor advocates can take advantage of the apparent public awareness/concern for gig workers and communicate to voters and policymakers how benefits would have improved workers’ outlook during COVID-19. However, we suspect that labor advocates will need to contend with competing narratives from T/DNCs. Going forward, a key challenge for labor advocates will be to maintain this positive momentum and achieve tangible policy outcomes while it lasts (Mejia et al., Citation2021).

The study has a few limitations. First, although diverse, Twitter users are not representative of the US population (Wojcik & Hughes, Citation2019), which affects the generalizability of our results. Second, our keywords list included the most popular terms related to platform-based gig drivers, but it was not exhaustive. Thus, bias may have been introduced if tweets that were relevant, but did not contain a keyword, were missed in data collection. Third, the ML model was trained to classify each topical dimension independently. Although this allowed us to train the model using a relatively lower sample size that required substantial manual coding efforts, it fails to capture the intrinsic relationships between different dimensions. Our ML model could potentially perform better for data with lower interrelationships between the topical dimensions. Future research exploring ML methods in descriptive coding could either train the model on multiple dimensions simultaneously (but would require larger training dataset and more manual coding effort) or could use more advanced ML approaches such as deep learning.

In sum, this study illustrates how Twitter data can be used as an exploratory tool to examine public perceptions of segments of the platform/gig workforce during the COVID-19 pandemic. While the results of this study reveal that perception of the gig workforce became more positive after the emergency declaration and that the content of tweets also evolved, further research is needed to determine if similar results are found among more nationally representative samples and if these positive perceptions will remain as the COVID-19 pandemic continues to linger and evolve in the US.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the National Science Foundation [grant number 2041215: Preparing the Future Workforce for the Era of Automated Vehicles]. Any opinions or findings expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Notes on contributors

Shubham Agrawal

Shubham Agrawal (Ph.D., Purdue University) is a postdoctoral fellow in the Department of Sociology, Anthropology and Criminal Justice. He received his Ph.D. in Transportation Engineering from Purdue University. He is currently investigating the impacts of automated vehicles on the driving workforce and developing strategies to prepare the future workforce for the transition. His other research interests include traveler behavior and cognition, emerging transportation technologies, alternative fuel vehicles, and smart mobility.

Amy M. Schuster

Amy M. Schuster (Ph.D., University of Kentucky) is a postdoctoral fellow in the Department of Sociology, Anthropology, and Criminal Justice at Clemson University. She earned her Ph.D. in Gerontology from the University of Kentucky, MSW from the University of Georgia, and BSW from Florida State University. Amy’s research interests have developed from her years of work experience as a social worker in a long-term care setting. She is particularly interested in information and communication technology use by older adults.

Noah Britt

Noah Britt is a junior at Clemson University majoring in computer science (BS), and most of his work revolves around analyzing social media data. He previously worked as an intern with the Social Media Listening Center at Clemson University.

Jessica Liberman

Jessica Liberman (B.A., Southern Illinois University of Carbondale) is a M.S. candidate in the Social Science program of the Department of Sociology, Anthropology, and Criminal Justice at Clemson University. She received her Bachelor of Arts in Sociology from Southern Illinois University of Carbondale in 2018. She currently works as a research assistant on the NSF project WEAVE: Preparing the Workforce for the Era of Automated Vehicles, applying methods of sociological inquiry to the project’s goals of workforce development.

Shelia R. Cotten

Shelia R. Cotten (Ph.D., North Carolina State University) is the Associate Vice President for Research Development and Provost’s Distinguished Professor at Clemson University. Her research examines emerging technologies and their impacts on quality of life, health, workforce, and educational outcomes. Her research has been funded by the National Science Foundation, National Institute on Aging, American Center for Mobility, and other organizations. She is a former Chair of the American Sociological Association’s Communication, Information Technologies, and Media Sociology (CITAMS) section; she won the Public Sociology and the William F. Ogburn Senior Career Awards from the CITAMS section. Prior to joining Clemson University in August 2020, she was an MSU Foundation Professor in the Department of Media and Information at Michigan State University.

References