Publication Cover
Chronobiology International
The Journal of Biological and Medical Rhythm Research
Volume 37, 2020 - Issue 9-10: Selected Proceedings: Shiftwork 2019
3,636
Views
17
CrossRef citations to date
0
Altmetric
SELECTED PROCEEDINGS: SHIFTWORK 2019

The relationship between workload, performance and fatigue in a short-haul airline

, , , &
Pages 1492-1494 | Received 20 Feb 2020, Accepted 28 Jul 2020, Published online: 24 Aug 2020

ABSTRACT

The aim of this study was to determine the relationship between pilot workload, performance, subjective fatigue, sleep duration, number of sectors and flight duration during short-haul operations. Ninety pilots completed a NASA Task Load Index, Psychomotor Vigilance Task and a Samn-Perelli fatigue scale on top-of-descent of each flight and wore an activity monitor throughout the study. Weak, but significant, correlations were revealed between workload and all factors. Subjective fatigue, number of sectors and lapses were significant predictors of workload. Pilots reported higher workload when fatigue increased, the number of sectors were higher, and objective performance was worse.

Introduction

Short-haul airline pilots face high workload and fatigue due to flying multiple flights in a single day and reduced sleep due to early start times and long duty days (Flynn-Evans et al. Citation2018; Honn et al. Citation2016; Vejvoda et al. Citation2014). Most duty and rest regulations address the maximum amount of duty time on a given day, the cumulative time for a duty period and a minimum acceptable duration of rest periods between duty periods. The impact of these factors on alertness and performance has been evaluated in many studies. Workload has not been studied as extensively, although it is included in the definition of fatigue by the International Civil Aviation Organization (ICAO Citation2015). The association between sleep, performance and workload during short-haul flights remains unclear although sleep and workload each contribute to pilot performance and fatigue. Therefore, the objective of this study was to determine the relationship between pilot workload, performance, subjective fatigue, sleep duration, number of sectors and flight duration during short-haul operations.

Methods

Participants

Ninety-five pilots (eight females) from a short-haul commercial airline volunteered to participate in the study during standard operations. Five pilots were removed from analyses due to missing and outlier data (>2 standard deviations).

Protocol

The study protocol was reviewed and approved by the NASA Institutional Review Board (HRIRB, protocol HRI-319). Pilots working for a single airline were recruited through flyers and via company e-mails. The pilots who volunteered to participate in the study were invited for a training day when they gave a written, informed consent and completed a demographic questionnaire. They were provided with an iPod (5th generation, iOS 6.8.53, Apple, Cupertino, CA, USA) uploaded with the study questionnaires and were provided instructions on the timing and procedures of the tests. The pilots flew a roster that included four duty blocks separated by three or four days off. Each duty block consisted of five consecutive duty days; each duty day included two or four sectors. Pilots departed from and returned to their home base.

Pilots completed a sleep diary and wore an Actiwatch (Respironics Inc., Bend, OR, USA) throughout the study. On top-of-descent (just before the start of the approach for landing) of each flight, pilots completed a Samn-Perelli scale (Samn and Perelli Citation1982), a 5-minute Psychomotor Vigilance Task (PVT) and a NASA Task Load Index (NASA-TLX, Hart and Staveland Citation1988). The Samn-Perelli scale is a fatigue scale asking participants to rate their level of fatigue from 1 = “fully alert, wide awake” to 7 = “completely exhausted, unable to function effectively.” The PVT is a validated test where participants are asked to respond as rapidly as possible to a stimulus appearing on the screen at different predetermined intervals (Arsintescu et al. Citation2017). The NASA-TLX is a measure of workload originally developed for use in the flight crew studies (Hart and Staveland Citation1988). NASA-TLX is comprised of six dimensions that measure mental demand, physical demand, temporal demand, performance, effort and frustration on a scale from 0 = “Low” to 100 = “High” except for performance for which the scale is from 0 = “Good” to 100 = “Poor.” The unweighted TLX scores were used for the analyses, which is common and highly correlated to the weighted version of the index (Byers et al. Citation1989).

Statistical analyses

All data were analyzed using SPSS (version 26.0, IBM SPSS Statistics for Windows, Armonk, NY, USA). Response speed (mean 1/RT) and lapses (RT > 500 ms) were used as PVT outcomes. Spearman’s rho non-parametric correlations (α=.05) were used to compare PVT, Samn-Perelli, sleep duration, the number of sectors and flight duration with NASA-TLX. A multiple stepwise regression was conducted to test if PVT lapses, response speed, fatigue, sleep duration (obtained on the previous night), number of sectors, duty day and flight duration predicted the overall workload rated by pilots.

Results

The participants ranged in age between 21 and 54 with an average of 33 (±8) y. Data was collected on 3,243 short-haul flights. On average, flight duration was 2:14 (± 0:49) h long.

Overall, the mean raw TLX was significantly, but weakly, correlated with PVT lapses, response speed, Samn-Perelli fatigue, number of sectors, sleep duration and flight duration ().

Table 1. Correlations between NASA-TLX and PVT lapses, response speed, Samn-Perelli, flight duration, number of sectors and sleep duration

Non-parametric correlations revealed weak, but significant, associations between lapses, response speed, Samn-Perelli, and each of the six workload dimensions () showing a decrease in performance and an increase in fatigue when the subjective workload was rated higher.

Flight duration was weakly, but significantly, correlated with temporal demand, effort and frustration, such that temporal demand, effort and frustration were higher on shorter flights (). The number of sectors was weakly, but significantly, correlated with temporal demand, effort, performance and frustration, which were rated higher on the days with four sectors. Sleep duration was very weakly, but significantly, correlated with mental demand and frustration. The stepwise regression showed that the Samn-Perelli (ß =.22, SE = .38, p < .001), the number of sectors flown each day (ß = .08, SE = .44 p < .01) and the PVT lapses (ß = .08, SE = .04, p < .01) explained a small, but significant, amount of the variance in workload (F(3, 1280) = 30.17, p < .001, R2 = .07).

Discussion

Our findings suggest that a higher self-reported workload is associated with slower reaction time and higher ratings of fatigue. There were very weak, but significant, correlations between mean pilot workload as measured by NASA-TLX and PVT performance, Samn-Perelli ratings, sleep duration, number of sectors and flight duration. In addition, each workload scale was significantly correlated with PVT performance and Samn-Perelli fatigue ratings. Pilots experienced more PVT lapses and their response speed was slower when workload was rated higher. Albeit still weak, the highest correlations of the study were between PVT performance and workload; workload was rated higher when fatigue was also rated as higher. Pilots reported being frustrated, experiencing time pressure and making more effort on shorter flights. The stepwise regression showed that Samn-Perelli fatigue, the number of sectors and PVT lapses were significant predictors of workload, although the effect was very small.

ur findings confirm and extend other studies that have shown that workload and fatigue are related to airline pilots. Previous studies also found a significant relationship between fatigue and multiple flight sectors (Flynn-Evans et al. Citation2018; Honn et al. Citation2016; Powell et al. Citation2007). Powell and colleagues found a linear relationship between fatigue as measured by Samn-Perelli and the number of sectors and time awake. Honn et al. (Citation2016) found that subjective fatigue was greater and PVT performance was worse in a flight simulation with multiple sectors compared to a single long flight, with a modest effect on the fatigue of multiple takeoffs and landings. In the real world, Flynn-Evans et al. (Citation2018) showed higher fatigue ratings and poorer performance among pilots flying multiple sectors with longer duty days compared to those flying fewer sectors, despite both groups having the same sleep opportunity. It is possible that sleep-deprived pilots perceive workload as higher because higher levels of workload and fatigue have been reported following shorter sleep duration (Bourgeois-Bougrine et al. Citation2003). The elevated fatigue observed in the present study may relate to multiple short sectors, and temporal and mental demand. Performance, mental and temporal demand were rated the highest of the six subscales, similar to previous studies investigating pilot workload (Burke et al. Citation2013).

Although significant, the correlations in the present study were very weak and the variance explained by the predictors in the regression model was very small. In addition, our study shows relationships between these factors but does not indicate causality. An additional limitation is that the TLX ratings referred to the flight up to the top-of-descent and it didn’t capture landing, which is the most stressful phase of flight. Finally, the flight sectors that we studied were relatively short due to the short-haul nature of the study, which may explain why flight duration was not a significant predictor in the regression model. Studies including longer flights may yield different outcomes.

Further research is needed to determine how other factors such as weather, air traffic control communications and experience may contribute to high workload during flights. When combined with less sleep, long duty days, multiple takeoffs and landings, these factors have the potential to increase risk. Future studies should focus on disentangling the many factors that can cause one to report elevated workload and to determine which specific aspects of workload may cause pilots to experience reduced performance.

Acknowledgements

We would like to thank the pilots who collected data for this study and the airline’s flight safety department for all their hard work in coordinating equipment and recruitment. This work was supported by the NASA Airspace Operations and Safety Program, System-Wide Safety Project.

References

  • Arsintescu L, Mulligan JB, Flynn-Evans EE. 2017. Evaluation of a psychomotor vigilance task for touch screen devices. Hum Factors. 59(4):661–670. doi:10.1177/0018720816688394
  • Bourgeois-Bougrine S, Cabon P, Mollard R, Coblentz A, Speyer J. 2003. Fatigue in aircrew from short-haul flights in civil aviation: the effects of work schedules. Hum Factors Aerosp Saf: An Int J. 3:177–187.
  • Burke KA, Wing DJ, Lewis TA. 2013. Pilot subjective assessments during an investigation of separation function allocation using a human-in-the-loop simulation. In 2013 Aviation Technology, Integration, and Operations Conference, Los Angeles, CA. p. 4360. https://doi.org/10.2514/6.2013-4360
  • Byers JC, Bittner AC, Hill SG. 1989. Traditional and raw task load index (TLX) correlations: are paired comparisons necessary? In: Mital A, editor. Advances in industrial ergonomics & safety. Vol. 1. Cincinnati, OH: Taylor & Francis; p. 481–485.
  • Flynn-Evans EE, Arsintescu L, Gregory K, Mulligan J, Nowinski JL, Feary M. 2018. Sleep and neurobehavioral performance vary by work start time during non-traditional day shifts. Sleep Health. 4(5):476–484. fsssdoi:10.1016/j.sleh.2018.08.002
  • Hart SG, Staveland LE. 1988. Development of NASA-TLX (Task Load Index): results of empirical and theoretical research. In: Hancock PA, Meshkati N, editors. Advances in psychology, 52. Amsterdam, The Netherlands: Human mental workload. p. 139–183.
  • Honn KA, Satterfield BC, McCauley P, Caldwell JL, Van Dongen HP. 2016. Fatiguing effect of multiple take-offs and landings in regional airline operations. Accid Anal Prev. 86:199–208. doi:10.1016/j.aap.2015.10.005
  • International Civil Aviation Organization. 2015. Fatigue management guide for the airline operators (2nd ed.). https://www.icao.int/safety/fatiguemanagement/FRMS%20Tools/FMG%20for%20Airline%20Operators%202nd%20Ed%20(Final)%20EN.pdf
  • Powell D, Spencer MB, Holland D, Broadbent E, Petrie KJ. 2007. Pilot fatigue in short-haul operations: effects od number of sectors, duty length, and time of day. Aviat Space Envir Med. 78(7):698–701.
  • Samn S, Perelli L. 1982. Estimating aircrew fatigue: A technique with implications to airlift operations. Technical Report No. SAM-TR-82-21. Brooks AFB (TX): USAF School of Aerospace Medicine. p. 1–26.
  • Vejvoda M, Elmenhorst EM, Pennig S, Plath G, Maass H, Tritschler K, Basner M, Aeschbach D. 2014. Significance of time awake for predicting pilots’ fatigue of short-haul flights: implications for flight duty time regulations. J Sleep Res. 23(5):564–567. doi:10.1111/jsr.12186