7,077
Views
23
CrossRef citations to date
0
Altmetric
Articles

Reference values of maximum performance tests of speech production

, , , &
Pages 56-64 | Received 28 Feb 2017, Accepted 11 Sep 2017, Published online: 26 Sep 2017

Abstract

Purpose: Maximum performance tests examine upper limits of speech motor performance, as used by speech-language pathologists in dysarthria assessment protocols. The Radboud Dysarthria Assessment includes maximum repetition rate, maximum phonation time, fundamental frequency range and maximum phonation volume to assist in detecting pathological performance. This study aims to obtain reference values for each of these tests.

Method: A group of 224 healthy Dutch adults aged 18–80 years performed the maximum performance tests. Age, sex, body height, smoking habit, and profession were registered. Using multivariable linear regression, a wide range of models was tested to examine the relationship between these person characteristics and speech performance. The likelihood ratio was used to test the goodness of fit to the data.

Result: Above 60 years of age, maximum repetition rate, fundamental frequency range and maximum phonation volume were all negatively affected by age. Below 60 years, only women showed effects of age on fundamental frequency range (increase) and maximum phonation volume (decrease). Maximum phonation time was primarily related to body height (increase).

Conclusion: This study presents reference values of four maximum performance tests for comparing the performance of dysarthric patients with non-pathological performance. Age was identified as most important factor influencing maximum speech performance.

Introduction

Maximum performance tests of speech production examine the upper limits of speech motor performance (Kent, Kent, & Rosenbek, Citation1987) and are used in dysarthria assessment protocols by speech-language pathologists (SLPs) to investigate the articulatory and phonatory–respiratory systems more independently than in spontaneous speech (Duffy, Citation2013). Typically, in spontaneous speech all systems (articulatory, velopharyngal, phonatory and respiratory) are cooperating during highly variable speech patterns, while the maximum performance tests have a limited variability. In 2014, the Radboud Dysarthria Assessment (RDA) was published (Knuijt et al., Citation2014), which includes four maximum performance tests of speech production: maximum repetition rate (MRR), maximum phonation time (MPT), fundamental frequency range (FFR) and maximum phonation volume (MPV). Despite the ongoing debate about motor control in speech versus non-speech tasks (Ben-David & Icht, Citation2017; Kent, Citation2015; Maas, Citation2016; Ziegler, Citation2003), we think that maximum performance tasks are of utmost importance in clinical dysarthria assessment. The most important reason is that, compared to spontaneous speech, repetitive speech patterns are less variable and, thus, easier to judge. Other reasons to include maximum performance tests in the RDA are the following. First, in spontaneous speech, a person with dysarthria can compensate speech motor deficits, for example, by slowing down the speaking rate. During maximum performance tests, such compensatory strategies are much harder to use, resulting in a more realistic expression of the different capacities of the articulatory and phonatory–respiratory systems. Second, maximum performance tests can help SLPs with distinguishing different types of dysarthria. For example, a dysrhythmic MRR is a specific feature of ataxic dysarthria (Ackermann, Hertrich, & Hehr, Citation1995; Brendel et al., Citation2015; Duffy, Citation2013) and hypokinetic dysarthria is characterised by a normal MRR with reduced amplitude of the articulatory movements (Ackermann et al., Citation1995). Finally, maximum performance tests can help to identify therapeutic options. For example, a high MPV in a patient with hypokinetic dysarthria reveals the voice capacity that is needed for successful training.

To distinguish pathological from non-pathological speech performance and to obtain an indication of the severity of pathological performance, reference values are needed. These data are partially available in the literature (Icht & Ben-David, Citation2014; Kent et al., Citation1987; Pierce, Cotton, & Perry, Citation2013), but reference values from a sizable population with a clinically relevant age span are lacking. In addition, maximum speech and voice capacity may be related to language or culture. Indeed, Icht and Ben-David (Citation2014) recently showed significant differences in MRRs between English, Portuguese, Farsi and Greek speaking persons. Therefore, the purpose of this study was to collect reference values for MRR, MPT, FFR and MPV in a large population of healthy Dutch adults and relate these to relevant person characteristics.

Method

Participants

We included 224 healthy native Dutch speakers (108 men and 116 women) aged 18–80 years. They were recruited by the investigators from the local community. Participants with a history of any swallowing, speech or voice problem were excluded. The participants were divided into age groups of one decade (18–29, 30–39, 40–49, 50–59, 60–69, and 70–80) and we aimed to include a more or less equal number of participants per age group. We collected the following person characteristics that might influence maximum performances: age (years), sex (men/women), body height (cm), smoking habit (yes/no), and professional occupation. Age and sex were registered as basic person characteristics. In addition, body height was registered because of its effect on lung capacity (Quanjer et al., Citation2012) and smoking habit and professional occupation because of their known effect on the quality of the voice (Awan & Alphonso, Citation2007; Timmermans et al., Citation2002). Based on profession, the level of vocal use was categorised by the classification of Koufman and Isaacson (Citation1991): I: elite vocal performer (singers and actors), II: professional voice user (teachers, receptionists), III: non-vocal professionals (doctors, lawyers), and IV: non-vocal non-professionals (students, laborers).

All participants signed informed consent before participating in the study. We obtained approval from the Committee on Research Involving Human Subjects of Arnhem and Nijmegen.

Speech measurements

The participants performed all speech tasks three times, in similar order, while sitting upright. All performances were recorded with a linear PCM recorder (Tascam DR-05, Tokyo, Japan) and the best maximum performance was used in the statistical analysis. Five trained examiners recruited and instructed the participants and recorded all performances. The examiners worked in pairs, but the participant was assessed by just one examiner.

Description of the tasks

MRR: the participants were instructed to repeat the monosyllabic sequences /pa/, /ta/ and /ka/ and the trisyllabic sequence /pataka/ as fast as possible for at least 6 s. MRR was analysed with Praat (Boersma & Weening, Citation1995) and expressed in syllables per second. The count-by-time method was used during the first 5 s of the sequence (Gadesmann & Miller, Citation2008).

MPT of /a:/: the participants were instructed to produce an /a:/ as long as possible after taking a maximal inhalation, at a comfortable pitch and at their habitual loudness. MPT was analysed with Praat and expressed in seconds.

FFR: the participants were instructed to produce an /a:/ from the lowest possible to the highest possible pitch and vice versa. Producing a musical scale was also allowed. People who experienced difficulties while performing this test were stimulated to produce only their lowest and highest pitches. FFR was analysed with Praat and expressed in Hz. FFR was converted from Hz to semitones using the formula: ST = 39.87 × log (F/50) (Rietveld & Van Heuven, Citation2009).

MPV: the participants were instructed to produce “Hallo!” (Hello!) and “Kom hier!” (Come here!) as loud as possible. MPV was measured with a dB-meter (Voltcraft SL-100, Hirschau, Germany) at 30 cm distance from the mouth, which was standardised by using the A4 assessment form.

Statistical methods

First, we used univariate analysis to explore the association between each maximum performance task and each person characteristic using Pearson correlation coefficients (age and body height), Spearman correlation coefficients (profession), and independent-samples t-tests (sex and smoking habit) to identify possibly influential person characteristics. Regarding MRR (articulation), we only explored the association with sex and age, whereas for MPT, FFR and MPV (voice), we explored the association with all person characteristics. Characteristics with a p value of <0.05 were selected for multivariate analysis (determinants). Second, multivariate linear regression was used to study the unique influence of the identified determinants (independent variables) on each maximum task performance (dependent variable) separately. We searched for the independent variable with the strongest influence to be able to construct reference lines. Therefore, we studied a wide range of models for each maximum performance test: first- to third-degree polynomials in age and body height, piece-wise regression in age and height, interaction terms with sex, and untransformed and logarithmic transformed values of the performance tests. The likelihood ratio was used to test differences between the models for their goodness of fit to the data. With respect to MPT, the dependent variable was the logarithmic transformed value of the MPT. The antilog-transformed results were calculated. For all other maximum performance tests, the dependent variable was the original performance.

A paired-samples t-test was used to test differences between the four individual sequences of the MRR (α level: p = 0.05). All statistical analyses were performed using SAS 9.2 for Windows (SAS Institute, Cary, NC).

Result

A total of 224 participants (108 men and 116 women) were included with a mean age of 43 years [standard deviation (SD) = 19.0, range 18–80] and a mean body height of 175.5 cm (SD =9.6, range 155–201). Sixty-eight participants (30.4%) were smokers and 66 (29.4%) were vocal professionals (level I and II) (see ). The age group 18–29 years was the largest for two reasons. Initially, we started including participants from 20 years old, but we extended the youngest age group from 20–29 years to 18–29 years, as adulthood starts at 18 years and the paediatric version of the RDA (under construction) reaches up to 17 years. Second, this youngest group initially seemed to score lower than expected. By including more participants, we intended to obtain a better representation of this age group.

Table I. Characteristics of all participants (n = 224).

Overall, smoking habit and profession (vocal use) did not influence the maximum performance tests, leaving age, sex and body height as independent variables for the multivariate regression analyses. When testing the models for their goodness of fit, the best fit was the piece-wise regression model with a cut-off point chosen at 60 years of age. Regression coefficients with 95% confidence intervals (95% CI) of the final models are presented. The 5th, 25th, 50th, 75th and 95th percentile reference lines are visualised in a graph. Reference lines are presented by sex when relevant.

Maximum repetition rate

The median and range of each MRR sequence are shown by age group in . Across all age groups, /ka/ was by far the slowest sequence (with 6.0 syl/s) and differed significantly from /pa/ (p < 0.01), /ta/ (p < 0.01) and /pataka/ (p < 0.01). /Pataka/ was the fastest sequence (with 6.9 syl/s) and differed, in addition to /ka/, significantly from /pa/ (p = 0.04) and /ta/ (p < 0.01). Finally, /pa/ was a significantly faster sequence than /ta/ (p < 0.01).

Table II. The observed median and range of the maximum performance tests of speech production by age group.

MRR was only significantly related to age. In , the estimated mean decrease in syllables per second per year is presented for each sequence, using a piece-wise linear regression model with a cut-off point at 60 years. After the age of 60 years, there was a significant decrease in the speed of performance [/pa/: r = −0.03 (95% CI −0.06; −0.01); /ta/: r = −0.06 (95% CI −0.09; −0.02); /ka/: r = −0.05 (95% CI −0.08; −0.02)], whereas the age range between 18 and 60 years did not show a significant decline. The percentile reference lines are shown in . Note that findings were nearly identical for men and women.

Figure 1. The percentile reference lines of the individual sequences of maximum performance rate against age, using a piece-wise linear regression with a cut-off point at 60 years of age.

Figure 1. The percentile reference lines of the individual sequences of maximum performance rate against age, using a piece-wise linear regression with a cut-off point at 60 years of age.

Table III. The estimated mean change per year in MRR, FFR, and MPV using a piece-wise linear regression model with cut-off point at 60 years.

Maximum phonation time

In , the median and range are shown by age group. Across all age groups, MPT was significantly related to body height (p < 0.01). In , the median and range are shown by category of body height. The percentile reference lines are shown in . The mean difference between men and women was 4.9 s (p < 0.01), but the effect of body height was stronger.

Figure 2. The percentile reference lines of MPT against body height, using a linear regression model after the logarithmic transformation of the MPT values.

Figure 2. The percentile reference lines of MPT against body height, using a linear regression model after the logarithmic transformation of the MPT values.

Table IV. MPT by category of body height.

Fundamental frequency range

The median and range are shown by age group in . FFR was significantly related to age and sex. In , the estimated mean change per year is presented for both sexes using a piece-wise linear regression model with a cut-off point at 60 years. In men, there was a significant decrease in FFR per year after the age of 60 years [r = −0.45 (95% CI −0.80; −0.12)], whereas in women this decrease was similar but did not reach significance [r = −0.43 (95% CI −0.88; 0.01)]. Yet for women, there was a significant increase in FFR per year in the age span of 18–60 years [r = 0.24 (95% CI 0.14; 0.35)] (). The percentile reference lines are shown in .

Figure 3. The percentile reference lines of FFR and MPV against age by sex, using a piece-wise linear regression model with a cut-off point at 60 years of age.

Figure 3. The percentile reference lines of FFR and MPV against age by sex, using a piece-wise linear regression model with a cut-off point at 60 years of age.

Maximum phonation volume

In , the median and range are shown by age group. MPV was significantly related to age and sex. In , the estimated mean decrease in MPV per year is presented using a piece-wise linear regression model with a cut-off point at 60 years. In men, there was a significant decrease above 60 years of age [r = −0.11 (95% CI −0.20; −0.02)], which was also found in women [r = −0.19 (95% CI −0.34; −0.05]. However, only for women, there was a significant decrease in MPV below the age of 60 years [r = −0.05 (95% CI −0.08; −0.01)]. MPV was influenced by body height as well, but the effect of age was stronger. Percentile reference lines are shown in ).

Discussion

This study presents reference values of four maximum performance tests of speech production from a sizeable healthy Dutch population. Overall, the data showed a fairly stable performance up to 60 years, but an age-related decline above the age of 60 years for MRR, FFR, and MPV, leaving the MPT relatively unaffected. Only in women FFR showed a marked increase from 18 to 60 years. Smoking habit and profession (vocal use) had no influence on any of the performances.

Looking at MRR, the age effect we found for the monosyllabic sequences is consistent with other studies that found an age-related effect in people older than 65 years (Ben-David & Icht, Citation2017; Padovani, Gielow, & Behlau, Citation2009; Ptacek & Sander, Citation1966). In contrast, Pierce et al. (Citation2013) recently assessed healthy subjects older than 65 years and found no significant age effect between 65 and 86 years, although the raw scores of the 75+ group were lower than of the 65+ group for all but one sequence. We found no age effect for the trisyllabic sequence, although the median speed of performance of the 70+ age group was by far the slowest. The absence of an age effect for the trisyllabic sequence under 60 years of age is in line with the study by Icht and Ben-David (Citation2014). In all age groups, /ka/ was the slowest sequence, which is consistent with previous findings (Kent et al., Citation1987; Padovani et al., Citation2009). Pronouncing /ka/ requires moving the tongue dorsum, which requires movement of most of the mass of the tongue. Men and women performed equally for all MRR sequences, which is in line with the literature. Indeed, studies regarding speech production, speaking and articulation rates hardly ever revealed sex differences (Hyde & Marcia, Citation1988; Kent et al., Citation1987; Tsao & Weismer, Citation1997).

Previously published norm data on the FFR are scarce. Only data on maximum pitch are available, as maximum pitch is one of the four parameters of the Dysphonia Severity Index (Wuyts et al., Citation2000). In accordance with our study, the maximum pitch lowers in ageing men and women, although the causes of laryngeal changes are different between the sexes (Goy, Fernandes, Pichora-Fuller, & van Lieshout, Citation2013; Hakkesteegt, Brocaar, Wieringa, & Feenstra, Citation2006; Hirano, Kurita, & Sakaguchi, Citation1989). In ageing men, bowing and vocal fold atrophy are most often described, whereas in ageing women vocal fold oedema is most frequent. Besides, the fundamental frequency (F0) decreases in post-menopausal women and increases in elderly men (Higgins & Saxman, Citation1991; Honjo & Isshiki, Citation1980; Torre & Barlow, Citation2009). To sum up, in men, the F0 rises and the maximum pitch lowers, whereas in women both the F0 and the maximum pitch lower, which may explain that the full range (FFR) decreases more in men than in women. MPT was not related to age, but only to body height, most likely because of the relationship between body height and lung function (Quanjer et al., Citation2012). Indeed, Awan (Citation2006) reported a significant correlation between MPT and vital capacity. The fact that we found higher rates for men compared to women is evident because of the interaction between sex and body height, and in line with the literature, revealing a longer MPT for men (Goy et al., Citation2013; Hakkesteegt et al., Citation2006; Wuyts et al., Citation2000). Unlike MPT, MPV was more dependent on age than on body height, although MPV depends on lung capacity as well, which is related to body height (Quanjer et al., Citation2012). MPT may be more dependent on lung volume, whereas MPV may be more dependent on muscle strength. With ageing, a decreased muscle strength in combination with the above mentioned laryngeal changes may account for the larger influence on MPT of age than body height.

In three of the four tasks, age was the most important factor influencing maximum performance tests of speech production. Most of this effect was observed from the age of 60 years and older. Human functioning generally declines above the age of 60 years due to neurological, metabolic, and hormonal changes (Carmona & Michan, Citation2016). These changes can have a negative influence on speech, just as on the physical performance of a person (Ramig, Citation1983). Looking at speech, presbyphonia is the term typically used for age-related vocal changes (Kendall, Citation2007). Yet, our study clearly shows that age-related changes are not confined to the voice, but reach out to the articulation domain as well, which could be termed “presbyarthria”. Indeed, in 1974, Ryan and Burk suggested that speech of aged adults may fall at the mild end of a dysarthric continuum. This conclusion was confirmed by Parnell and Amerman (Citation1987) in a perceptual study, in which a mild dysarthric speaker was difficult to distinguish from healthy geriatric participants. Other studies that confirm the age-related effects regarding articulation are those showing that speaking rate slows down with advanced age (Harnsberger, Shrivastav, Brown, Rothman, & Hollien, Citation2008; Ramig, Citation1983; Sadagopan & Smith, Citation2013) and studies showing that the variability of acoustic and kinematic measures increases with older age (Bennett, van Lieshout, & Steele, Citation2007; Wohlert & Smith, Citation1998).

Hence, the question is justified which underlying mechanism is responsible for this decline of speech quality above the age of 60 years? As healthy persons typically use a small amount of their maximum tongue strength during speech (Dworkin & Aronson, Citation1986), normal age-related loss of orofacial muscle strength (Adams, Mathisen, Baines, Lazarus, & Callister, Citation2013; Vanderwegen, Guns, Van Nuffelen, Elen, & De Bodt, Citation2013) can probably not account for loss of speech quality at older age. Recently, research has been conducted regarding non-muscular tissue stiffness. It was found that fibrosis (accumulations of excessive connective tissue), lipomatosis (accumulations of fatty cells), and amyloidosis (deposits of waxy proteins and polysaccharides) in tongue tissue increase progressively with age (Rother, Wohlgemuth, Wolff, & Rebentrost, Citation2002; Yamaguchi, Nasu, Esaki, Shimada, & Yoshiki, Citation1982). In line with these findings, Dietsch et al. (Citation2015) found increased non-muscular tissue stiffness of the skin overlying the masseter, the cheek and lateral tongue with age. Another example is the study by Mefferd and Corder (Citation2014), who found that older adults (>65 years) were able to increase lower lip and jaw speed during an MRR test with /fa/, but that they had more difficulties with stiffness regulation and force production than younger adults (22–55 years). Consequently, it is plausible that non-muscular stiffness of oral structures is a relevant cause for the decline in speech quality above the age of 60–65 years. In addition, the above mentioned insidious neurological changes may have a negative effect on speech as well. Indeed, if the central and peripheral nervous systems gradually decline, there will be slowing of movements, loss of coordination, and an increase in speech variability (Seidler et al., Citation2010), although people can adapt to these changes by using compensatory strategies (e.g. slowing down their speaking rate to ensure movement accuracy) (Diggles-Buckles, Citation1993).

Typically, for all maximum performance tests in this study, the range of non-pathological performance was large. Although a large range of normality has been found for other maximum performance tests such as maximum inspiratory pressure (Sclauser Pessoa et al., Citation2014) or the 6-min walk test (Enright & Sherrill, Citation1998), a large normal range may complicate the interpretation of the performance of individual dysarthric speakers. Yet, the reference lines provide the patient’s performance with a percentile score. Nevertheless, qualitative characteristics of maximum performance tasks are equally important to identify underlying pathology (weakness, rigidity, coordination deficits) and, thus, to contribute to the assessment of the type and severity of dysarthria.

Strengths and limitations

Our participants formed a fair representation of the general Dutch population (CBS, Citation2015), as we included various age groups between 18 and 80 years with a mean body height of 175.5 cm (SD =9.6, range 155–201) and a variation in professional voice use. However, we assessed only participants who had Dutch as their first language. It is, therefore, questionable whether our reference values are also applicable to people with other first languages or to people with Dutch as a second language. Icht and Ben-David (Citation2014) suggested that their across-language differences in the trisyllabic MRR sequence could be explained by different tongue settings, influencing the /t/ and /k/. In addition, the English /p/, /t/ and /k/ are aspirated, whereas these syllables in Dutch are not. Therefore, it seems valuable to extend our population with participants speaking Dutch as a second language and to compare our data with equally sized groups with other first languages using the same assessment protocol.

Generalisability is related to age range as well. We included participants from 18 years old, because normal values of children up to 17 years are being collected in preparation of the paediatric RDA. Because we took 80 years as the upper age limit, the normal values are not applicable to dysarthric patients older than 80 years.

Another limitation is that we used several examiners to collect the data. Although they were all trained by the first author (S.K.), we cannot rule out subtle differences in examination approach due to interobserver variability. We did not control for test–retest variability either, but all participants performed each task three times and we used their best performance for analysis.

Conclusion

This study provides reference values of four maximum performance tests of speech production to compare the performance of dysarthric patients with non-pathological speech performance. Age was identified as the most important factor influencing MRR, FFR, and MPV (>60 years), whereas MPT was primarily influenced by body height. Only women showed effects of age on FFR (increase) and MPV (decrease) < 60 years. Interestingly, age-related changes were not confined to the voice, but reached out to the articulation domain as well, which could be referred to as “presbyarthria”.

Declaration of interest

No potential conflict of interest was reported by the authors.

Acknowledgements

The authors thank Dr. J. C. M. Hendriks for his statistical expertise and Jacorine Roekevisch, Marleen Klaar, Céline van Kalmthout, Ellen Boskamp, Susan Steffen and Harry Goos for their participation in collecting the data.

References