8,096
Views
2
CrossRef citations to date
0
Altmetric
Research Article

Improvements in Speech of Children with Apraxia: The Efficacy of Treatment for Establishing Motor Program Organization (TEMPOSM)

ORCID Icon, ORCID Icon, , , , & show all
Pages 494-509 | Received 13 Aug 2020, Accepted 08 Apr 2021, Published online: 09 Jul 2021

ABSTRACT

Purpose: This study investigated the efficacy of Treatment for Establishing Motor Program Organization (TEMPOSM) in childhood apraxia of speech (CAS).

Method: A mixed between- and within-participant design with multiple baselines across participants and behaviors was used to examine acquisition, generalization, and maintenance of skills. TEMPOSM was administered in four one-hour sessions a week over a four-week period for eleven participants (ages 5 to 8), allocated to either an immediate treatment group or a wait-list control group. Acoustic and perceptual variables were measured at baseline, immediate post-treatment, and one-month post-treatment.

Results: Children demonstrated significant improvements in specific acoustic measures of segmentation and lexical stress, as well as perceptual measures of fluency, lexical stress, and speech-sound accuracy. Treatment and generalization effects were maintained one-month post-treatment with generalization to untreated stimuli.

Conclusion: TEMPOSM was efficacious in improving segmental and suprasegmental impairments in the speech of children with CAS.

Introduction

Childhood apraxia of speech (CAS) is a motor speech disorder characterized by an impairment in the programming of spatial and temporal parameters for accurate speech movement patterns.Citation1,Citation2 Specifically, apraxia of speech is a breakdown in the translation of intact phonological plans into the exact movement parameters required for segmental and prosodic accuracy of speech production.Citation2 Although the perceptual features that define the disorder have long been the subject of debate, there is emerging consensus that childhood and acquired forms of apraxia are unified by a common set of differentially diagnostic features including segmentation (increased segment and inter-segment duration), reduced stress contrast across syllables in polysyllabic words, and distortion of speech sounds.Citation2,Citation3

Apraxia of speech has been conceptualized within Klapp’s motor programming model in which there is an internal working memory buffer (INT) that holds motor units prior to execution, and a sequencer (SEQ) that places those units in the correct serial order after movement onset.Citation4,Citation5 Adult speakers with apraxia demonstrate impairments in INT, and not SEQ, indicating a deficit in the efficient programming of motor units.Citation6 As motor learning occurs, individual speech units (e.g. syllables) are concatenated into larger motor programs such as polysyllabic words;Citation7 however, this process is impaired in speakers with apraxia as more complex programs place an increased load on INT.Citation6 Specifically, inefficient concatenation explains the perception that individuals with apraxia “speak one syllable at a time,” or segment their speech.Citation8 Concatenation also allows for accurate programming of coarticulatory effects and prosodic patterns across syllables, both of which are notably impaired in speakers with apraxia of speech, since the application of the suprasegmental features that underlie lexical stress (e.g., changes in frequency, intensity and duration across syllables within a word) occurs in INT.Citation6

Children with CAS are thought to require years of intensive therapy,Citation1 reportedly up to 81% more therapy than severe phonological disorders to achieve similar functional outcomes,Citation9 perhaps due in part to a poor understanding of the disorder mechanism up until recently. Indeed, symptoms can persist into adulthood and result in a substantial disability affecting intelligibility, social communication, academic performance, and overall quality of life.Citation10,Citation11 Treatment approaches for CAS have primarily targeted improved accuracy of segmental features to expand phonemic inventory or develop a core vocabulary. Although there are a variety of treatment approaches, the efficacy of each has not been established (for a review, see ref. Citation12) Consequently, there is a critical need for development and implementation of innovative treatment approaches that effectively target the underlying impairment in CAS and the resulting disruption in temporal control of syllable-level prosody – specifically lexical stress contrasts and syllable segmentation – that characterizes the disorder. Mechanistic-based treatment approaches may result in increased treatment efficacy, with faster and more effective outcomes to intervention.

The current study investigates the efficacy of Treatment to Establish Motor Program Organization (TEMPOSM) as an intervention for CAS. TEMPOSM targets the hypothesized underlying impairment in CAS (i.e., segmentation of speech into individual sound/syllable units) by training concatenation of syllables into longer motor units. Additionally, TEMPOSM explicitly and simultaneously targets each of the three diagnostic features of CAS through repeated practice of fluent transitions between syllables, syllable stress contrasts, and accurate speech sounds in multisyllabic pseudowords for improved intelligibility and naturalness of speech. TEMPOSM is structured within a motor learning framework and fully incorporates principles of motor learning (PML) in order to maximize treatment efficacy. Broadly, PML delineate important factors in stimulus selection, practice structure, and feedback that increase the difficulty of training tasks and encourage self-evaluation of productions in order to promote retention and generalization of trained motor skills (see Methods for a complete list).Citation13,Citation14 Robin and colleagues first demonstrated a significant improvement in acoustic and perceptual measures of lexical stress in three children with CAS following treatment using this novel approach, now named TEMPOSM.Citation15 Subsequent studies of this approach, published using the name Rapid Syllable Transition Treatment (ReST),Citation16–20 have used exclusively demonstrated positive treatment effects in specific perceptual measures of articulatory and lexical stress accuracy, including in one randomized controlled trial of children with CAS.Citation17 A smaller single-case design study demonstrated similar perceptual improvements in four children with CAS who received lower-dose treatment frequency.Citation18 Telehealth delivery of this intervention also resulted in positive gains in perceptual measures of production accuracy.Citation19,Citation20

Perceptual measures are the current standard for clinical diagnosis and treatment of CAS but have limitations in comparison with acoustic measures, which have been shown to be reliable, objective, and sensitive measures of features of motor speech disorders.Citation21–23 Specifically, acoustic measures of the speech of children with CAS show evidence of a disruption in temporal control of speech, marked by increased duration and reduced variability in duration of speech segments.Citation15,Citation24–29 Ballard et al. provided strong preliminary evidence as to the utility of acoustic measures as sensitive indices of treatment-related change in CAS in the first single-case design study of TEMPOSM,Citation15 but there is otherwise a lack of acoustic evidence as to the efficacy of CAS treatment. Therefore, the application of these durational acoustic measures in a larger group study is a critical next step to advance our understanding of the specific mechanisms through which TEMPOSM improves prosody in CAS. The current study aims to extend previous work to include an additional acoustic measure (intersegment duration) to provide a more precise measure of changes in segmentation of speech following TEMPOSM intervention. Additionally, this study aims to validate the acoustic measures of lexical stress previously applied by Ballard et al.Citation15 (increased durational, pitch, and volume contrasts) in a larger group of children with CAS. Perceptual measures of segmentation of speech, lexical stress, and speech sound accuracy are also included as secondary outcome measures to parallel the measures used in previous studies of this treatment approach and the provision of perceptually based feedback during TEMPOSM clinical sessions.

The primary hypotheses are:

  1. Children will demonstrate significantly improved performance in acoustic and perceptual measures of segmentation, lexical stress, and consonant distortions as a result of treatment of trained pseudowords comprised of plosive phonemes, with retention of treatment effects one-month post-treatment.

  2. Treatment effects will generalize to untreated related exemplars, both real words and pseudowords comprised of plosive phonemes, with maintenance up to one-month post-treatment.

  3. Improvements in perceptual speech sound accuracy will not generalize to pseudowords that are untreated and unrelated to the trained set; that is, pseudowords comprised of fricative phonemes.

  4. Acoustic measures of intersegment duration and lexical stress contrast will be significantly correlated with perceptual measures of segmentation and stress accuracy.

Method

This study was approved by the Institutional Review Board for the Protection of Human Subjects in Research at the University of New Hampshire and written informed consent was obtained from both parents and the participating children.

Participants

Children were referred to the clinic by their current treating speech-language pathologists in response to a recruitment advertisement. Families first completed a short telephone screen to collect a case history and schedule an evaluation to confirm eligibility criteria for participation in the study: a) normal hearing; b) no orofacial structural abnormalities, muscle weakness, or altered muscle tone or reflexes; c) native speaker of English, d) no other developmental, neurological, or genetic disorders, and e) diagnosis of CAS. Formal clinical evaluation was then completed by graduate student clinicians under the supervision of qualified clinicians (authors DAR and ASP). Assessments included a pure-tone hearing screening to ensure normal hearing at time of study entry;Citation30 Clinical Evaluation of Language Fundamentals – Fifth Edition (CELF-5) to assess language skills;Citation31 Goldman – Fristoe Test of Articulation – Third Edition (GFTA-3) to document speech errors;Citation32 and a structural and functional oral and speech motor assessment (i.e., Motor Speech Examination, MSE) to determine presence of CAS and rule out structural abnormality or frank dysarthria.Citation33 The oral structure examination confirmed no evidence of asymmetry, slowness, weakness, reduced movement amplitude, or altered muscle condition (e.g., flaccidity or spasticity). Children deemed eligible for the study also demonstrated no evidence of impaired respiration, phonation, or resonance during the examination.

Expert clinician judgment of the presence/absence of key discriminative perceptual features remains the current gold standard in diagnosis of CAS, as agreed-upon thresholds of presences for key CAS features have yet to be established.Citation34 Evidence for segmentation, equal stress, and speech sound distortions, which are the three commonly cited key perceptual features in both acquired and childhood apraxia of speech,Citation1–3,Citation34,Citation35 was assessed during the polysyllable speech tasks of the MSE (e.g. production of words of increasing length, multiple repetitions of three-syllable words, sentences containing multisyllabic words, and conversational speech).Citation33 This is consistent with recent studies that have shown polysyllable words to be sensitive to these apraxic features.Citation3,Citation34 Diagnosis of CAS required unanimous independent agreement across two experienced speech-language pathologists (authors ASP and DAR, both certified SLPs with more than 20 years of clinical experience with CAS) as to the presence, in at least three tasks (word-level or above) of the MSE, of all three features of the disorder: 1) segmentation, defined as increased segment and intersegment durations, particularly the perception of syllable segregation and inter-syllabic pauses; 2) equal or reduced stress contrast across syllables in multisyllabic words or phrases; and 3) speech sound distortions, defined as phonetic errors in the production of a phoneme, including voicing distortions, vowel distortions, and distorted substitutions. Clean phonemic substitution errors (i.e. produced without distortion) were not counted as evidence for CAS. While there is no commonly agreed upon operational definition of “presence” of an apraxic feature,Citation34 we applied the descriptions of Strand et al. from the Apraxia of Speech Rating Scale; that is, a feature was considered present in any given task if a rater judged that it was either “detectable but infrequent” (i.e., this could be a single and clear occurrence in a task), “frequent but not pervasive”, or “nearly always evident” (see ref.Citation36). To ensure that a child did not receive the diagnosis based on a single occurrence of each feature, we required unanimous agreement that all three features were observed across at least three polysyllabic speaking tasks.

Table 1. Participant characteristics

shows participant flow through the study in accordance with TREND guidelines.Citation37 Of 22 families who completed the preliminary screen, 16 completed evaluations in the clinic. The remaining six families decided not to participate and did not schedule an evaluation. Three children were excluded following evaluation due to a diagnosis of phonological disorder with no evidence of motor programming impairment. Of the 13 children who met study criteria with a confirmed diagnosis of CAS, one family declined to participate in the study, and the remaining 12 children completed the four-week intervention. One of the 12 children was excluded from the group analyses reported here, as behavior prevented collection of some post-treatment measures; individual data for this participant is included in Supplemental Materials.

Figure 1. Flow chart of participant enrollment, assignment, and follow-up

Figure 1. Flow chart of participant enrollment, assignment, and follow-up

The 11 participating children ranged in age from 5;10 to 8;4 years;months, as of the first day of treatment (M = 7;1, SD = 0.7 years). Age and sex of each participant, as well as scores from administration of the Clinical Evaluation of Language Fundamentals – Fifth EditionCitation31 (CELF-5) and Goldman – Fristoe Test of Articulation – Third EditionCitation32 (GFTA-3) are presented in . All were enrolled in speech therapy for two or more years up until the start of this study and suspended community-based therapy until completion of the study (i.e. from the first baseline probe until the completion of the one-month retention phase).

Experimental Design

This phase II small-group cohort-control study employed a mixed between- and within-subjects design, with multiple baselines across participants and behaviors; the range of baseline length was varied sufficiently to allow for use of a waiting control group (for example, see ref. Citation38) Participants were pseudo-randomly assigned to either the immediate treatment or waiting control group to allow for constraints due to family availability during the treatment periods. Participants in the immediate treatment group (i.e. group 1 in ) completed 2–3 baselines in a roughly two-week period while the waiting control group (i.e. group 2 in ) completed 4–6 baseline testing sessions over a period of six weeks. Multiple baseline tests documented level and variability of performance for each child prior to treatment, with staggered baselines across participants used for demonstrating internal validity; that is, testing whether the treated and related behaviors improved significantly only after the administration of the treatment. This waiting control group design was used to establish the effect of the four-week TEMPOSM treatment protocol relative to no treatment; however, the two groups will be reported here as a single group analysis as the waiting control group showed no change in performance on baseline probes administered over the wait-list period. The subsequent entry of the wait list group into the four-week treatment allowed us to double the sample size that was then considered for analysis of changes in speech accuracy for treated and untreated stimulus sets over time. Additional experimental probes were completed immediately post-treatment (M= 2.2 days post-treatment, SD = 1.8 days), and at one-month post-treatment (M= 31.7 days post-treatment, SD = 2.3 days) to measure treatment effects, retention, generalization to untreated items, and experimental control.

Stimuli

Treatment stimuli consisted of three- or four-syllable pseudowords with either a strong-weak (SW) or weak-strong (WS) stress pattern over the first two syllables. Based on performance during the initial evaluation, baseline probes, and pre-practice during the first treatment session, children were assigned to either the three-syllable of four-syllable treatment level (see ) to provide an optimal challenge and functional difficulty level.Citation39 For the three-syllable stimuli, a list was generated of all 72 possible CVCVCV combinations containing three different plosive consonants (/b/, /t/, and/g/) and three different long vowels (/a/, /i/, /u/) in both SW and WS stress patterns, consistent with Ballard et al. (e.g., TAgibu and taGIbu).Citation15 A similar list was created for the four-syllable stimuli, again containing plosives (/b/, /t/, and/g/) and vowels (/a/, /i/ , /u/) to generate a comparable list of 72 stimuli (e.g., GItubagi and giTUbagi). From these stimulus lists, 40 syllable strings were randomly selected for treatment (Treated Plosives) in both SW and WS stress conditions (e.g. 20 unique strings, each to be practiced in both SW and WS stress patterns for a total of 40 items). The remaining 32 combinations were left untreated to measure transfer to similar but untreated exemplars in experimental probes (Untreated Plosives). Two additional stimulus sets were created for experimental probes: a Real Words set of three-syllable real words containing the treated sounds (e.g. toboggan, barbecue), and a Fricatives set of three-syllable pseudowords containing fricative sounds (e.g. SHIvasu, shiVAsu).

Baseline and experimental probes were administered by graduate clinicians and contained a total of 120 items, including 20 items (10 SW and 10 WS) randomly selected from each of the four sets above: Treated Plosives, Untreated Plosives, Real Words, and Fricatives. An additional 40 stimuli were selected from other complexity levels; however, performance on these items is not reported here. Stimuli were presented in randomly ordered carrier phrases (e.g., “There’s my ____” or “She has a _____.”). Four variations of the probe lists were used for experimental probes, with the order counterbalanced and randomized across participants. Each child’s treating clinician administered at least one of the child’s baseline probes and all subsequent experimental probes.

Treatment

Treatment was administered by trained graduate student clinicians over a four-week treatment period, with four one-hour sessions per week. The treatment period in this current study was extended one week from the original three-week period used in previous studies of this treatment approach.Citation15–18 Consistent with other motor learning protocols in speech pathology we extended the treatment period to four weeks as this has produced robust learning effects for motor speech disorders (for example, see ref. Citation40) and children with CAS are known to require high dosages of therapy.Citation1,Citation9,Citation41

Recommendations from the Treatment Fidelity Workgroup of the National Institutes of Health Behavior Change Consortium were implemented to ensure treatment fidelity.Citation42 Clinicians completed multiple intensive training sessions with first and last authors, including structured practice and role-playing for accurate implementation of the treatment protocol, as documented in the treatment manual. Over 50% of sessions were directly observed by one of three authors to ensure inter-rater reliability. Clinicians also each observed at least one session conducted by another clinician. Any discrepancies with the treatment protocol were addressed during treatment sessions or at frequent supervisory meetings.

Intervention explicitly targeted each of the three features of CAS through repeated productions of multisyllabic pseudowords at a natural speech rate. Correct production was assessed on accuracy across each of the three features of CAS: correct sounds, fluent transitions between syllables, and accurate lexical stress. Twenty stimuli (10 SW and 10 WS) from the Treated Plosive set were randomly selected for each session.

Treatment was structured within a motor learning framework (see for a summary of key PML).Citation13,Citation14 Treatment sessions consisted of (a) Pre-Practice, continuing until the child produced correct responses to five different stimuli with clinician-provided Knowledge of Performance (KP) feedback, modeling, and visual or tactile cues, as necessary; and (b) Practice, consisting of 100 total productions being five productions each of the twenty randomly ordered stimuli. During the Pre-practice stage, clinicians established a reference of correctness for the task, provided explanations of target response accuracy, and ensured motivation and stimulability. The Practice stage adhered to a strict low frequency, delayed feedback schedule with Knowledge of Results (KR) feedback provided on 60% of trials after a 3-second delay, and a 5-second delay between provision of feedback and presentation of the next stimulus. Clinicians used a feedback sheet containing a visual of the three targeted features to refer to each term as they gave KR feedback to the child (e.g. “Nice and smooth [pointing to visual for segmentation], but sounds and stress weren’t right [pointing in turn to visuals for sounds and stress].”). Since not all children could read fluently, stimuli were presented auditorily by the clinician, with a three-second delay between the model and the child’s production in response to a “Go” signal. This study differed from previous work in that children completed 60 minutes of Pre-practice for the first two sessions and did not begin the Practice stage until the third day of treatment. Pre-practice lasted no more than 15 minutes in subsequent sessions. Per Warren et al. recommendations for standardized descriptions of treatment intensity,Citation43 the intervention duration in this study was four weeks, the dose frequency was once per day for four days per week, and the treatment dose was 100 practice trials per session, for a cumulative intervention intensity of approximately 1600 trials per child. Children also completed shortened experimental probes immediately prior to the first treatment session in the second, third and fourth week of treatment (data not reported here).

Table 2. Principles of motor learning

Equipment

All experimental probes and treatment sessions were recorded in a quiet room at 44.1 kHz with Samson XPD1 microphones, positioned 5 cm from child’s mouth. Recordings were saved in WAV format.

Dependent Measures

About ten percent of responses (519 of 4980 tokens) in baseline and experimental probes could not be analyzed due to technical issues or extraneous noise (e.g. recording failure, background noise, yelling or laughing during production) or errors in clinician models (e.g. transposing syllables).

Acoustic Measures

Acoustic analyses of pre- and post-treatment Treated Plosive, Untreated Plosive, and Real Word stimuli were completed by trained research assistants using Praat signal-processing software.Citation44 Segmentation was measured as intersegment duration (ISD), defined as the time from the last glottal pulse, as indicated by the end of F1 and F2 in the wideband spectrogram, to the onset of the plosive burst in the following syllable. For some items, an intersegment duration could not be marked due to either omission or severe distortion of the plosive consonant that made it impossible to distinguish start and end points of segments. In real words containing non-plosive sounds (e.g. bandanna, bicycle), intersegment duration was only measured preceding plosives.

Acoustic measures of lexical stress included duration (ms), maximum fundamental frequency (Hz), and maximum vocal intensity (dB) for the vowels of strong and weak plosive syllables in Treated Plosive, Untreated Plosive, and Real Word stimuli. Vowel duration was measured between the first and last glottal pulse of the vocalic nucleus, as indicated by energy extending through F1 and F2 displayed on the wideband spectrogram, and using fundamental frequency, formant, and intensity contours generated by the Praat software.Citation15,Citation45 Maximum intensity and fundamental frequency of each vowel were generated automatically using Praat algorithms. The pairwise variability index (PVI) of each variable was calculated using EquationEquation (1) to provide a normalized comparison of the strong and weak syllable in each stimulus:

(1) PVI = 100 x dk dk+1/dk+ dk+1/2(1)

where d is the duration of the kth syllable.Citation15,Citation46 A higher PVI value reflects increased contrast in lexical stress, whereas a PVI of zero indicates equal stress across syllables. PVI were calculated for duration (PVI(dur)), fundamental frequency (PVI(f0)), and intensity (PVI(I)).

Real words where either strong or weak syllables did not contain a plosive (e.g. pineapple, tomato) were not included in PVI analyses. In cases where children added an extra syllable, the syllables that best fit the intended stress pattern were analyzed. Stimuli in which the child did not repeat the intended stress target were excluded from analysis of PVI (1% of analyzed files, or 34 of 2902 tokens), as were stimuli where omission or devoicing of the weak syllable prevented measurement of vowel duration (approximately 25% of analyzed stimuli, or 692 of 2902 tokens). Syllable omission rates varied between participants, as well as by stimulus type and time point: weak syllable omission occurred on 32% of WS plosive pseudowords at baseline (range across children was 0–75%) and 16% at post-treatment (range 0–79%), and 15% of SW plosive pseudowords at pre-treatment (range 0–64%) and 1% at post-treatment (range 0–8%). No strong syllables were omitted, and omission rates were lower for real words than pseudowords. Omission rates for each participant are available in Supplementary Materials.

Perceptual Measures

Perceptual measures of accuracy on each of the three features of CAS (segmentation, equal stress, and speech sound distortions) in tokens from pre- and post-treatment experimental probes were scored by a group of eight trained graduate students and clinical faculty. Segmentation was scored based on the perception of increased segment or intersegment durations, or inter-syllable pauses. Accuracy on this criterion required fluent transitions between all syllables with the child matching the clinician’s production of the item, without perception of hesitation or pause. For scoring of lexical stress accuracy, productions were scored as incorrect for equal or reduced stress contrasts across syllables. Consistent with the acoustic analyses, scorers did not score this variable if the child omitted one of the first two syllables in a stimulus or if the child did not repeat the intended stress target for an item (i.e. produced a WS pseudoword instead of SW; again less than 1% of total files).

Speech sound accuracy was scored based on the presence or absence of consonant distortions – plosive sounds in Plosive and Real Word stimulus sets and fricative sounds in the Fricative stimulus set. Transposition errors were not counted as distortions if the substituted consonant was produced without distortion. Distorted substitutions were scored as errors (e.g., nasalization or spirantization of plosives). This measure differed from previous work (e.g. ref. Citation17) who included vowel production in their scoring of speech-sound accuracy. This change was made to explicitly compare plosive accuracy to the untreated fricative phonemes included for experimental control, although feedback on vowel accuracy was provided during intervention.

The eight perceptual scorers were all native English speakers with little prior experience with CAS. Scorers completed two training sessions with first and last authors prior to scoring where they reviewed examples of correct and incorrect productions in each feature and scored sample tokens. Scorers each scored at least one complete data set from one child, with some tokens included twice for intra-rater reliability calculations. All perceptual scorers were blinded to time point of samples and study hypotheses. Order of presentation was randomized within each stimulus set, with Fricative, Real Word, and combined Treated and Untreated Plosive stimulus sets scored separately. Scorers were instructed to listen to each sample twice for each of the three criteria, for a total of six times for each item, using personal headphones in a quiet room. In cases where a child added an extra syllable, scorers were instructed to score the syllables that best fit the target (e.g., score only the last three syllables if “giTAgibu” was produced for “TAgibu”).

Reliability

For acoustic measures, a randomly selected ~15% of each scorer’s samples were rescored by a second rater to calculate inter-rater reliability. Reliability was high for inter-rater comparison of both intersegment duration and vowel duration measures (ICC 0.945 and 0.948 respectively, p < .001, absolute agreement on single measures). The average point-to-point differences for intersegment and vowel duration measures were 11.1 ms (SD = 18.5 ms) and 11.9 ms (SD = 18.7 ms), respectively.

Each rater also re-measured these durations in a randomly selected ~15% of their samples for intra-rater comparisons. Average differences were 9.6 ms (SD = 19.3 ms) for intersegment duration and 9.5 ms (SD = 18.1 ms) for vowel duration. Intra-class correlation coefficients for the combined nine raters were also high (.955 and .958 respectively, p < .001, absolute agreement on single measures). Intra-rater and inter-rater reliability for acoustic durational measures are comparable to those reported in similar studies.Citation27,Citation47,Citation48

For perceptual measures, intra-rater reliability was calculated from a random 10% of each scorer’s total samples. Point-to-point agreement for intra-rater comparison was 82% for segmentation, 76% for stress, and 85% for distortion count. A second listener also rescored 10% of each scorer’s files to calculate inter-rater reliability. Point-to-point agreement on each measure was 77% for segmentation, 67% for stress, and 75% for distortions. Overall, Cohen’s κ values demonstrate moderate to substantial intra-rater agreement (Segmentation: κ = .60, p < .001; Stress: κ = .52, p < .001; Distortions: κ = .68, p < .001) and fair to moderate strength of agreement for inter-rater agreement (Segmentation: κ = .40, p < .001; Stress: κ = .34, p < .001; Distortions: κ = .41, p < .001).Citation49

Data Analysis

The strength of the association between acoustic and perceptual measures of segmentation and stress was calculated using Spearman correlations. SW and WS perceptual stress measures were collapsed onto a 3-point ordinal scale, where 1 was accurate WS stress, 2 was inaccurate stress production (i.e., equal stress across syllables), and 3 was accurate SW stress. Correlations were conducted for PVI measures of duration, intensity, and frequency. For segmentation, the binary perceptual measure of accuracy was compared with the maximum intersegment duration measure for each production. A significance level of 0.05 was set a priori for all statistical analyses.

Descriptive statistics and boxplots were generated for each acoustic variable (see Supplemental Materials), with SW and WS stress patterns treated as separate outcome measures. Based on visual inspection of data distribution followed by manual review of a sample of audio files and spectrograms, outliers greater than two standard deviations outside the mean for each participant were determined to represent uncharacteristic performance (e.g. child speaking with non-habitual voice, presence of background noise) and excluded from further statistical analyses (less than 4% of analyzed data for each variable). Outliers were removed separately for pseudo- and real words, due to significant differences in pre-treatment performance between these stimulus sets. An additional five real word PVI(dur) values (less than 1% of total real word PVI values) were manually removed by visual inspection alone, with consensus between the first and last author. In these cases, the two standard deviation criterion was not used as these cases had fewer than 20 data points due to high degrees of syllable omission. All statistical models were run both with and without outliers to confirm the effect of outlier removal for analytic transparency. Results for the analyses with outliers removed are reported here (see Supplementary Materials for a comparison of analyses with and without outliers).

Generalized linear mixed models (GLMM) with the participant as a random intercept were used to account for the clustering of repeated measures within subjects over time during our analyses of both baseline stability and treatment effect. GLMM offer a flexible statistical approach for assessing longitudinal data, and are accepted as the state-of-the art for modeling complex data structures (e.g., unbalanced designs, repeated measures within subjects).Citation50–52 This method ensures that standard errors are not underestimated, as can occur when Ordinary Least Squares (OLS) models are applied to clustered data structures.Citation53 GLMM enable the user to adjust for random or repeated variables and to specify different covariance structures.Citation54,Citation55 For the acoustic analyses, GLMM with a normal distribution and an identity link function were used (continuous outcome). For perceptual analyses, GLMM with a binomial distribution and a logit link function were used (dichotomous outcome).

For both acoustic and perceptual measures, baseline stability was first assessed using GLMM. No statistically significant differences across baseline time points were observed (p > .05) for any outcome measures. This demonstration of baseline stability, whether baselined over two weeks or six weeks, allowed for the pooling of data across all children to generate a single group average for the baseline time-point for the subsequent analyses of treatment effects.

For analyses of treatment effect and generalization, models initially included fixed effects for treatment phase (baseline, immediate post-treatment, and one-month retention) and stimulus set (treated plosive pseudowords, untreated plosive pseudowords, and untreated real words), and their interaction. Interaction terms were removed from final models if they were not statistically significant. Analyses for perceptual measures included calculation of odds ratios (OR), reflecting exponentiated beta coefficients, with 95% confidence intervals. Calculation of odds ratios for dichotomous outcomes is considered best practice in biostatistics, as it facilitates the interpretation of relative risk and offers an intuitive measure of effect size.Citation56 Post-hoc testing used Bonferroni adjustments for multiple comparisons.

Following the model-building approach recommended by Bryk & RaudenbushCitation51 and Burnham & Anderson,Citation57 we ran the models for each outcome with the random intercept for subject, main effects for Phase and Set, and the interaction of Phase*Set. Model fit was assessed using Akaike’s Information Criterion (AIC) and the Bayesian Information Criterion (BIC; see ref. Citation58) If the model without interactions produced the best fit, then interaction terms were removed from the final model (e.g. for the PVI models, ). Notably, we did not remove Phase and Set terms, even in cases where they were not statistically significant, because we felt that these were essential to the evaluation of treatment effect and generalization and warranted consistent adjustment across models. Cross-level interactions and random slopes are not included in any of the models. An unstructured covariance matrix for random effects was utilized.

Table 3 a. Summary of GLMM for a) intersegment duration b) PVI for duration (dur), fundamental frequency (f0), and intensity (I) in SW stimuli and c) PVI(dur), PVI(f0), and PVI(I) for WS stimuli. Tables show F-tests for main effects and interaction; β-value, standard error, and p-value, for phase (Baseline, Post-treatment, and 1-month Retention) and set (Treated and Untreated pseudoword and Real word) comparisons; ICC – the Intraclass Correlation Coefficient denotes the variability accounted for by the “between-subject” factor with respect to the overall variability in the model; and fit statistics: AIC – Akaike’s Information Criterion – An estimator of the of out-of- sample prediction error and the relative quality of statistical models for a given set of data. In estimating the amount of information lost by a model, AIC deals with the trade-off between the fit of the model and the simplicity of the model; AICC- The AIC Corrected can be used for smaller sample sizes; BIC – Bayesian Information Criterion – A criterion for model selection among a finite set of models. The model with a lower BIC is considered better. When fitting models, it is possible to increase the likelihood (sometimes reflected in a better fit) by adding parameters, although this may result in overfitting the model. BIC and AIC both attempt to resolve this problem by introducing a penalty term for the number of parameters in the model; the penalty term is larger in BIC than in AIC

Results

Segmentation

Acoustic Measures

For acoustic measures of intersegment duration, Generalized Linear Mixed Models (GLMM) with random intercepts for subject revealed a significant treatment phase effect (F(2, 5243) = 101.92, p< .001), stimulus set effect (F(2, 5242) = 15.17, p< .001), and treatment phase by set interaction (F(4, 5242) = 3.78, p= .005). Treatment phase and stimulus set comparisons are summarized in , with the sign preceding the β-value indicating the direction of the effects (e.g. a negative β-value indicates a decrease in intersegment duration between indicated treatment phases, or the desired treatment effect). Adjusted means for each stimulus set across the three treatment phases are also plotted in .

Figure 2. Mean intersegment duration for the combined group of 11 participants, with each of three stimulus sets (treated plosive pseudowords, untreated plosive pseudowords, and untreated real words) graphed separately across baseline, immediate post-treatment (Post), and one-month retention time-points. Errors bars show standard error

Figure 2. Mean intersegment duration for the combined group of 11 participants, with each of three stimulus sets (treated plosive pseudowords, untreated plosive pseudowords, and untreated real words) graphed separately across baseline, immediate post-treatment (Post), and one-month retention time-points. Errors bars show standard error

Bonferroni post-hoc comparisons revealed significant decreases in intersegment duration between baseline and both post-treatment time-points, controlling for set (p< .001). The significant treatment phase by set interaction was driven by significant differences between pseudoword and real word sets at baseline (p< .001); however, within each stimulus set, decreases in intersegment duration between baseline and retention were significant (p< .001), indicating maintenance of treatment effects across the retention period for treated and untreated pseudo- and real words.

Treatment phase and stimulus set comparisons are summarized in , with the sign preceding the β-value indicating the direction of the effects (e.g. a negative β-value indicates a decrease between indicated treatment phases). Adjusted group means for each stimulus set across the three treatment phases are also shown in .

Perceptual Measures

GLMM with random intercepts for subject revealed a significant phase effect for perceptual measures of segmentation (F(2, 3351) = 27.65, p< .001), with odds ratios (OR) of 1.985 (95% CI 1.615–2.440, p<.001) at retention. This illustrates that, controlling for stimulus set, participants at retention were roughly twice as likely to show favorable outcome scores (i.e. produce stimuli with fluent transitions between syllables) as compared to baseline. There was also a significant effect for stimulus set (F(2, 3351) = 18.13, p< .001), driven by more accurate performance on real words (OR of 1.654, 95% CI 1.365–2.004, p< .001, compared to treated pseudowords as referent). Odds ratios for performance on all perceptual variables at each post-treatment time-point, as compared to baseline performance, are summarized in . Odds ratios are also presented visually in . An OR greater than 1.0 (with a 95% CI that does not overlap 1.0) indicates a statistically significant increase in the outcome relative to baseline, while an OR less than 1.0 (with a 95% CI that does not overlap 1.0) indicates a statistically significant decrease in the outcome measure relative to baseline.

Table 4. Summary of perceptual outcome measures. Odds ratios, 95% confidence intervals, standard error, and p-values are shown for each phase comparison (Baseline, Post-treatment, and 1-month Retention) and set comparison (Treated and Untreated Plosives and Real Word)

Figure 3. Odds ratios for combined 11 participants for each perceptual variable, showing the likelihood of accuracy in each variable at immediate post-treatment and one-month retention time points as compared to performance at baseline. Error bars show 95% confidence intervals

Figure 3. Odds ratios for combined 11 participants for each perceptual variable, showing the likelihood of accuracy in each variable at immediate post-treatment and one-month retention time points as compared to performance at baseline. Error bars show 95% confidence intervals

Lexical Stress

Durational Contrast Acoustic Measures

GLMM revealed a significant treatment phase effect for PVI(dur) for both SW and WS stimuli (SW: F(2, 1149) = 26.03, p< .001; WS: F(2, 906) = 11.20, p< .001). There was a significant effect for stimulus set for WS stimuli only (SW: F(2, 1147) = 0.47, p= .63; WS: F(2, 905) = 15.21, p< .001). Bonferroni post-hoc comparisons revealed significant improvements in durational contrasts (PVI(dur)) between baseline and post-treatment time points, controlling for set (post-treatment: p< .001 for both stress patterns; retention: SW, p< .001; WS, p= .008). There were no significant differences between post-treatment and retention time points (SW: p= 1.00; WS: p= .68), indicating maintenace of treatment effects across the retention period. Treatment phase and stimulus set comparisons are summarized in , for SW and WS stimuli respectively. Again, the sign preceding the β-value indicating the direction of the effects (e.g. a positive β-value indicates a more positive PVI following treatment, the desired treatment effect for SW stimuli; a negative β-value indicates a more negative PVI, or the desired treatment effect for WS stimuli). Adjusted means for PVI(dur) for both SW and WS stimuli are plotted in .

Figure 4. Mean pairwise variability indices of strong and weak vowel duration contrasts (PVI(dur), top panel), maximum fundamental frequency contrasts (PVI(f0), middle panel), and maximum intensity contrasts (PVI(I), bottom panel), for the combined group of 11 participants. Each stimulus set (treated plosive pseudowords, untreated plosive pseudowords, and untreated real words) is graphed separately across baseline, immediate post-treatment (Post), and one-month retention time points. Means for strong-weak (SW; positive values) and weak-strong (WS; negative values) stimuli are graphed separately for each variable, with error bars showing standard error

Figure 4. Mean pairwise variability indices of strong and weak vowel duration contrasts (PVI(dur), top panel), maximum fundamental frequency contrasts (PVI(f0), middle panel), and maximum intensity contrasts (PVI(I), bottom panel), for the combined group of 11 participants. Each stimulus set (treated plosive pseudowords, untreated plosive pseudowords, and untreated real words) is graphed separately across baseline, immediate post-treatment (Post), and one-month retention time points. Means for strong-weak (SW; positive values) and weak-strong (WS; negative values) stimuli are graphed separately for each variable, with error bars showing standard error

Acoustic Measures of Pitch and Volume Contrasts

Treatment phase was also significant for PVI(f0) and PVI(I) variables for both stress patterns, although counter to the hypothesized direction (i.e. children demonstrated decreased pitch and volume contrasts post-treatment, see ; PVI(f0) SW: F(2, 1129) = 6.87, p= .001; PVI(I) SW: F(2, 1143) = 5.35, p= .005; PVI(f0) WS: F(2, 906) = 8.74, p< .001; PVI(I) WS: F(2, 915) = 7.50, p= .001). Stimulus set was significant for PVI(I) SW and PVI(f0) WS variables (PVI(I) SW: F(2, 1141) = 10.79, p< .001; PVI(f0) WS: F(2, 905) = 8.04, p< .001). Treatment phase and stimulus set comparisons for pitch and intensity contrasts are summarized in . Re-analysis of outcome measures with and without outliers revealed some changes [in post-hoc comparisons] for PVI(f0), which are detailed in Supplemental Materials.

Perceptual Measures of Stress

GLMM revealed a significant treatment phase effect for perceptual stress accuracy for both SW and WS patterns (SW: F(2, 1510) = 8.29, p< .001; WS: F(2, 1363) = 10.00, p< .001), with OR of 1.847 (95% CI 1.337–2.552, p< .001) and 1.835 (95% CI 1.331–2.531, p< .001) at one-month retention timepoint for SW and WS stress patterns, respectively. This illustrates that, controlling for stimulus set, participants were significantly more likely to produce stimuli with accurate lexical stress at retention than at baseline. ORs for both immediate post-treatment and one-month retention time points, as compared to baseline, are again summarized in and shown visually in . Stimulus set was not significant for either stress pattern (SW: F(2, 1510) = 1.33, p= .27; WS: F(2, 1363) = 1.06, p= .35), indicating no significant differences in performance on treated and untreated stimulus sets.

Distortions

Perceptual Measures

GLMM revealed a significant treatment phase effect for perceptual measures of distorted plosive sounds (F(2, 3346) = 35.09, p< .001) with OR of 1.940 (95% CI 1.567–2.401, p< .001), indicating that subjects were roughly twice as likely to produce stimuli with no plosive distortions at the one-month retention timepoint compared to baseline. There was also a significant effect for set (F(2, 3346) = 49.58, p < .001). OR calculations revealed this difference was due to higher accuracy on real words as compared to the treated psuedoword stimuli, with OR of 2.272 (95% CI 1.872-2.758, p< .001).

A separate GLMM for perceptual measures of distortions in the fricative stimulus set revealed no significant phase effect (F(2, 1069) = 1.27, p= .28), with OR of 1.431 (95% CI 0.915–2.239, p= .12) at the one-month retention timepoint. This indicates that there was no significant change in number of distorted fricative pseudwords across treatment phases, or that children were as likely to produce fricative distortions at baseline and post-treatment timepoints. OR for both plosive and fricative sounds are plotted in and summarized in .

Acoustic-Perceptual Correlations

Spearman correlation analysis revealed a significant correlation between acoustic and perceptual measures of segmentation (rs = −0.445, p < .001). The correlations between acoustic (i.e., PVI) and perceptual measures of stress accuracy were also moderate to strong and highly significant (PVI(dur): rs = 0.702, p < .001; PVI(f0): rs = 0.280, p < .001; PVI(I): rs = 0.335, p < .001).

Discussion

This study investigated the efficacy of TEMPOSM in improving acoustic measures of segmentation and stress contrasts in a group of eleven children with CAS. Critically, participants demonstrated stable performance across baselines on all acoustic and perceptual outcome measures, which allowed for examination of treatment effects. Results replicate previously reported treatment effects, with evidence of generalization and retention of improvements across both acoustic and perceptual variables. This work adds to the growing literature on the positive effects of this treatment approach in remediation of the primary features of CAS,Citation15–18 and provides additional support for the application of PML in apraxia of speech intervention (for a review, see ref. Citation59–62). Details of principles critical to successful learning (i.e., generalization and retention) are summarized in the methods and include intensive, high-frequency, randomized practice and delayed, reduced frequency, knowledge of results feedback.

The novel use of an acoustic measure of intersegment duration substantiates the efficacy of this intervention in reducing segmentation of speech, a prominent feature of CAS. The acoustic measures of segmentation and stress used in this study, both of which were significantly correlated with perceptual ratings, provide a more sensitive measure of change than binary perceptual measures of prosodic accuracy. Changes in acoustic measures can precede changes in perceptual measures and capture smaller increments of improvement than may be perceptually measurable. In addition, acoustic measures provide more fine-tuned detail that allows for provision of an underlying explanation of perceptual measures. Therefore, these improvements in specific acoustic variables provide strong evidence to support the efficacy of TEMPOSM as an intervention for CAS. Ultimately, the development and clinical implementation of semi-automated acoustic measurements will allow for more accurate, sensitive measures of the speech of individuals with motor speech disorders in future clinical practice.

This study was also unique in the inclusion of four children with below average receptive language scores (range 69–80 Receptive Language Index, CELF-5). These participants all demonstrated substantial improvements across the three features of the disorder, with generalization and maintenance of treatment effects (see Supplemental Material for individual data). Evidence that children with language impairments can successfully engage in an intensive speech motor programming treatment suggests a broad applicability of TEMPOSM to a large number of children with co-occurring CAS. It may be that the use of nonsense syllables minimizes the demands placed on the language system.

Improvements in Segmentation

Acoustic measures of intersegment duration showed reduced segmentation following treatment, consistent with our finding of perceptually “smoother” speech post-treatment with fluent transitions between syllables. Combined, these acoustic and perceptual findings demonstrate that TEMPOSM improves speech motor programming skill in children with CAS. Children with CAS have a deficit in the programming and storage of motor units in INT prior to speech initiation (see ref. Citation6); consequently, speech production for these children is limited as they can only hold shorter and less complex motor programs in this impaired buffer prior to execution, resulting in segmentation of speech. The use of pseudowords in TEMPOSM simulates novel word learning processes to allow for isolated and targeted focus on the motor programming system. Critically, practice of novel syllable strings trains the reorganization of multiple speech units into a single motor program, as it is this concatenation process that is hypothesized to be impaired in apraxia of speech.Citation6 As speech motor learning occurs during treatment, shorter segments – such as gi, ta, and bu – are combined into a multi-syllable unit such as gitabu. Significant decreases in perceptual and acoustic measures of segmentation in both treated and untreated stimuli provide empirical support for the hypothesized mechanism of action of the treatment in establishing more efficient organization of speech motor programs. The observed generalization also provides strong support for the hypothesis that treatments that successfully target the underlying mechanism improve overall speech production, beyond the specific stimuli targeted in treatment sessions.

Improvements in Stress Contrasts

Participants demonstrated significant improvements in stress contrasts for both SW and WS stress patterns, as indexed by acoustic PVI(dur) measures. These improvements were also observed in secondary perceptual measures of stress accuracy. Our findings of stronger correlations between perceptual ratings of stress accuracy and PVI(dur) corroborate previous findings that durational contrasts play a dominant role in the perception of atypical stress production in children with CAS.Citation15 Unexpectedly, children overall demonstrated significantly reduced contrast for PVI(f0) and PVI(I) measures following treatment despite significant improvements in perceptual and durational measures of stress. This finding differs from previous work by Ballard et al. where changes in durational contrasts following this intervention were accompanied by improved intensity and pitch contrasts.Citation15 This discrepancy is likely due to the high degree of variability in the presentation of the disorder across participants, the complex interaction between these three prosodic features in the perception of accurate stress production, and the influence of the specific elicitation task used (see ref. Citation47 for a discussion of task effects). Children with CAS may compensate for a limited motor programming buffer capacity through strategic trade-offs in resource allocation, as has been demonstrated in both typical children and adults with acquired apraxia.Citation63,Citation64 This strategic allocation of resources explains the significant variability in children with CAS, as children may differ in their attention to specific features of speech (e.g. segmental versus prosodic accuracy), or even in their focus on a particular feature in each production of a word. Many of the children in this study appeared to compensate for difficulty controlling durational contrasts with an over-reliance on frequency and intensity to mark stress at baseline, compared to typical speakers.Citation47,Citation65 The decreased pitch and volume contrasts following treatment may reflect the normalization of their stress production to use duration as the primary indicator of stress.

Overall, participants demonstrated more difficulty with production of WS stress contrasts, consistent with later development of this stress pattern in typical English-speaking children. Production of SW patterns is typically mastered around age three, while production of WS stress contrasts continues to develop through at least age eleven.Citation47,Citation65 The increased difficulty of WS words may reflect a trochaic bias resulting from increased exposure to the SW stress pattern in English or the increased physiological demands required for production of WS stimuli. The WS pattern may be particularly difficult for children with CAS because it requires more dramatic durational contrasts, with reported PVI(dur) for typical adults up to twice that for the SW pattern.Citation47 In fact, PVI(dur) measures in WS words have been proposed as a diagnostic criterion for acquired apraxia of speech, as this stress pattern is more sensitive to subtle impairments in control of temporal contrasts.Citation48 This is logical because apraxia, in both childhood and acquired forms, is a disruption in temporal control of speech marked by increased duration and reduced variability in duration of speech segments. Therefore, PVI(dur) in WS words may also be the most sensitive indicator of treatment of prosodic deficits in CAS.

Participants in this study demonstrated considerable difficulty producing all syllables in WS stimuli at baseline, with weak syllable omission in up to 75% of pseudoword stimuli for some participants at baseline (see Supplemental Material for individual data). This mimics the progression, albeit delayed, of typical speakers. Toddlers frequently omit unstressed syllables, with a shift around age three toward equal stress across syllables as weak syllable deletion decreases.Citation66,Citation67 Further analysis of the frequency of syllable omission might be a better initial indicator of progress for some children since inclusion of the weak syllable is a prerequisite to production of appropriate durational contrasts. Children with CAS likely require more than four weeks of intervention to reach typical performance for the increased durational contrasts necessary for WS stimuli.

Improvements in Speech Sound Accuracy

Perceptual results supported our hypothesis that treatment of plosive pseudowords would result in reduced distortions of plosive sounds in both treated and untreated stimuli, but that these improvements would not generalize to a set of untreated fricative pseudowords. This untreated fricative stimulus set was employed to provide additional experimental control, as previous treatment studies of adults with acquired apraxia of speech have shown generalization within phonemes of same manner of production, but not to phonemes of other manners of production.Citation68–71 Within the framework of Schema Theory, phonemes of the same manner have been hypothesized to share a single Generalized Motor Program (GMP), which governs the general muscle tension and force pattern for that speech movement; movement parameters are then varied to define the specific muscle groups involved (i.e. the location of articulation).Citation68,Citation72 In addition to establishing experimental control, the stable performance on the untreated fricative set reported here supports the use of Schema Theory to conceptualize speech motor programming with implications for the selection of treatment stimuli in clinical practice.

Limitations and Future Directions

The acoustic measures reported herein provide evidence of the efficacy of the intervention in remediating specific suprasegmental deficits, with evidence of increased syllable stress contrasts and reduced inter-syllable pauses (segmentation of speech) following intervention. Future work would benefit from an expanded set of outcome measures to include an acoustic measure of speech-sound accuracy, such as voice onset time or vowel formants, to provide sensitive measures of voicing or vowel distortions. An additional limitation of this work is the reliability of perceptual measures, especially judgments of stress accuracy. Future work should include more rigorous training of perceptual scorers and obtain scores for each child from multiple raters to mitigate against inter-rater variability in the perception of these speech features. Additionally, it may be preferable to apply a five-point perceptual scale like that used by Ballard et al.,Citation15 particularly to improve reliability for those items that fall close to the perceptual boundaries between correct and incorrect productions and capture smaller increments of change following treatment. Regardless, the perceptual results support the corresponding acoustic results, which themselves showed strong inter-rater reliability, and attest to the fact that change in various measures of the children’s speech is occurring.

This study was also limited by the relatively small and homogenous group of participants, and therefore, may not be appropriate for all children with CAS. Nevertheless, combined with previous studies, there is now a strong body of work demonstrating the efficacy of this approach in the treatment of CAS. Future steps should include a randomized clinical trial to determine the factors that influence individual response to treatment and establish ideal dosage and treatment intensity to achieve robust generalization to spontaneous connected speech. In addition, future studies should investigate the efficacy of subsequent courses of TEMPOSM, as most children with CAS will require more than a single course of intervention to normalize speech motor programming capabilities.

Another crucial next step in CAS research is the use of neuroimaging to establish the efficacy of treatments in normalizing brain networks in children with CAS. Neuroimaging studies of apraxia have primarily focused on adults with the acquired form of the disorder, but there remains a critical need to establish differences in brain networks in children with CAS and determine the specific neural systems underlying the disorder. Neuroimaging work should also characterize how the treatment induces neural plasticity ultimately allowing for its optimization.

Conclusions

Acoustic measurements demonstrate overall reduced segmentation and improved stress contrasts for eleven children following four weeks of intensive TEMPOSM intervention, with generalization to untreated syllable strings and real words and retention of these treatment effects one-month post-treatment. These results replicate and extend previous studies of this approach and support its efficacy in treatment of CAS.

Supplemental material

Supplemental Material

Download MS Word (111.2 KB)

Acknowledgments

The authors thank the children and families who participated; graduate clinicians Roisin Bermingham, Kathy-Ann Murray, Rachel Platt, Megan Rounds, Emily Schultz, and Savanah Snook; Jill Thorson and Bill Rogers for assistance with programs for acoustic and perceptual analyses; IMPROV Lab research assistants who completed acoustic analyses; and our perceptual scorers.

Disclosure Statement

The authors report no conflicts of interest. Portions of this work were presented at the 2018 Conference on Motor Speech in Savannah, Georgia, and the 2018 American Speech-Language Hearing Association Convention in Boston, Massachusetts, as well as forming part of the first author’s Master’s thesis at the University of New Hampshire.

Supplementary Material

Supplemental data for this article can be accessed on the publisher’s website.

References