1,829
Views
0
CrossRef citations to date
0
Altmetric
Neurology

Trajectories of arm recovery early after stroke: an exploratory study using latent class growth analysis

ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Pages 253-265 | Received 05 Feb 2021, Accepted 09 Dec 2022, Published online: 03 Jan 2023

Abstract

Aim

To investigate trajectories of recovery of motor arm function after stroke during inpatient rehabilitation.

Materials and methods

Data were available from 74 consecutively-admitted stroke survivors receiving inpatient rehabilitation from an inception cohort study. Heterogeneity of arm recovery in the first 4-weeks was investigated using latent class analysis and weekly Box and Block Test (BBT) scores. Optimal number of clusters were determined; characterised and cluster associated factors explored.

Results

A 4-cluster model was identified, including 19 participants with low baseline arm function and minimal recovery (‘LOWstart/LOWprogress’, 26%), 15 with moderate function and low recovery (‘MODstart/LOWprogress’, 20%), 15 with low function and high recovery (‘LOWstart/HIGHprogress’, 20%), and 25 with moderate function and recovery (‘MODstart/MODprogress’, 34%). Compared to LOWstart/LOWprogress: LOWstart/HIGHprogress presented earlier post-stroke (β, 95%CI) (−4.81 days, −8.94 to −0.69); MODstart/MODprogress had lower modified Rankin Scale scores (−0.74, −1.15 to −0.32); and MODstart/LOWprogress, LOWstart/HIGHprogress and MODstart/MODprogress had higher admission BBT (23.58, 18.82 to 28.34; 4.85, 0.85 to 9.61; 28.02, 23.82 to 32.21), Upper Limb-Motor Assessment Scale (9.60, 7.24 to 11.97; 3.34, 0.97 to 5.70; 10.86, 8.77 to 12.94), Action Research Arm Test (31.09, 22.86 to 39.33; 12.69, 4.46 to 20.93; 38.01, 30.76 to 45.27), and Manual Muscle Test scores (10.64, 7.07 to 14.21; 6.24, 2.67 to 9.81; 11.87, 8.72 to 15.01).

Conclusions

We found unique patterns of arm recovery with distinct characteristics for each cluster. Better understanding of patterns of arm recovery can guide future models and intervention development.

    KEY MESSAGES

  • Arm recovery early after stroke follows four distinct trajectories that relate to time post stroke, initial stroke severity and baseline level of motor arm function.

  • Identification of recovery patterns gives insight into the uniqueness of individual’s recovery.

  • This study offers a novel approach on which to build and develop future models of arm recovery.

Introduction

Motor arm impairment is common after stroke, affecting between 50% and 70% of people in the first week [Citation1–3]. In Australia the median hospital length of stay is 5 days in acute wards and 22 days in rehabilitation wards [Citation4], giving a combined hospital length of stay of approximately one month. Making predictions about arm function recovery during the first month after stroke is a challenge for clinicians, and creates uncertainty for stroke survivors. While there is a growing body of research, more still needs to be understood about arm recovery early after stroke.

Systematic reviews have identified a number of clinical and neurological biomarkers (i.e. neuroimaging and/or neurophysiological measures of brain) that are associated with reduced or increased arm recovery post-stroke [Citation5–7]. Clinical factors include low levels of baseline arm impairment, higher baseline arm function, and lower levels of leg impairment [Citation7]. Neurological biomarkers include various measures of corticospinal integrity using Transcranial Magnetic Stimulation, Diffusor Tensor Imaging or Magnetic Resonance Imaging [Citation5,Citation7]. A small number of arm recovery models have been proposed using clinical and neurological biomarkers. These models include SAFE (Shoulder Abduction-Finger Extension) [Citation8], and Proportional Recovery [Citation9], which both include clinical factors found to be predictive of arm recovery. The PREP and PREP2 models (predicting recovery potential) include both clinical factors and neurological biomarkers found to be associated with arm recovery [Citation10,Citation11].

There are limitations with these three arm recovery models. First, some studies used small (up to 41 participants) [Citation9,Citation11] or non-consecutive samples [Citation8]. Second, a significant proportion of cases deemed to be outliers were excluded from analysis during the modelling process [Citation9]. Third, the studies included specific stroke types, such as ischaemic [Citation8,Citation9] or anterior circulation strokes [Citation11], excluding participants with other stroke types (for example haemorrhagic stroke) who also experience arm impairment. Fourth, participants with significant cognitive and/or language impairment were typically excluded; these participants represent an important sub-group of stroke survivors [Citation8,Citation11]. Fifth, the measurement of outcomes occurred at limited time points, such as 3- and 6-months post-stroke, providing a limited picture of arm recovery over time. Finally, many models use population averaged recovery which may not adequately reflect an individual’s course. These limitations raise questions about whether the models are representative of and applicable to, the broader stroke population [Citation5–7].

Studies suggest that inclusion of neurological biomarkers improves the predictive ability of arm recovery models [Citation10,Citation11]. However, there is a lack of robust evidence that models with neurological biomarkers are statistically superior in their predictive ability to models with clinical factors alone [Citation12]. Many neurological biomarkers involve expensive investigative procedures (e.g. MRI) and/or cannot be completed routinely with all hospitalised stroke survivors. Transcranial magnetic stimulation (TMS) is used in PREP and PREP2 to assess the presence of motor evoked potentials (MEPs), however many rehabilitation therapists are not trained in TMS and do not have the necessary equipment to complete the TMS assessment.

With the exception of PREP and PREP2, previous models have not accounted for heterogeneity in arm recovery post-stroke. Identification of unique recovery trajectories is likely to be an important factor for (i) developing better models of arm recovery and (ii) developing better interventions. Latent class growth analysis (LCGA) is one method to identify meaningful subgroups or classes of individuals that are homogenous amongst a heterogenous population [Citation13] and has been applied in psychological [Citation14] and musculoskeletal research [Citation15]. Latent class growth analysis classifies participants into a finite number of clusters based on recovery trajectory. Participants within each cluster can then be described and compared to other clusters using baseline characteristics.

Latent class growth analysis is being increasingly used in stroke research to investigate trajectories of quality of life [Citation16], functional recovery [Citation17], care needs [Citation18], psychological recovery [Citation19], and post-stroke fatigue [Citation20]. Only one known study to date has used LCGA to predict trajectories of arm recovery in the first 12 months after stroke using elbow kinetic and kinematic measures [Citation21]. They found 2 classes of arm recovery where in the first cluster, participants had low baseline arm function (indicated by a low Fugl-Myer score) a rapid improvement before tapering off gradually [Citation21]. In the second cluster, participants had high baseline arm function which remained constant or increased slightly [Citation21]. The advantages of LCGA over other statistical methods include the ability to explain heterogeneity in longitudinal data, handle missing data points and outliers without their exclusion [Citation22].

We used LCGA to explore trajectories of arm recovery early after stroke (in the first month) during inpatient rehabilitation. Trajectories were explored using an outcome measure which could be easily assessed at the bedside (i.e. Box and Block Test) measured at more frequent intervals (i.e. weekly).

The aims of this study were to identify trajectories of arm recovery in stroke survivors receiving inpatient rehabilitation in the first month after stroke, and explore characteristics associated with cluster membership. We hypothesised that cluster membership would be associated with clinical and demographic characteristics measured at baseline/early after admission.

Materials and methods

Data were obtained from an inception cohort study briefly described below and in detail elsewhere [Citation23].

Study data

The original inception cohort study was conducted in comprehensive stroke unit in Sydney, Australia [Citation23]. Acute and rehabilitation beds are co-located on the unit, with stroke survivors commencing rehabilitation early after admission [Citation23]. In this setting, participants were able to be observed throughout their entire hospital stay. A consecutive cohort of eligible stroke survivors were approached on admission to the unit and written consent obtained to participate in the study [Citation23]. The study was approved by the local health district ethics committee (LNR 11 L/144) [Citation23]. Participants included in the study had a confirmed diagnosis of stroke, stroke related arm impairment, a score of <18 on the Upper Limb-Motor Assessment Scale (Items 6, 7 and 8) on admission and a score of ≥3 on the Modified Rankin Scale [Citation23]. Those unable to participate in an arm rehabilitation program (i.e. medically unstable) or lacked sufficient cognition and communication (based on a score of less than 4/6 on the Six Item Screener [Citation24]) were excluded from the study [Citation23].

Baseline variables were collected early after admission via physical assessment or from the medical record. Arm measures completed on admission and discharge included the Upper Limb-Motor Assessment Scale (UL-MAS), Manual Muscle Test (MMT), Action Research (ARAT) Arm Test and Self-Care items on the Functional Independence Measure (FIM). The primary outcome measure was the Box and Block Test (BBT) which was completed weekly over each participant’s hospital stay. The BBT, UL-MAS and ARAT were used to detect change in motor arm function, MMT was used to detect changes in muscle strength and FIM-Self Care was used to detect change in functional activities involving arm use.

Repetitions of arm practice were counted and recorded prospectively by the stroke participant or treating therapist. Data about the amount and repetitions of arm practice completed during each participants hospital stay were collected prospectively from their medical record. A study had previously been conducted in the same setting showing that therapists can identify rehabilitation inpatients who were capable of accurately counting repetitions of practice completed [Citation25]. All participants completed task-specific arm retraining as part of their usual rehabilitation program, consistent with Australian guideline recommendations [Citation26]. Detailed information on the amount (repetitions and sessions completed), type (arm group, physiotherapy group, dressing group, one-to-one occupational therapy and independent practice) and examples of arm practice completed are reported elsewhere [Citation23].

Analysed participant data

Our primary outcome was motor arm recovery measured by the Box and Block Test, collected on admission and after 1, 2, 3 and 4 weeks unless the person had been discharged. Variables collected via physical assessment or from the medical record early after admission and analysed, included age (years), gender (male versus female), type of stroke (infarct versus haemorrhage), side of stroke (right versus left), co-morbidities (Charlson Comorbidity Index [Citation27] of ≥1), stroke severity (modified Rankin Scale [Citation28]), admission Upper Limb-Motor Assessment Scale score [Citation29,Citation30], admission Manual Muscle Test score [Citation31–33], admission Box and Block Test score [Citation34] and presence of spasticity (admission Tardieu Scale score of ≥2) [Citation35,Citation36]. Data on the dose (repetitions) of arm practice completed in the first week of admission collected as part of the inception cohort study were also used in the analysis. Some clinical variables were selected because they are known to be associated with arm recovery, for example, age, stroke severity and initial arm impairment/function [Citation7,Citation12]. We hypothesised spasticity to be associated with reduced arm recovery. Several systematic reviews and meta-analyses have shown that increasing the dose of rehabilitation can lead to better outcomes after stroke [Citation37–39]. We therefore hypothesised that increased arm practice dose (repetitions) would to be associated with greater arm recovery and predict cluster membership.

Data analysis

Statistical analyses were conducted to [1] identify unique trajectories of arm recovery and [2] characterise cluster membership using variables collected at baseline/soon after admission. Statistical analyses were completed using Mplus version 8 (Muthén & Muthén, California, United States), Excel 2010 (Microsoft, New Mexico, United States) and Stata v16.1 (StataCorp, Texas, United States). The main assumption in our analysis was that a number of distinct unobserved trajectory subgroups (clusters) for arm recovery exist, indicated by the patterns of Box and Block Test scores measured across five time points. Latent class growth analysis (LCGA) was used to identify unique clusters which explain heterogeneity in the data over time, then assign participants to a single cluster based on their probability of belonging. Regression analyses compared characteristics of participants assigned to each cluster against the cluster with the lowest progression. An alternative latent variable modelling approach (growth mixture modelling) was also considered, but demonstrated poorer model fit [Citation13,Citation40]. Guidelines for Reporting on Latent Trajectory Studies (GRoLTS) were followed for model selection [Citation41] (Appendix 1).

Arm recovery was modelled over time using weekly Box and Block Test scores collected during the first month of admission on the stroke unit (baseline, weeks 1, 2, 3 and 4 weeks). Box and Block Test scores had a non-normal (heavy-tailed) distribution so were grouped into quartiles for each time point. This was done to decrease the potential for misclassification of participants in the latent model [Citation42,Citation43], and has been used previously [Citation44–46]. We examined the pattern of missing Box and Block Test scores at each time-point, to find that missingness was independent of important observed factors such as age, gender, days post stroke, side of stroke, spasticity, and admission modified Rankin Scale [Citation47].

One to six-cluster models were tested, with the optimum number of clusters determined using both data driven (goodness of fit indices) and pragmatic criteria (model parsimony, model interpretability). Goodness of fit indices included: Akaike’s information criterion (AIC), Bayesian information criterion (BIC), and sample size-adjusted BIC (ssBIC) with lower scores indicating a better fitting model [Citation13]. Likelihood ratio tests (Vuong-Lo-Mendell-Rubin Likelihood Ratio Test, Lo-Mendell-Rubin Adjusted Ratio Test) determined if a model with k clusters was favoured over the model with k-1 clusters [Citation48,Citation49]. Model pairs were tested until no further improvement in model fit occurred (p-value ≥0.05). Pragmatic fit criteria included acceptable entropy (>0.8) [Citation50], high probability of belonging to each cluster (≥0.9), and potential clinical interpretability which included a minimal cluster size of 10% (7 participants) [Citation51,Citation52]. A post-hoc sensitivity analysis was conducted to assess integrity of the final model (e.g. changes to cluster membership, model fit) when outliers were removed from the model (n = 4 participants who sustained an extension of their stroke during the first four weeks of admission). We define a stroke extension as a period of improvement or stabilization for at least 24 h after the initial stroke, followed by a subsequent lesion extension and neurological worsening [Citation53].

Comparison of characteristics between trajectories

Once the optimal number of clusters and membership to each cluster was established, regression analyses were conducted to determine any difference in participant characteristics across the four clusters by comparison to a reference cluster (identified as having the lowest progression based on Box and Block Test change scores). Baseline variables tested included age (in years), male gender, lesioned hemisphere, stroke sub type (ischaemic versus haemorrhagic), co-morbidity, admission Upper Limb-Motor Assessment Scale score, admission Manual Muscle Test score, admission Box and Block Test score, as well as the presence of spasticity. Other variables collected during admission included length of stay and repetitions of arm exercise completed in the first week. Weekly repetitions of arm exercise were also plotted against Box and Block test scores for individuals in each cluster using bubble plot graphs.

Results

Participant characteristics

Box and Block Test scores were available for 100% (n = 74) of participants at baseline, 100% (n = 74) at 1 week, 84% (n = 61) at 2 weeks, 58% (n = 43) at 3 weeks and 43% (n = 32) at 4 weeks. Missing Box and Block Test data were due to participants being discharged during the study period. One participant died prior to week 4. The pattern of missing data satisfied the missing at random (MAR) assumption (n = 74, χ2 distance = 66, d.f.= 63, p = 0.382), so were unlikely to bias model estimation [Citation54].

Participant baseline characteristics are presented in . The average age was 75.1 years (standard deviation (SD) 12.5). Participants had a moderately severe level of disability as indicated by a median modified Rankin Scale score of 4.5 (IQR, 1.0). Participants’ stroke unit length of stay ranged from 3 to 124 days, with an average of 34.1 days (SD 27.5, median 23.0, IQR 36.5). presents participant characteristics by cluster for the final (4-cluster) model.

Table 1. Characteristics of study participants (n = 74).

Table 2. Characteristics for each cluster in the 4-cluster model and comparisons between clusters.

Latent class growth analysis (selection of ideal number of clusters)

The 1-cluster to 6-cluster goodness of fit indices are presented in . Linear growth curves were fitted for all models. Higher order curves (e.g. quadratic) were not considered given that some participants were discharged prior to week 4. When all models were considered, the 4-cluster LCGA model was chosen based on the best combination of fit indices (AIC = 547.3, ssBIC = 536.3), likelihood ratio tests (VLMR LRT p = 0.101, LMR LRT p = 0.116), high average posterior probability of belonging (range 0.92–0.99), and high entropy (0.86). In addition, the 4-cluster model was the most parsimonious solution to describe (potentially) clinically distinct trajectories. Each cluster was labelled based on the start and progress scores on the Box and Block Test. shows the proportion of participants’ Box and Block Test score within each quartile (0–25%, 26–50%, 51–75%, 76–100%) for each time point in the 4-cluster model.

Figure 1. Proportion of subjects with Box and Block Test scores stratified by quartile (Q1 - Q4) at each time point for each of the 4 clusters. • The KEY displays BBT scores stratified by quartiles for the whole cohort, at each time point. • The stacked columns represent the proportion of subjects within each cluster who scored within a particular quartile for each week. For example, a high proportion of subjects assigned to Cluster 1 had BBT scores in the lowest quartile (Q1) for all weeks.

Figure 1. Proportion of subjects with Box and Block Test scores stratified by quartile (Q1 - Q4) at each time point for each of the 4 clusters. • The KEY displays BBT scores stratified by quartiles for the whole cohort, at each time point. • The stacked columns represent the proportion of subjects within each cluster who scored within a particular quartile for each week. For example, a high proportion of subjects assigned to Cluster 1 had BBT scores in the lowest quartile (Q1) for all weeks.

Table 3. Goodness of fit criteria for 2-cluster to 6-cluster models.

Cluster 1 comprised 19 participants (26%) with low baseline arm function who showed little/no overall recovery over four weeks (‘LOWstart/LOWprogress’). Cluster 2 comprised 15 participants (20%) with moderate baseline arm function who also showed minimal recovery (‘MODstart/LOWprogress’). Cluster 3 comprised 15 participants (20%) with low baseline arm function who showed a high level of recovery (‘LOWstart/HIGHprogress’). Cluster 4 comprised 25 participants (34%) with moderate arm function who showed a moderate level of recovery (‘MODstart/MODprogress’) ().

Figure 2. Weekly Box and Block Test scores by cluster. Light grey lines represent individual trajectories. Blue lines represent averaged trajectory per cluster.

Figure 2. Weekly Box and Block Test scores by cluster. Light grey lines represent individual trajectories. Blue lines represent averaged trajectory per cluster.

Sensitivity analysis was conducted with four outliers retained and removed from the 4-cluster model (n = 4 participants who had an extension of their stroke). Class assignment remained identical for remaining cases (i.e. there was no shift in cluster membership), and there was negligible change in model fit.

Associations between participant characteristics and cluster membership

Baseline characteristics of participants in Clusters 2, 3, and 4 were compared against participants in Cluster 1: LOWstart/LOWprogress (reference cluster) (see ). On average, participants in the LOWstart/HIGHprogress cluster were earlier post stroke on admission (β: −4.81 days, 95% CI: −8.94 to −0.69) compared to the LOWstart/LOWprogress cluster. Participants in the MODstart/MODprogress cluster had lower levels of disability as indicated by modified Rankin Scale scores on admission (β: −0.74, 95% CI: −1.15 to −0.32) compared to the LOWstart/LOWprogress cluster. Participants in the MODstart/LOWprogress and MODstart/MODprogress clusters were less likely to have severe disability (mRS of 5) on admission (OR: 0.18, 95% CI: 0.39 to 0.81 and OR: 0.13, 95% CI: 0.31 to 0.50) respectively) compared to the LOWstart/LOWprogress cluster.

Participants in the MODstart/LOWprogress, LOWstart/HIGHprogress and MODstart/MODprogress clusters were more likely to have a higher admission score on the Box and Block Test (β: 23.58, 95% CI: 18.82 to 28.34; β: 4.85, 95% CI: 0.85 to 9.61; β: 28.02, 95% CI: 23.82 to 32.21 respectively), Upper Limb-Motor Assessment Scale (β: 9.60, 95% CI: 7.24 to 11.97; β: 3.34, 95% CI: 0.97 to 5.70; β: 10.86, 95% CI: 8.77 to 12.94 respectively), Action Research Arm (β: 31.09, 95% CI: 22.86 to 39.33; β: 12.69, 95% CI: 4.46 to 20.93; β: 38.01, 95% CI: 30.76 to 45.27 respectively), and Manual Muscle Test (β: 10.64, 95% CI: 7.07 to 14.21; β: 6.24, 95% CI: 2.67 to 9.81; β: 11.87, 95% CI: 8.72 to 15.01 respectively), compared with the LOWstart/LOWprogress cluster. On average, participants in the MODstart/LOWprogress, LOWstart/HIGHprogress and MODstart/MODprogress clusters had a shorter hospital length of stay than the LOWstart/LOWprogress cluster (β: −31.19, 95% CI: −47.79 to −14.59; β: −21.73, 95% CI: −38.33 to −5.13 and β: −34.72, 95% CI: −49.35 to −20.10 respectively).

Repetitions of arm practice and cluster membership

shows weekly Box and Block Test scores plotted over the first month by cluster. Bubble size represents the number of repetitions of arm practice completed per participant per week (bubble diameter is proportional to number of repetitions completed). The number of repetitions completed by participants varied greatly.

Figure 3. Participants Box and Block Test (BBT) scores plotted by week. Each bubble represents an individual score with bubble diameter representing number of repetitions completed per week (Overall Repetitions Range: 0–10,196; Mean (SD) 268.3 (722.1); Median (IQR): 117 (301) (greater diameter = higher number of repetitions completed).

Figure 3. Participants Box and Block Test (BBT) scores plotted by week. Each bubble represents an individual score with bubble diameter representing number of repetitions completed per week (Overall Repetitions Range: 0–10,196; Mean (SD) 268.3 (722.1); Median (IQR): 117 (301) (greater diameter = higher number of repetitions completed).

Discussion

Our study explored trajectories of early arm recovery using Box and Block Test scores collected over the first four weeks of inpatient rehabilitation. Four distinct clusters of arm recovery were identified in stroke survivors. Two clusters included participants with low to moderate levels of baseline arm function, with little to no improvement at four weeks. The remaining two clusters included participants with low to moderate levels of baseline arm function, and moderate to high rates of improvement over the study period. Compared to the LOWstart/LOWprogress cluster, participants in the LOWstart/HIGHprogress cluster were more likely to be earlier post stroke. Lower levels of disability on admission were strongly associated with the LOWstart/HIGHprogress cluster. Higher levels of baseline arm function were strongly associated with MODstart/LOWprogress, MODstart/MODprogress and LOWstart/HIGHprogress clusters when compared to the LOWstart/LOWprogress cluster.

Our study is the first to identify latent trajectories of arm recovery early after stroke using weekly outcome data. Our 4-cluster model has some similarities to PREP and PREP2 which both identified four recovery groups. In PREP2, Stinear and colleagues identified two groups that showed limited progress, known as ‘poor’ and ‘limited’, and two groups of progressors known as ‘good’ and ‘excellent’ however, they used a regression tree analysis in their study [Citation10]. Further investigation is warranted to develop simple clinician tool using trajectories identified in our study to classify stroke survivors into likely recovery groups and predict recovery without the need for TMS equipment or staff training. In contrast, PREP2 uses a more complex algorithm including a three-level decision tree to classify stroke survivors, based on clinical factors and a neurological biomarker namely MEPs, measured using TMS [Citation10].

The proportional recovery model suggests that in people with non-severe motor arm impairment, the magnitude of recovery in the first 3–6 months after stroke, is approximately 0.7 times the initial impairment measured with Fugl Myer assessment [Citation9]. The model also suggests that 30% of people with severe arm impairment do not show such recovery (‘non responders’) [Citation9]. In our study a high proportion (n = 40; 54%) of people had severe arm impairment at baseline. Of this group 15 (37.5%) had minimal arm recovery representing the LOWstart/LOWprogress group and 25 (65.5%) had high levels of recovery representing the LOWstart/high progress group. Our findings show a slightly higher proportion of ‘non responders’ at 4 weeks however we did not follow up people at 3–6 months to be able to compare to people in the proportional recovery validation studies. This comparison could be conducted in future studies with a longer follow up period.

Several models have included and used the Action Research Arm Test (SAFE and PREP [Citation8,Citation10,Citation11] to measure arm recovery. We also used the Box and Block Test, which was more responsive to change than the ARAT in the first month post-stroke [Citation55]. As with PREP and PREP2, we were unable to establish an association between arm practice dose and cluster membership [Citation10,Citation11]. A recent study conducted a retrospective analysis comparing the responsiveness of stroke participants who completed the 3-week Queen Square Upper Limb program - a high intensity arm training program (6 h daily, 5 days per week for 3 weeks, 90 total hours) to the Rehabilitation Gaming System - a low intensity arm training program (20–30 min/session, 3–5 days a week for 3–12 weeks, 3 to 30 total hours) across different stages of chronicity post-stroke (< 6 months to > 4 years) [Citation56]. Stroke participants in that study who completed the high-intensity Queen Square program had superior arm outcomes to those who completed the low-intensity program at all stages post stroke [Citation56]. Further studies are needed to investigate and model associations between type and dose of arm rehabilitation, and trajectories of recovery. For example, while the Queen Square study provides some evidence that providing arm training of different intensities produced different outcomes, future studies should investigate whether increasing rehabilitation dose (repetitions) changes a person’s arm recovery course, moving them into a more favourable cluster and/or improving their outcome.

Strengths of this study include the use of data from a consecutive sample of participants recruited to a prospective inception cohort study. Second, the use of a cohort of stroke survivors that were receiving usual inpatient rehabilitation rather than an experimental intervention. Third, the inclusion of all types of stroke with arm impairment. A final strength is use of a robust methodology including latent class growth analysis and goodness of fit indices. This approach allowed for all cases to be included, caters for missing data and outliers within the model [Citation22]. This approach also enabled us to evaluate multiple models and determine the optimal number of clusters, reducing the potential for selection bias and misclassifying subjects into clusters. These strengths combined have resulted in the identification of arm recovery trajectories based on a robust methodology, clinical relevance and are based on a sample that is more likely to be representative of usual care.

In our study the largest improvements were observed in the first week. We acknowledge we were interested in very early recovery, and the follow-up period was relatively short. A study which follows participants over a longer period of time, for example over 12 months would potentially help us better understand if those in the poor/limited early recovery clusters continue to have the same recovery trajectory or have a slow/delayed recovery. A recent observational study by Borschmann and colleagues (2020) which followed stroke survivors over a 2-year period found the greatest amount of recovery occurred between 3 and 6 weeks with improvements still occurring at 18 months, and in some individuals at 24 months [Citation57]. The researchers did not however perform any modelling to examine sub groups of recovery (likely due to their small sample size).

There are some other limitations to our study. First, while the sample was larger than that used in other studies [Citation9,Citation11], the sample size was still relatively small. A larger sample with over 100 participants per cluster would have allowed us to explore factors associated with cluster membership and outcome using multivariate statistics. Second, the original cohort study data excluded participants with limited cognition/language ability. Third, data were collected from participants at one comprehensive Australian stroke unit, and may not be generalisable to other settings. Fourth, while the Stroke Recovery and Rehabilitation Roundtable (SRRR) suggest that tools measuring change at the activity level are more likely to represent true recovery [Citation57], we can’t be fully certain of the mechanisms behind the observed clinical improvement. Finally, the regression modelling was based on absolute assignment of participants to each cluster, instead of proportional assignment. However, the very high average probability of membership for each cluster (92%) is unlikely to impact regression estimates [Citation58]. Because of these limitations, and the exploratory nature of the study, our findings should be interpreted with caution.

In conclusion, our findings show heterogeneity in patterns of arm recovery after stroke. Our exploratory model provides a way of classifying stroke survivors into four distinct clusters of arm recovery. The model was developed using clinical data that clinicians can easily collect at the bedside. Identification of four distinct clusters offers clinicians and stroke survivors a better understanding of the potential course of early arm recovery and can guide future model development. Further research is required in a larger cohort of stroke survivors and other rehabilitation settings to develop and validate a model that could be used by clinicians to inform early arm rehabilitation and prognosis in the rehabilitation setting.

Author contributions statement

All authors contributed to the manuscript as follows: AVC, AD, AM and CS: conception, design, analysis and interpretation of data, writing, editing and approval of the final manuscript.

Acknowledgments

Thank you to the study participants.

Disclosure statement

No potential competing interest was reported by the author(s).

Data availability statement

The data that support the findings of this study are available from the corresponding author, AVC, upon reasonable request.

Additional information

Funding

This study was supported by grants from the National Stroke Foundation and Ingham Institute for Applied Medical Research. CS is supported by a fellowship from the National Health and Research Medical Council. No funding agency had any role in the design, conduct, analyses, or reporting of the study or in the decision to submit the manuscript for publication.

References

  • Lawrence ES, Coshall C, Dundas R, et al. Estimates of the prevalence of acute stroke impairments and disability in a multiethnic population. Stroke. 2001;32(6):1279–1284.
  • Nakayama H, Jørgensen HS, Raaschou HO, et al. Recovery of upper extremity function in stroke patients: the Copenhagen stroke study. Arch Phys Med Rehabil. 1994;75(4):394–398.
  • Persson HC, Parziali M, Danielsson A, et al. Outcome and upper extremity function within 72 hours after first occasion of stroke in an unselected population at a stroke unit. A part of the SALGOT study. BMC Neurol. 2012;12(1):1–6.
  • National Stroke Foundation. National stroke audit-rehabilitation services report 2018. Melbourne (Australia): National Stroke Foundation; 2018.
  • Kim B, Winstein C. Can neurological biomarkers of brain impairment be used to predict poststroke motor recovery? A systematic review. Neurorehabil Neural Repair. 2017;31(1):3–24.
  • Chen SY, Winstein CJ. A systematic review of voluntary arm recovery in hemiparetic stroke: critical predictors for meaningful outcomes using the international classification of functioning, disability, and health. J Neurol Phys Ther. 2009;33(1):2–13.
  • Coupar F, Pollock A, Rowe P, et al. Predictors of upper limb recovery after stroke: a systematic review and meta-analysis. Clin Rehabil. 2012;26(4):291–313.
  • Nijland RH, van Wegen EE, Harmeling-van der Wel BC, EPOS Investigators, et al. Presence of finger extension and shoulder abduction within 72 hours after stroke predicts functional recovery: early prediction of functional outcome after stroke: the EPOS cohort study. Stroke. 2010;41(4):745–750.
  • Prabhakaran S, Zarahn E, Riley C, et al. Inter-individual variability in the capacity for motor recovery after ischemic stroke. Neurorehabil Neural Repair. 2008;22(1):64–71.
  • Stinear CM, Byblow WD, Ackerley SJ, et al. PREP2: a biomarker-based algorithm for predicting upper limb function after stroke. Ann Clin Transl Neurol. 2017;4(11):811–820.
  • Stinear CM, Barber PA, Petoe M, et al. The PREP algorithm predicts potential for upper limb recovery after stroke. Brain. 2012;135(Pt 8):2527–2535.
  • Kwah L, Herbert R. Prediction of walking and arm recovery after stroke: a critical review. Brain Sci. 2016;6(4):53.
  • Jung T, Wickrama KA. An introduction to latent class growth analysis and growth mixture modeling. Soc Personal Psychol. 2008;2(1):302–317.
  • Nylund-Gibson K, Choi AY. Ten frequently asked questions about latent class analysis. Transl Issues Psychol Sci. 2018;4(4):440–461.
  • Kongsted A, Kent P, Axen I, et al. What have we learned from ten years of trajectory research in low back pain? BMC Musculoskelet Disord. 2016;17(1):1–11.
  • Pan JH, Song XY, Lee SY, et al. Longitudinal analysis of quality of life for stroke survivors using latent curve models. Stroke. 2008;39(10):2795–2802.
  • Munyombwe T, West R, Hill K, et al. P2-205 heterogeneity in functional recovery after stroke: an exploratory study using longitudinal latent class analysis. J Epidemiol Community Health. 2011;65(Suppl 1):A278–A278.
  • Arling G, Ofner S, Reeves MJ, et al. Care trajectories of veterans in the 12 months after hospitalization for acute ischemic stroke. Circ Cardiovasc Qual Outcomes. 2015;8(6 Suppl 3):S131–S140.
  • West R, Hill K, Hewison J, et al. Psychological disorders after stroke are an important influence on functional outcomes: a prospective cohort study. Stroke. 2010;41(8):1723–1727.
  • Kjeverud A, Østlie K, Schanke A-K, et al. Trajectories of fatigue among stroke patients from the acute phase to 18 months post-injury: a latent class analysis. PLoS One. 2020;15(4):e0231709.
  • Mirbagheri MM, Rymer WZ. Time-course of changes in arm impairment after stroke: variables predicting motor recovery over 12 months. Arch Phys Med Rehabil. 2008;89(8):1507–1513.
  • Berlin KS, Williams NA, Parra GR. An introduction to latent variable mixture modeling (part 1): overview and cross-sectional latent class and latent profile analyses. J Pediatr Psychol. 2014;39(2):174–187.
  • Vratsistas-Curto A, Sherrington C, McCluskey A. Dosage and predictors of arm practice during inpatient stroke rehabilitation: an inception cohort study. Disabil Rehabil. 2021;43(5):640–647.
  • Callahan CM, Unverzagt FW, Hui SL, et al. Six-item screener to identify cognitive impairment among potential subjects for clinical research. Med Care. 2002;40(9):771–781.
  • Scrivener K, Sherrington C, Schurr K, et al. Many participants in inpatient rehabilitation can quantify their exercise dosage accurately: an observational study. J Physiother. 2011;57(2):117–122.
  • Stroke Foundation. Clinical guidelines for stroke management. Melbourne, Australia: stroke Foundation; 2021.
  • Charlson ME, Pompei P, Ales KL, et al. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373–383.
  • van Swieten JC, Koudstaal PJ, Visser MC, et al. Interobserver agreement for the assessment of handicap in stroke patients. Stroke. 1988;19(5):604–607.
  • Lannin NA. Reliability, validity and factor structure of the Upper Limb subscale of the Motor Assessment Scale (UL-MAS) in adults following stroke. Disabil Rehabil. 2004;26(2):109–116.
  • Carr JH, Shepherd RB, Nordholm L, et al. Investigation of a new motor assessment scale for stroke patients. Phys Ther. 1985;65(2):175–180.
  • Herbison GJ, Isaac Z, Cohen ME, et al. Strength post-spinal cord injury: myometer vs manual muscle test. Spinal Cord. 1996;34(9):543–548.
  • Marx RG, Bombardier C, Wright JG. What do we know about the reliability and validity of physical examination tests used to examine the upper extremity? J Hand Surg Am. 1999;24(1):185–193.
  • Schwartz S, Cohen ME, Herbison GJ, et al. Relationship between two measures of upper extremity strength: manual muscle test compared to hand-held myometry. Arch Phys Med Rehabil. 1992;73(11):1063–1068.
  • Mathiowetz V, Volland G, Kashman N, et al. Adult norms for the box and block test of manual dexterity. Am J Occup Ther. 1985;39(6):386–391.
  • Tardieu G, Rondot P, Mensch J, et al. Electromyographic response to muscular stretching in normal subjects. Rev Fr Etud Clin Biol. 1957;2(10):998–1004.
  • Tardieu G, Shentoub S, Delarue R. Research on a technic for measurement of spasticity. Rev Neurol. 1954;91(2):143–144.
  • Kwakkel G, van Peppen R, Wagenaar RC, et al. Effects of augmented exercise therapy time after stroke: a meta-analysis. Stroke. 2004;35(11):2529–2539.
  • Lohse KR, Lang CE, Boyd LA. Is more better? Using metadata to explore dose–response relationships in stroke rehabilitation. Stroke. 2014;45(7):2053–2058.
  • Veerbeek JM, Koolstra M, Ket JC, et al. Effects of augmented exercise therapy on outcome of gait and gait-related activities in the first 6 months after stroke: a meta-analysis. Stroke. 2011;42(11):3311–3315.
  • Wang J, Wang X. Structural equation modeling: applications using mplus. New York: John Wiley & Sons; 2019.
  • Van De Schoot R, Sijbrandij M, Winter SD, et al. The GRoLTS-checklist: guidelines for reporting on latent trajectory studies. Struct Equ Modeling. 2017;24(3):451–467.
  • Bauer DJ, Curran PJ. Distributional assumptions of growth mixture models: implications for overextraction of latent trajectory classes. Psychol Methods. 2003;8(3):338–363.
  • Depaoli S, Winter SD, Lai K, et al. Implementing continuous non-normal skewed distributions in latent growth mixture modeling: an assessment of specification errors and class enumeration. Multivariate Behav Res. 2019;54(6):795–821.
  • Dunn KM, Jordan K, Croft PR. Characterizing the course of low back pain: a latent class analysis. Am J Epidemiol. 2006;163(8):754–761.
  • Downie AS, Hancock MJ, Rzewuska M, et al. Trajectories of acute low back pain: a latent class growth analysis. Pain. 2016;157(1):225–234.
  • Kongsted A, Kent P, Hestbaek L, et al. Patients with low back pain had distinct clinical course patterns that were typically neither complete recovery nor constant pain. A latent class analysis of longitudinal data. Spine J. 2015;15(5):885–894.
  • Li C. Little’s test of missing completely at random. The Stata Journal. 2013;13(4):795–809.
  • Lo Y, Mendell NR, Rubin DB. Testing the number of components in a normal mixture. Biometrika. 2001;88(3):767–778.
  • Chen Q, Luo W, Palardy GJ, et al. The efficacy of common fit indices for enumerating classes in growth mixture models when nested data structure is ignored: a Monte Carlo study. Sage Open. 2017;7(1):215824401770045.
  • Celeux G, Soromenho G. An entropy criterion for assessing the number of clusters in a mixture model. J Classif. 1996;13(2):195–212.
  • Nagin DS, Nagin D. Group-based modeling of development. Cambridge (MA): Harvard University Press; 2005.
  • Nagin DS, Land KC. Age, criminal careers, and population heterogeneity: specification and estimation of a nonparametric, mixed Poisson model. Criminology. 1993;31(3):327–362.
  • Tsivgoulis G, Apostolidou N, Giannopoulos S, et al. Hemodynamic causes of deterioration in acute ischemic stroke. Perspect Med. 2012;1(1-12):177–184.
  • Jackson H, Engelman M, Bandeen-Roche K. Robust respondents and lost limitations: the implications of nonrandom missingness for the estimation of health trajectories. J Aging Health. 2019;31(4):685–708.
  • Vratsistas-Curto A, Sherrington C, McCluskey A. Responsiveness of five measures of arm function in acute stroke rehabilitation. Clin Rehabil. 2018;32(8):1098–1107.
  • Ballester BR, Ward NS, Brander F, et al. Relationship between intensity and recovery in post-stroke rehabilitation: a retrospective analysis. J Neurol Neurosurg Psychiatry. 2022;93(2):226–228.
  • Borschmann KN, Hayward KS. Recovery of upper limb function is greatest early after stroke but does continue to improve during the chronic phase: a two-year, observational study. Physiotherapy. 2020;107:216–223.
  • Davies CE, Giles LC, Glonek GF. Performance of methods for estimating the effect of covariates on group membership probabilities in group-based trajectory models. Stat Methods Med Res. 2018;27(10):2918–2932.

Appendix 1. 

Checklist-guidelines for reporting on latent trajectory studies (GRoLTS)