Publication Cover
Physiotherapy Theory and Practice
An International Journal of Physical Therapy
Volume 36, 2020 - Issue 1
2,355
Views
9
CrossRef citations to date
0
Altmetric
Descriptive Report

Validity and test–retest reliability of the six-spot step test in persons after stroke

, RPT, MSc, , RN, , MD & , RPT
Pages 211-218 | Received 24 May 2017, Accepted 12 Mar 2018, Published online: 06 Jun 2018

ABSTRACT

Background and Purpose: After stroke, asymmetric weight distribution is common with decreased balance control in standing and walking. The six-spot step test (SSST) includes a 5-m walk during which one leg shoves wooden blocks out of circles marked on the floor, thus assessing the ability to take load on each leg. The aim of the present study was to investigate the convergent and discriminant validity and test–retest reliability of the SSST in persons with stroke. Methods: Eighty-one participants were included. A cross-sectional study was performed, in which the SSST was conducted twice, 3–7 days apart. Validity was investigated using measures of dynamic balance and walking. Reliability was assessed using intraclass correlation coefficient, standard error of the measurement (SEM), and smallest real difference (SRD). Results: The convergent validity was strong to moderate, and the test–retest reliability was good. The SEM% was 14.7%, and the SRD% was 40.8% based on the mean of four walks shoving twice with the paretic and twice with the non-paretic leg. Conclusion: Values on random measurement error were high affecting the use of the SSST for follow-up evaluations but the SSST can be a complementary measure of gait and balance.

Introduction

Balance control is a complex function that coordinates the body’s center of gravity relative to the support surface and can be defined as the ability to maintain, achieve, or restore postural stability (Shumway-Cook and Woollacott, Citation2012). After a stroke, hemiparesis with muscle weakness, spasticity, and impaired sensory function may result in an asymmetric weight distribution in standing and walking, with more loads on the non-paretic limb (Langhorne, Bernhardt, and Kwakkel, Citation2011). Balance control may also be affected by impaired vision and cognition. A consequence of an asymm etric walking pattern and impaired balance control is high risk for falls (Hendrickson et al., Citation2014; Kamphuis, De Kam, Geurts, and Weerdesteyn, Citation2013). Several activities in daily living require the ability to perform multiple tasks simultaneously. Persons with stroke have been reported to have difficulties with performing another task while walking (An et al., Citation2014; Yang et al., Citation2007), as dual tasking can affect balance control.

Assessing gait performance after stroke requires measuring instruments that include various components of walking used in everyday life. Recommended instruments that measure balance control during walking activities are the Dynamic Gait Index (DGI) (Shumway-Cook and Woollacott, Citation1995), and the Functional Gait Assessment (FGA) (Lin et al., Citation2010; Wrisley, Marchetti, Kuharsky, and Whitney, Citation2004). The DGI includes eight items where a physiotherapist (PT) rates performance of different walking tasks such as changing walking speed, walking while turning the head, walking around or over obstacles, and walking with a pivot turn. The FGA is based on the DGI adding three more challenging items. A well-known instrument of functional balance is the Timed Up and Go (TUG) test (Podsiadlo and Richardson, Citation1991). The TUG test includes a rise from a chair, a 3-m walk, and a turn. However, in daily life, walking is often performed in complex situations; thus, there can be a need of measuring more advanced balance control during walking also capturing the unilateral limitations and asymmetric weight distribution.

The six-spot step test (SSST) was originally developed as a walking test for persons with multiple sclerosis (MS) and includes a motor dual task. The SSST comprises a short walk of 5 m involving shifting weight from one foot to the other, and single-leg standing while shoving wooden blocks (Nieuwenhuis, Van Tongeren, Sorensen, and Ravnborg, Citation2006).

The SSST with the complex combination of a forced weight shift from one foot to another, into a single-leg standing, during a simultaneous ambulation activity involving coordination on time, can give the opportunity to provide a more comprehensive picture of walking capacity after stroke than the DGI, the FGA, and the TUG test. However, psychometric properties of the SSST for use in persons with stroke need to be evaluated. In studies of people with MS, the SSST has had good test–retest reliability with high intraclass correlation coefficient (ICC) values of 0.95–0.98 (Nieuwenhuis, Van Tongeren, Sorensen, and Ravnborg, Citation2006; Pavan, Tilbery, Lianza, and Marangoni, Citation2010), and strong associations (r = 0.73–0.92) with the measuring instruments: timed 25-ft walk test, 6-min walk test, and the TUG test (Fritz, Jiang, Keller, and Zackowski, Citation2016; Nieuwenhuis, Van Tongeren, Sorensen, and Ravnborg, Citation2006; Pavan, Tilbery, Lianza, and Marangoni, Citation2010; Sandroff, Motl, Sosnoff, and Pula, Citation2015).

Psychometric evaluation includes establishing construct validity that reflects the ability of an instrument to measure a concept (Portney and Watkins, Citation2007). Construct validity can be gathered in different ways such as convergent validity that two measuring instruments are believed to reflect the same phenomenon. Discriminant validity indicates that lower correlations are expected from measuring instruments that are believed to assess different characteristics (Portney and Watkins, Citation2007). Instruments that are believed to measure other aspects of balance control, such as balance confidence and overall lower extremity function, are expected to have a low correlation to the SSST. The aim of the present study was to investigate the convergent and discriminant validity and test–retest reliability of the SSST in persons with stroke. The hypotheses were that the SSST would be strongly associated with measuring instruments of walking balance: the TUG test, the DGI, and the four square step test (FSST). Discriminant validity was investigated using the timed-stands test (TST) and the Activities-specific Balance Confidence (ABC) scale, and low-to-moderate associations with these were hypothesized.

Methods

Study design

This study had a cross-sectional design, and data were collected at five primary health-care centers in Sweden from June 2014 until November 2015.

Subjects

A convenience sample of community-dwelling persons was recruited. The inclusion criteria were (1) a minimum of 6 months poststroke; (2) medically stable with no other disease/condition that significantly influenced gait performance; (3) ability to understand both verbal and written information; and (4) self-reported walking impairment, but ability to walk at least 10 m with or without assistive device. Individuals meeting the inclusion criteria received written information about the study. Thereafter, they were contacted by telephone and invited to participate. Prior to data collection, they signed an informed consent. The study was approved by the Regional Ethics Committee of Uppsala/Örebro, Sweden (2014/058). Reliability studies are recommended to include at least 30–50 participants (Flansbjer, Blom, and Brogardh, Citation2012; Hopkins, Citation2000), with larger sample sizes giving more confident results. Therefore, we set out to recruit about 80 individuals.

Procedures

The SSST is performed in a taped walkway (test field) that is 5 m long and 1 m wide (Nieuwenhuis, Van Tongeren, Sorensen, and Ravnborg, Citation2006). A circle with diameter of 20 cm is marked in the middle of each end-line of the test field. Two more circles of the same size are marked on each of the sidelines of the test field, with a distance from the end-line of 1 and 3 on one side, and 2 and 4 m on the other side. One of the end circles is empty and is used as the starting point. In the center of the other five circles, a cylindrical wooden block is placed, with a diameter of 8 cm and a height of 4 cm, and weighing 134 g ().

Figure 1. Six-spot step test (SSST).

The test starts with the person to be tested standing in the start circle at the end line of the field. A wooden block is placed in the center of each of the remaining circles. The person criss-crosses from one circle to the next and shoves the blocks out of the circles. One leg is used for shoving all the blocks during one walk. The test is then repeated with the other leg shoving the blocks. The test is performed twice for each leg.

Figure 1. Six-spot step test (SSST).The test starts with the person to be tested standing in the start circle at the end line of the field. A wooden block is placed in the center of each of the remaining circles. The person criss-crosses from one circle to the next and shoves the blocks out of the circles. One leg is used for shoving all the blocks during one walk. The test is then repeated with the other leg shoving the blocks. The test is performed twice for each leg.

The test starts when the participant first lifts one foot from the start circle and ends when the last block is shoved out of the end circle. The instruction is to do the test in fast walking, criss-crossing from circle to circle and always using the same foot to shove the blocks out of the circles. In the present study, the SSST was performed in four trials, two trials for shoving with the paretic and two for shoving with the non-paretic limb. The total mean time for the four trials was calculated, as well as for shoving with the paretic and the non-paretic leg, respectively.

In the TUG test, time is measured from when a person rises from a chair, walks 3 m at comfortable speed, turns around, walks back, and sits down. The participants performed one trial. The TUG was developed as a measure of functional mobility (Podsiadlo and Richardson, Citation1991) and has been found valid and reliable in persons with stroke (Flansbjer et al., Citation2005).

The DGI consists of eight items assessing dynamic balance in different walking tasks such as changing walking speed, walking while turning the head, walking around or over obstacles, and walking with a pivot turn. Performance in each item was rated by the PT on a 4-point scale ranging from 0 (severe impairment) to 3 (normal walking ability without a walking aid). The maximum total score is 24 points (Shumway-Cook and Woollacott, Citation1995). The DGI has been found valid and reliable in persons with stroke (Jonsdottir and Cattaneo, Citation2007).

The FSST measures the ability to quickly step over four sticks on the floor marking four squares. Four sticks of 2.5-cm height were placed on the floor at 90-deg angles to each other. The person, steps forward, backward, and sideways first in a clockwise and an anticlockwise direction, putting most of the load on one leg at a time but making contact with the floor in each square with each foot (Dite and Temple, Citation2002). The participants were instructed to step in each square, trying not to touch the sticks. Time was noted as the participants completed the task. Two trials were performed and the mean value of these was calculated. If the sequence was not completed correctly, the person lost balance or moved the stick out of place, and it was possible to repeat (restart) the sequence one time. All participants fulfilled two trials of the FSST. The FSST has good intra- and inter-rater reliability as well as validity in stroke (Blennerhassett and Jayalath, Citation2008; Goh, Chua, Hong, and Ng, Citation2013).

The TST is a measuring instrument of overall lower extremity function (Csuka and McCarty, Citation1985) and has been found valid in rheumatoid arthritis and other chronic diseases (Newcomer, Krug, and Mahowald, Citation1993). The time to rise 10 times from sitting in a standardized chair with armrest to full standing was taken.

The ABC scale is a patient-reported questionnaire that assesses self-rated balance confidence when performing daily activities (Myers, Fletcher, Myers, and Sherk, Citation1998). Using an 11-point scale, the participants stated how safe and secure they felt when performing 16 different indoor and outdoor activities, ranging from 0% (not at all safe and secure) to 100% (completely safe and secure). The item scores were summarized and then divided by 16 (the number of items). A validated Swedish version of the ABC scale was used (Forsberg and Nilsagard, Citation2013).

Procedure

Data collection was performed on two occasions, 3–7 days apart, by five PTs specialized in neurological physical therapy (including the first author, MAL). The first author performed 70% of the data collection. All conditions were kept as stable as possible; the same PTs performed both tests, at the same location in an ordinary hall-way at the primary health-care centers, and around the same time at the day. If walking aids were needed, it was used during both test occasions.

Before start of the data collection, a training session was performed where the PTs were introduced to the study protocol, all the measuring instruments and the procedure. Although inter-rater reliability of the DGI has been found good (Jonsdottir and Cattaneo, Citation2007), to achieve consistency of the ratings from the PTs, they watched a video recording of a person performing the DGI and then discussed the scoring. In the ABC scale, the PTs followed the associated instructions and were available for questions when the person rated their own balance confidence.

The first test session lasted approximately 45 min and started with an interview regarding demographic characteristics, use of walking aids indoors and outdoors, and self-reported number of falls during the last 3 months. To describe the participant’s severity of stroke, item 5 (arm) and 6 (leg) from the National Institutes of Health Stroke Scale (Lyden et al., Citation1994) were tested. The participant was in a supine position, asked to lift one arm to 45 deg, and hold it there for 10 s, then lift a leg to 30 deg and hold that position for 5 s. Performance was graded from 0 to 4 where 0 = able to hold the arm for 10 s and the leg for 5 s, and 4 = no movement in either arm or leg, see further information on the scoring in .

Table 1. Baseline characteristics of the 81 participants.

The balance measurements were then performed in a standardized order, starting with the ABC scale, followed by the TST, the FSST, the DGI, the TUG, and, finally, the SSST. On the second test occasion only, the SSST was performed. The participants were instructed not to change their activity habits between the two test occasions. During the tests, they wore comfortable shoes and if needed they could rest between the measures.

Statistical analysis

The distribution of the variables is given as means, standard deviations (SDs), minimums and maximums for continuous variables, and numbers and percentages for categorical variables. To evaluate the external convergent validity between the first test session of the SSST and the other measures, Spearman’s and Pearson’s correlation coefficients were calculated with correlation coefficients <0.30 interpreted as weak, 0.30–0.59 as moderate, and ≥0.60 as strong (Andresen, Citation2000).

Test–retest reliability was evaluated based on the distribution of the change between the two measurements, using intra-individual SD (standard error of the measurements [SEM]), intra-individual coefficient of variation (SEM%), repeatability coefficient (smallest real difference, [SRD] √2 × 1.96 × intra-individual SD), SRD%, and ICC (ICC2.1), Shrout–Fleiss reliability: random set (Bland and Altman, Citation1996; Shrout and Fleiss, Citation1979). Wilcoxon’s signed-rank test was used for analysis of systematic changes between the repeated measures.

A Bland–Altman plot was used to visually present the data (Bland and Altman, Citation1986). Bland–Altman plots are useful for determining whether the difference between the measurements is dependent on the mean of the measurements. The plots can also be used to illustrate heteroscedasticity with a systematic variation around the zero line, and a wider spread with larger and lower mean values and also possible outliers. The possibility of heteroscedasticity can be addressed by performing the Pearsonʼs correlations coefficient test between the mean and the absolute difference between tests 2 and 1 and the means of the two sessions (Bland and Altman, Citation1999; Huang et al., Citation2011). An outlier was considered to be present when the difference between the two tests sessions was outside two SDs. All significance tests were two-sided and conducted at the 5% significance level. Statistical analyses were performed using SAS version 9 (SAS, Cary, NC, USA) and SPSS version 22.0 (SPSS Inc., Chicago, IL, USA).

Results

Eighty-one participants, 34 women and 47 men, with a mean age of 70 years were included. Of the total, 43% had left hemiparesis and 57% had right, and 59% used walking aids outdoors. In , further baseline characteristics are presented.

Correlation coefficients between the SSST and the instruments of walking balance (TUG, DGI and FSST) were r = 0.832–0.918, and the TST r = 0.568 and the ABC scale r = 0.596 (). The mean time for performing the TUG test () was 19.0 s which indicates walking disability.

Table 2. Correlations between the six-spot step test (SSST) and the other tests, using Pearson’s correlation coefficient for timed measures and Spearman’s correlations for rating scales.

In , the mean values from the two test sessions of the SSST are presented. Participants walked significantly faster while shoving the blocks with their non-paretic leg. Thirty-one participants (38%) used one to two canes or crutches to perform all tests.

Table 3. Results of the six-spot step test (SSST) n = 81 performed on two test occasions and showing scores for shoving with the paretic versus the non-paretic limb, and the test–retest results.

The mean change from the first to the second test was about 1 s () (). The Bland–Altman plot gives more values under the line than above, illustrating a better performance at the second test, where 57 (70%) of the participants showed better performance compared with the first test (). The plot also shows that the intra-individual SD increased with increased time to perform the SSST (). Indications of heteroscedasticity with larger variability for higher test values were also seen in the plot. The Pearson correlation coefficient for the absolute difference between tests 2 and 1 and the means of the two sessions were 0.70. One outlier was found (), and after it had been removed, heteroscedasticity reduced to a value of 0.60. The test–retest results of the SSST are presented in . The ICC2.1 values for the SSST ranged from 0.93 to 0.96. The SEM% values ranged between 14.7% and 20.0%. The SRD% values ranged between 40.8% and 55.4%; the lowest value is based on the mean of the four trials on the SSST ().

Figure 2. Bland–Altman plot showing the difference, in seconds, between tests 2 and 1 of the six-spot step test (SSST), compared with the mean of tests 1 and 2. The variation around the zero line is plotted.

Dotted lines represent means and limits of agreement, in seconds.

Figure 2. Bland–Altman plot showing the difference, in seconds, between tests 2 and 1 of the six-spot step test (SSST), compared with the mean of tests 1 and 2. The variation around the zero line is plotted.Dotted lines represent means and limits of agreement, in seconds.

Discussion

This study showed strong convergent validity between the SSST and the TUG test, the DGI and the FSST. Discriminant validity was shown with moderate association with the ABC scale and the TST. Test–retest reliability for the SSST was high; however, the fairly high values on SEM and SRD indicate that sensitivity for repeated measures is limited for the SSST.

Most prior studies of the SSST have been performed in persons with MS where similar results on correlations with other walking tests have been found (Nieuwenhuis, Van Tongeren, Sorensen, and Ravnborg, Citation2006; Pavan, Tilbery, Lianza, and Marangoni, Citation2010; Sandroff, Motl, Sosnoff, and Pula, Citation2015). In the MS population, the SSST seems to be most suitable for persons with moderate MS (Fritz, Jiang, Keller, and Zackowski, Citation2016; Nieuwenhuis, Van Tongeren, Sorensen, and Ravnborg, Citation2006). In people with MS, a strong correlation between the SSST and the TUG has been shown (Fritz, Jiang, Keller, and Zackowski, Citation2016; Sandroff, Motl, Sosnoff, and Pula, Citation2015). These studies also found a strong association between the SSST and walking tests involving longer distances, such as the 2- and 6-min walking tests. The correlation coefficients between the SSST and the FSST and the DGI indicate that the SSST can capture dynamic balance also in persons with stroke. In earlier studies in persons with Parkinson disease, the combination of the DGI and the TUG is described as providing a complementary and comprehensive understanding of gait and balance control (Huang et al., Citation2011). The SSST, as the DGI, includes a more challenging balance activity, and together with the TUG, the SSST could be an option in assessing dynamic balance in persons with stroke.

Discriminant validity was here investigated with a measure of lower extremity function (the TST) and a questionnaire on self-rated balance confidence (the ABC scale). Sandroff, Motl, Sosnoff, and Pula (Citation2015) presented moderate-to-strong correlation with balance confidence and highlighted the novelty of SSST as a measure of ambulatory function also including elements of balance confidence. The moderate associations in the present study suggest that the SSST is also related to strength and endurance in the lower limbs. Feeling confident to do activities in the home and community as assessed in the ABC scale was also found related to results on the SSST.

In the present study, high retest agreement (ICC 0.96) was found, which indicates consistency between the two measures and between the individuals. Test–retest reliability has also investigated in people with MS with good test–retest reliability when performing the second test about 2 h after the first, with about the same median time in both tests (Nieuwenhuis, Van Tongeren, Sorensen, and Ravnborg, Citation2006; Pavan, Tilbery, Lianza, and Marangoni, Citation2010). Callesen et al. (Citation2018) also found high ICC values on test–retest reliability in MS when the SSST was performed 2 days apart. Reliability values were high for performing the SSST shoving with both the paretic and the non-paretic limbs. However, the participants walked significantly faster when standing on their paretic leg while shoving the block with their non-paretic limb, than vice versa. This could be due to the possibility to take a quick step with the non-paretic leg to restore balance after the kick. Another suggestion could be that lack of strength and coordination in the paretic limb makes shoving with that leg difficult and, therefore, takes longer time.

Comprehensive evaluations of reliability should include assessments of the variability of within-subject measurements. The Bland–Altman plot displays that a better performance was seen at the second test, where one explanation could be the participants being familiar with the test procedure. This finding is consistent with other studies (Callesen et al., Citation2018; Green, Forster, and Young, Citation2002) and to minimize this effect, a practice trial allowing participants to familiarize themselves with the test protocol is proposed. An increased intra-individual SD in individuals with slower walking was also seen in the Bland–Altman plot, also seen in earlier reliability gait performance studies in persons with stroke (Flansbjer et al., Citation2005).

Heteroscedasticity was present with a correlation coefficient of 0.7. In the Bland–Altman plot, one outlier was presented, a person with a slow performance on the SSST and with a better performance on the first test occasion. Even when this person was excluded, heteroscedasticity was still present with a correlation coefficient above the suggested value of 0.3 (Atkinson and Nevill, Citation1998). The existence of heteroscedasticity may easily occur in patient groups with variability in functioning (Flansbjer et al., Citation2005). In persons with stroke, it is vital to consider that persons with more challenging balance deficits may present variability in results on walking measures. This may depend on inconsistent level of motor capacity needed for ambulation in persons with a slower walking speed or less walking endurance. Persons with stroke struggle with residual symptoms such as weakness, stiffness, and fatigue, where the influence on balance and walking may vary. To minimize exterior influences, the conditions were kept as stable as possible, with the same PT at both test, in the same location, at the same time at the day, and if needed walking aids, it was the same in both test occasions. To decrease the within variability when evaluating the SSST, a shorter time period between assessments as performed by Nieuwenhuis, Van Tongeren, Sorensen, and Ravnborg (Citation2006) may be needed. Using more defined inclusion criteria may also provide a more homogenous sample, thus probably reducing the existence of heteroscedasticity.

SEM and SRD have not previously been investigated for the SSST. There are no universally accepted limits for SEM% but Flansbjer et al. (Citation2005) suggest that values ≤10% are clinically relevant with respect to sensitivity. Huang et al. (Citation2011) described limits for the percentage of the minimal detectable change (MDC%), a calculation similar to SRD%, where a MDC% score less than 30 is considered acceptable and less than 10 as excellent (Smidt et al., Citation2002). The fairly high SEM% and SRD% in the present study pointed to limited sensitivity for detecting changes beyond threshold of error. The mean time of four trials was 23.2 s, and in this group of individuals, the amount of change to reflect a true difference above measurement error must be larger than 3.4 s. The SRD reflects the minimal detectable difference and with a SRD% value of 40.8%, a person needs to walk almost 9.5 s faster to achieve true improvement. We suggest that for persons with stroke, the SSST is performed twice with each leg, thus calculating a score of mean of four trials. The high values on response stability affect the utility of the SSST as a follow-up measure. As suggested by Huang et al. (Citation2011) and Flansbjer et al. (Citation2005), using a percentage of minimal detectable change is more appropriate for interpreting a true change. To promote the utility of the SSST in stroke rehabilitation, it is necessary to investigate the responsiveness and improve the high SRD% values.

Another aspect of challenge of the forced load in the weight shifting task is the compulsory awareness of both the paretic to the non-paretic leg. In persons with stroke, the awareness of the body can be reduced (Guidetti, Asaba, and Tham, Citation2007) and in the SSST, the person might increase their limits of stability and their awareness of the body, by being forced to take load on each leg. The SSST was easily performed in all the clinical settings and the test takes only about 5–10 min.

Limitations

This study has several limitations. All participants were ambulatory; however, there was variation in balance performance and use of walking aids that are reflected in the large range of scores on the used tests, with relatively high heteroscedasticity. However, the sample can be regarded as representative of a population with mild-to-moderate disability after stroke seen in the primary health care. The inclusion criteria stated self-reported balance impairment where an objective measure of balance would have been appropriate, but the different test results did show that the participants had balance limitations. Furthermore, the results may have been affected by the order of the test items with the SSST as the last measure on the first test occasion. Validity was investigated using well-known measures of walking balance. In previous studies (Callesen et al., Citation2018; Fritz, Jiang, Keller, and Zackowski, Citation2016; Pavan, Tilbery, Lianza, and Marangoni, Citation2010) in people with MS, gait speed and walking distance have been used for validation, which also may be established in stroke. In further studies of the SSST, inter-rater reliability of the SSST also needs to be evaluated.

Conclusion

In summary, the SSST showed strong convergent validity with other instruments of gait and balance validated for stroke. Values on random measurement error were high affecting the use of the SSST as a measure for follow-up evaluations. Based on the SEM% and SRD% values, the most suitable way of applying the SSST is to calculate a person’s mean time in four trials, shoving twice with the paretic leg and twice with the non-paretic leg. The SSST includes a challenging weight shift from the paretic to the non-paretic limb, and the findings in this study suggest that the SSST can be a complementary measure of gait and balance in stroke rehabilitation.

Declaration of interest

The authors report no declarations of interest.

Acknowledgments

We would like to thank the persons with stroke who volunteered to participate, and the physiotherapists who performed the data collection: Helena Tholin, Niclas Borgqvist, Ewa Petterson, and Anna Lövgren

Additional information

Funding

This work was supported by the Swedish Stroke Association: [Grant Number 2011], Norrbacka-Eugenia Foundation: [Grant Number 804-14], and the Research Committee, Örebro County Council: [Grant Number OLL-457471].

References

  • An HJ, Kim JI, Kim YR, Lee KB, Kim DJ, Yoo KT, Choi JH 2014 The effect of various dual task training methods with gait on the balance and gait of patients with chronic stroke. Journal of Physical Therapy Science 26: 1287–1291.
  • Andresen EM 2000 Criteria for assessing the tools of disability outcomes research. Archives of Physical Medicine and Rehabilitation 81: S15–20.
  • Atkinson G, Nevill AM 1998 Statistical methods for assessing measurement error (reliability) in variables relevant to sports medicine. Sports Medicine 26: 217–238.
  • Bland JM, Altman DG 1986 Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1: 307–310.
  • Bland JM, Altman DG 1996 Measurement error. BMJ 313: 744.
  • Bland JM, Altman DG 1999 Measuring agreement in method comparison studies. Statistical Methods in Medical Research 8: 135–160.
  • Blennerhassett JM, Jayalath VM 2008 The Four Square Step Test is a feasible and valid clinical test of dynamic standing balance for use in ambulant people poststroke. Archives of Physical Medicine and Rehabilitation 89: 2156–2161.
  • Callesen J, Richter C, Kristensen C, Sunesen I, Naesby M, Dalgas U, Skjerbaek AG 2018 Test-retest agreement and reliability of the Six Spot Step Test in persons with multiple sclerosis. Multiple Sclerosis doi: 10.1177/1352458517745725. Epub ahead of print
  • Csuka M, McCarty DJ 1985 Simple method for measurement of lower extremity muscle strength. American Journal of Medicine 78: 77–81.
  • Dite W, Temple VA 2002 A clinical test of stepping and change of direction to identify multiple falling older adults. Archives of Physical Medicine and Rehabilitation 83: 1566–1571.
  • Flansbjer UB, Blom J, Brogardh C 2012 The reproducibility of Berg Balance Scale and the Single-leg Stance in chronic stroke and the relationship between the two tests. PM & R 4: 165–170.
  • Flansbjer UB, Holmback AM, Downham D, Patten C, Lexell J 2005 Reliability of gait performance tests in men and women with hemiparesis after stroke. Journal of Rehabilitation Medicine 37: 75–82.
  • Forsberg A, Nilsagard Y 2013 Validity and reliability of the Swedish version of the activities-specific balance confidence scale in people with chronic stroke. Physiotherapy Canada 65: 141–147.
  • Fritz NE, Jiang A, Keller J, Zackowski KM 2016 Utility of the Six-Spot Step Test as a measure of walking performance in ambulatory individuals with multiple sclerosis. Archives of Physical Medicine and Rehabilitation 97: 507–512.
  • Goh EY, Chua SY, Hong SJ, Ng SS 2013 Reliability and concurrent validity of Four Square Step Test scores in subjects with chronic stroke: A pilot study. Archives of Physical Medicine and Rehabilitation 94: 1306–1311.
  • Green J, Forster A, Young J 2002 Reliability of gait speed measured by a timed walking test in patients one year after stroke. Clinical Rehabilitation 16: 306–314.
  • Guidetti S, Asaba E, Tham K 2007 The lived experience of recapturing self-care. American Journal of Occupational Therapy 61: 303–310.
  • Hendrickson J, Patterson KK, Inness EL, McIlroy WE, Mansfield A 2014 Relationship between asymmetry of quiet standing balance control and walking post-stroke. Gait and Posture 39: 177–181.
  • Hopkins WG 2000 Measures of reliability in sports medicine and science. Sports Medicine 30: 1–15.
  • Huang SL, Hsieh CL, Wu RM, Tai CH, Lin CH, Lu WS 2011 Minimal detectable change of the timed “Up & Go” test and the dynamic gait index in people with Parkinson disease. Physical Therapy 91: 114–121.
  • Jonsdottir J, Cattaneo D 2007 Reliability and validity of the dynamic gait index in persons with chronic stroke. Archives of Physical Medicine and Rehabilitation 88: 1410–1415.
  • Kamphuis JF, De Kam D, Geurts AC, Weerdesteyn V 2013 Is weight-bearing asymmetry associated with postural instability after stroke? A systematic review. Stroke Research and Treatment 2013: 692137.
  • Langhorne P, Bernhardt J, Kwakkel G 2011 Stroke rehabilitation. Lancet 377: 1693–1702.
  • Lin JH, Hsu MJ, Hsu HW, Wu HC, Hsieh CL 2010 Psychometric comparisons of 3 functional ambulation measures for patients with stroke. Stroke 41: 2021–2025.
  • Lyden P, Brott T, Tilley B, Welch KM, Mascha EJ, Levine S, Haley EC, Grotta J, Marler J 1994 Improved reliability of the NIH stroke scale using video training. NINDS TPA Stroke Study Group. Stroke 25: 2220–2226.
  • Myers AM, Fletcher PC, Myers AH, Sherk W 1998 Discriminative and evaluative properties of the activities-specific balance confidence (ABC) scale. Journals of Gerontology. Series A, Biological Sciences and Medical Sciences 53: M287–294.
  • Newcomer KL, Krug HE, Mahowald ML 1993 Validity and reliability of the timed-stands test for patients with rheumatoid arthritis and other chronic diseases. Journal of Rheumatology 20: 21–27.
  • Nieuwenhuis MM, Van Tongeren H, Sorensen PS, Ravnborg M 2006 The six spot step test: a new measurement for walking ability in multiple sclerosis. Multiple Sclerosis 12: 495–500.
  • Pavan K, Tilbery CP, Lianza S, Marangoni BE 2010 Validation of the “Six Step Spot Test” for gait among patients with multiple sclerosis in Brazil. Arquivos De Neuro-Psiquiatria 68: 198–204.
  • Podsiadlo D, Richardson S 1991 The timed “Up & Go”: A test of basic functional mobility for frail elderly persons. Journal of the American Geriatrics Society 39: 142–148.
  • Portney LG, Watkins MP 2007 Foundations of Clinical Research Application to Practice 3rd. Upper Saddle Ridge, NJ: Pearson Education (US).
  • Sandroff BM, Motl RW, Sosnoff JJ, Pula JH 2015 Further validation of the Six-Spot Step Test as a measure of ambulation in multiple sclerosis. Gait and Posture 41: 222–227.
  • Shrout PE, Fleiss JL 1979 Intraclass correlations: uses in assessing rater reliability. Psychological Bulletin 86: 420–428.
  • Shumway-Cook A, Woollacott MH 1995 Motor Control: theory and Practical Applications 2nd. Baltimore: Williams & Wilkins.
  • Shumway-Cook A, Woollacott MH 2012 Motor Control: translating Research into Clinical Practice 4th. Baltimore: Lippincott Williams & Wilkins.
  • Smidt N, van der Windt DA, Assendelft WJ, Mourits AJ, Deville WL, de Winter AF, Bouter LM 2002 Interobserver reproducibility of the assessment of severity of complaints, grip strength, and pressure pain threshold in patients with lateral epicondylitis. Archives of Physical Medicine and Rehabilitation 83: 1145–1150.
  • Wrisley DM, Marchetti GF, Kuharsky DK, Whitney SL 2004 Reliability, internal consistency, and validity of data obtained with the functional gait assessment. Physical Therapy 84: 906–918.
  • Yang YR, Chen YC, Lee CS, Cheng SJ, Wang RY 2007 Dual-task-related gait changes in individuals with stroke. Gait and Posture 25: 185–190.