3,170
Views
1
CrossRef citations to date
0
Altmetric
Assessment Procedures

The extremity function index (EFI), a disability severity measure for neuromuscular diseases: psychometric evaluation

, , , &
Pages 1561-1568 | Received 20 Jun 2016, Accepted 24 Feb 2017, Published online: 14 Mar 2017

Abstract

Objective: To adapt and to combine the self-report Upper Extremity Functional Index and Lower Extremity Function Scale, for the assessment of disability severity in patients with a neuromuscular disease and to examine its psychometric properties in order to make it suitable for indicating disease severity in neuromuscular diseases.

Design: A cross-sectional postal survey study was performed among patients diagnosed with a neuromuscular disease.

Methods: Patients completed both adapted extremity function scales, questionnaires for psychometric evaluation, and disease-specific questions. Confirmatory factor analysis was performed, and reliability and validity were examined.

Results: Response rate was 70% (n = 702). The Extremity Function Index model with a two-factor structure – for upper and lower extremities – showed an acceptable fit. The Extremity Function Index scales showed good internal consistency (alphas: 0.97–0.98). The known-groups validity test confirmed that Extremity Function Index scales discriminate between categories of “Extent of limitations” and “Quality of Life.” Convergent and divergent validity tests confirmed that Extremity Function Index scales measure the physical impact of neuromuscular diseases. Relative validity tests showed that the Extremity Function Index scales performed well in discriminating between subgroups of patients with increasing “Extent of limitations” compared to concurrent measurement instruments.

Conclusion: The Extremity Function Index proved to be a sound and easy to apply self-report disability severity measurement instrument in neuromuscular diseases.

    Implications for rehabilitation

  • The Extremity Function Index reflects the functioning of all muscles in the upper and lower extremities involved in activities of daily living.

  • The Extremity Function Index is an easy to administer and patient-friendly disability severity measurement instrument that has the ability to evaluate differences in disability severity between relevant neuromuscular disease subgroups.

  • The Extremity Function Index is a valid and reliable disability severity measurement instrument for neuromuscular diseases.

Introduction

Neuromuscular diseases (NMDs) generally lead to progressive impairment in body functions and therefore have a profound impact on physical and psychosocial life, with loss of mobility as one of the main problems [Citation1,Citation2]. Research into therapeutic approaches to neuromuscular disorders has progressed rapidly over the past decade and shows great promise for the future [Citation3]. Therefore, easy to apply and psychometrically sound assessment tools for evaluating disease severity or impairments in body functions are of growing importance.

Currently, the evaluation of disease severity in NMDs is mainly achieved by assessing muscle power functioning using electromyography, measuring muscle strength using handheld dynamometry or by manual muscle tests. However, such tests can be experienced as harmful and time consuming and do not reflect the subject’s functional abilities [Citation4]. In addition, there are observation-based measurements for NMD – as for example the Motor Function Measure scale [Citation4], and the disease-specific Muscular Dystrophy Functional Rating Scale [Citation5], but these measurements require patient exercise, a physiotherapy room and trained investigators. In order to overcome these disadvantages, self-report measuring instruments were developed, for example the disease-specific Amyotrophic Lateral Sclerosis Functional Rating Scale [Citation6,Citation7], measuring instruments administered by trained evaluators such as the Muscular Dystrophy Functional Rating Scale [Citation5], and a measurement of activity limitations the ACTIVLIM [Citation8], a combination questionnaire for children and adults. However, some of these instruments are disease-specific, evaluator-dependent or limited in feasibility. Also generic health-related quality-of-life (QoL) measurements – the SF-36, for example – are used to measure the impact of disabilities on QoL [Citation2]. Unfortunately, these generic measurements do not have specific items relevant for patients with a NMD, and therefore lacking sensitivity to change, while some of these items will be redundant when applied to NMDs [Citation1].

A well-known and commonly used disability-severity score, in clinical practice often used as indicator for disease severity, is the Expanded Disability Scale (EDSS) developed for patients with multiple sclerosis [Citation9]. This disability-severity score is based on limitations in mobility. The biggest advantages of the self-report version of the EDSS are that: (1) it is an easy instrument to administer in clinical practice and research and (2) it expresses disability severity in terms of a number, so that a change in disability severity can easily be evaluated [Citation10], For these reasons, we opted for limitations in mobility as a starting point for the development of a disability-severity measurement in NMDs that can serve as an indicator for disease-severity. This seems to be appropriate as it is known that muscle function related limitations in activities in NMDs are regarded as indicators of disease severity [Citation11,Citation12].

In summary, a valid and reliable, easy to administer, self-report disability-severity measurement instrument for adults, reflecting the functioning of muscles in the upper and lower extremities involved in activities of daily living covering NMDs is not available yet. Therefore, the aim of this study was to adapt and to combine two validated self-report questionnaires, the Upper Extremity Functional Index [Citation13] (UEFI) and the Lower Extremity Functional Scale [Citation14] (LEFS) as a disability severity measurement instrument in NMDs.

Patients and methods

Sample

A cross-sectional postal survey study was conducted among patients diagnosed with an NMD (n = 1003). These patients were registered at the Department of Neurology of the University Medical Center Groningen, the Netherlands. The sample comprised patients from the four major NMD groups according to Rowland: motor neuron disorders, muscle disorders, junction disorders, and peripheral nerve disorders [Citation15]. Patients were included if they could be assigned to one of these four NMD groups. Furthermore, patients also had to be aged 18 years or older, be able to read and write in Dutch, and able to provide informed consent.

Procedure

Patients received information about the study and were invited to participate. Patient’s informed consent was achieved by returning the completed questionnaire. Respondents completed both (adjusted) extremity function scales, questionnaires for psychometric evaluation, and answered demographic and disease-specific questions. Reminders were sent after two weeks. After the questionnaires had been returned, they were checked for completeness. If a page had not been completed, a copy was returned with a request to complete the missing questions or, if this only concerned one or a few questions, patients were interviewed by telephone.

The Medical Ethical Committee of the University Medical Center of Groningen has assessed the study proposal and concluded that approval was not required (Reference METc2009.310).

Extremity functioning index

The self-report Upper Extremity Functional Index (UEFI) and Lower Extremity Functional Scale (LEFS) were used as a basis for the disability-severity measure, the Extremity Functioning Index. Both scales were developed and validated for easy assessment of (limitations in) functioning. Each scale consists of 20 items assessing functional problems. Items were scored on a 5-point scale with discrete responses ranging from 0 (extremely difficult or unable to perform activity) to 4 (no difficulty). Items for both scales were summed for a total score ranging from 0 to 80 points, with higher scores representing higher levels of functioning. In previous studies, both scales showed good internal consistency (Cronbach’s alphas: 0.90 [Citation16] and 0.96 [Citation14] for the LEFS, and 0.95 [Citation13] for the UEFI), and stability (ICCs: 0.88 [Citation17] and 0.97 [Citation18] for the LEFS, and 0.85 [Citation19] for the UEFI).

For the purpose of this study, the LEFS and the UEFI were translated into Dutch following the procedure proposed by Guillemin et al. [Citation20]. First, the original Canadian English version was translated into Dutch by three researchers (IB, KvdB and HB) who have a working command of Dutch and English at academic level and who worked independently of each other. Secondly, the most satisfactory translation was chosen by consensus among the researchers. Thirdly, this Dutch translation was translated back into English by a native English speaker. Finally, the resulting English version was compared to the original English version, and all discrepancies were discussed by the three researchers. Any remaining discrepancies were discussed with the native English speaker.

The translated version of the LEFS and UEFI was reviewed by three medical specialists in NMDs (JBMK, GD and IB) and a methodologist (BM) on clarity, applicability and patient burden. As a result, six questions in the LEFS were adjusted for reasons of applicability in NMD patients concerning disease-specific limitations to walking distance (questions 11 and 12), sitting time (question 14), running (questions 16 and 17) and hopping (question 19). These questions were adjusted to shorter distances (questions 11 and 12), shorter duration (question 14), walking (questions 16 and 17) and jumping (question 19). Because of these disease-specific adjustments, we have renamed the LEFS into the Lower Extremity Functional Index (LEFI). Next, the feasibility of the UEFI and LEFI was examined by pre-testing in a sample of twenty randomly selected NMD-patients. No barriers or unclear and ambiguous items were found. For the UEFI, the LEFI and the combination of both scales, the EFI, item scores were transformed for both subscales (score range from 0 to 80) and the total scale (score range from 0 to 160) into index scales with scores ranging from 0 (not difficult) to 100 (extremely difficult).

Measurement instruments

To examine the psychometric properties of the EFI, the following measurement instruments were applied:

The Neuromuscular Disease Impact Profile (NMDIP), a broad and generic ICF-based disease impact measurement instrument that includes 36 items and consists of eight scales and four additional items [Citation21]. The 36 items represent the four ICF components. For the Body Functions component items and for the Participation component items scoring options ranged from 0 (no disability) to 4 (complete disability); for the Activities component items scoring options ranged from 0 (no disability) to 3 (complete disability); and for the Environmental Factors component items scoring options ranged from 0 (no support) to 2 (full support). Item scores were summed into a scale, with higher scores indicating more disability. In a previous study among Dutch NMD patients, the NMDIP domains showed satisfactory levels of internal consistency: Cronbach’s alpha ranged from 0.63 to 0.92 and Mean Inter-item Correlation Coefficient from 0.47 to 0.77 [Citation21].

The Medical Outcome Study 36-item Short Form Health Survey (SF-36) is a broad and generic Health-Related Quality Of Life (HRQoL) measurement and consists of 36 items divided over eight domains [Citation22]. For each domain, item scores were coded, summed, and transformed on a scale from 0 (worst health) to 100 (best health). In a previous study among Dutch multiple sclerosis patients, the SF-36 domains showed satisfactory levels of internal consistency: Cronbach’s alpha ranged from 0.81 to 0.94 [Citation21].

The Groningen Activity Restriction Survey (GARS) is a domain-specific instrument for measuring Limitation in activities and consists of 18 items divided over two scales [Citation23]. A four-category response format was used, ranging from 1 (no problem in performing without help) to 4 (impossible to perform). Scores were summed for each subscale. The GARS showed strong levels of internal consistency in a study among Dutch NMD patients: Cronbach’s alphas were 0.93 and 0.95 [Citation21].

Single item variables

The first variable “Extent of Limitations” was evaluated with the Extent of Limitations Visual Analogue Scale (VAS) [Citation24] Respondents were asked to answer the question: “To what extent are you limited due to your NMD?” Scoring options ranged from 0 (no limitation at all) to 10 (most severely limited). The second variable “Quality of Life” (QoL) was adapted from the WHOQOL-bref [Citation25]. Respondents were asked to answer the question: “How would you rate your quality of life?” Response options were: 1 = very poor, 2 = poor, 3 = neither poor nor good, 4 = good and 5 = very good.

Analysis

Descriptive statistics were used for describing the patient characteristics.

To construct the EFI, we hypothesized a two-factor model in which extremity functioning is measured within domains for upper extremity functioning (using items from the UEFI) [Citation13] and lower extremity functioning (using items from the LEFI) [Citation14]. Before testing the two-factor model, the data were examined for the presence of univariate (standardized scores: |z|≥3.30) and multivariate outliers (Mahalanobis Distance: p < 0.001) [Citation26,Citation27]. Next, to test the two-factor model a confirmatory factor analysis (CFA) was conducted using M-Plus 7.1 [Citation28]. The CFA methods used in this software are suitable for not normally distributed ordinal items and are based on polychoric correlations between standardized observed ordinal items [Citation29]. Factor loadings of >0.40 were considered sufficient [Citation30]. Model fit was examined using multiple criteria: (1) as a measure of overall fit, the root means squared error of approximation (RMSEA): ≤0.05 indicate a close fit, whereas values up to 0.08 indicate an adequate fit; and (2) as descriptive measures: a Comparative Fit Index (CFI) ≥ 0.95 and a Tucker–Lewis Index (TLI) ≥ 0.95 indicate an adequate fit [Citation31] To merge the two domains into one disability-severity measurement, a strong correlation was expected (Spearman’s correlation coefficient ≥0.70). For scale construction, the maximum number of missing items allowed to be replaced by the mean scale score was determined by a sufficient Cronbach’s alpha in relation to the number of scale items [Citation32].

Next the EFI scale features were examined. The internal consistency was examined using Cronbach’s alpha. Alpha was considered sufficient if ≥0.70 [Citation33,Citation34]. The distribution of scale scores was evaluated by calculating the median, mean, standard deviation, and the observed score range. Floor and ceiling effects were examined by calculating the proportions of patients with worst and best possible scores. Proportions ≤20% were considered acceptable [Citation35].

For examining psychometric properties, the Kruskal–Wallis test and the Mann–Whitney U-test were used for not normally distributed variables (Shapiro–Wilk test, p < 0.05).

Regarding known-groups validity [Citation36,Citation37], we hypothesized that the EFI scales should discriminate between respondent subgroups known to differ on relevant clinical characteristics. The variables “Extent of Limitations” and “Quality of Life” were used to create such relevant respondent subgroups. Respondents were divided into two groups of “Extend of Limitations”: those with a lower “Extent of Limitations” (score 0–4) and those with a higher “Extent of Limitations” (score 5–10). Respondents were divided into two groups of “Quality of Life”: those with a poor “Quality of Life” (response options 1–3) and those with good Quality of Life (response options 4–5).

Convergent and divergent validity was performed by examining the extent to which correlation values between EFI scales and concurrent measures were consistent with hypotheses. The Spearman rank order correlation coefficient (Rho, p, 2-tailed) was calculated between the EFI scales and concurrent scales. To support convergent validity, the EFI scales needed to have strong correlations (≥0.70), with scales covering the same domain in concurrent measurements (physical functioning scale and activity scales) [Citation38]. To support the divergent validity, the EFI scales should correlate weakly (≤ 0.40) with scales covering different domains (mental health scale) in concurrent measurements [Citation38].

Relative validity (RV) indicates the extent to which a scale or construct is able to discriminate between groups compared to the concurrent measures [Citation22,Citation39]. Respondents were divided into four groups of “Extent of Limitations”: Group A with a “No to low extent of limitation” (score 0–4), Group B with a “moderate extent of limitation” (score 5–6), Group C with a “high extent of limitation” (score 7–8) and, Group D with a “very high extent of limitation” (score 9–10). Next, RV of scales was examined in several steps. First, the Chi-square was computed for each scale by calculating the Kruskal–Wallis H-test. Second, the RV of each scale was computed by dividing each H-ratio by the H-ratio for the scale with the highest H-ratio, and multiplied by one hundred.

To estimate the magnitude of the clinical relevance of statistically significant group differences, the nonparametric effect size (coefficient r) for unrelated samples was calculated [Citation38]. The coefficient r was calculated by dividing the Z-statistic (obtained from the Mann–Whitney U-test) by the root of the sample size (n). To interpret these nonparametric effect sizes, Cohen suggests the following thresholds for interpretation: r < 0.10 indicates a trivial effect; r ≥ 0.10 to <0.24 a small effect; r ≥ 0.24 to <0.37 a moderate effect; and r ≥ 0.37 a large effect. An r ≥ 0.10 reflects a clinically relevant difference between groups [Citation38,Citation40].

Statistical analyses were performed using SPSS 23.0 for Windows and CFA was performed using M-Plus 7.1 (Los Angeles, CA).

Results

Patient characteristics

In sum, 702 patients (70% response rate) completed the questionnaires. The participants’ demographic and disease-specific characteristics are described in . Mean age of participants was 59 years (SD = 15.7), the mean number of years since diagnosis was 12 years, and about 30% of the respondents had retired due to an NMD. The motor neuron disorder group was relatively small compared to the other NMD subgroups (Rowland classification).

Table 1. Sample characteristics (n = 702).

Nonrespondents did not differ from respondents in terms of gender but were statistically significant younger (p values: 0.000, 2-sided) than respondents.

Extremity function index (EFI) structure

CFA confirmed the expected two-factor model with good loadings (). Each observed aspect in terms of use of lower or upper extremities, loaded sufficiently on the expected factor. Model fit indicators were sufficient with RMSEA 0.086 (90% confidence interval: 0.084–0.089), CFI 0.96, and TLI 0.96 and confirmed a good fit of the two-factor model using the Upper Extremity Functional Index (UEFI) and the Lower Extremity Functional Index (LEFI). As expected, the correlation between the UEFI and LEFI was strong (0.87), such that both functioning domains can be merged into one disability-severity measure.

Table 2. Factor loadings of the extremity function index (EFI) model.

Scale features

shows the scale features for the Extremity Function Index (EFI) total scale and EFI subscales for the total sample and for the four major NMD groups. Internal consistency for the EFI and both of the subscales was good. Cronbach’s alphas ranged from 0.97 to 0.98. No negative floor and ceiling effects were found.

Table 3. Scale features of the EFI total scale and subscales UEFI and LEFI (n = 702).

The final version of the EFI scale consists of two subscales each with twenty items, and also a total scale score can be calculated (appendix).

Known-groups validity

The known-groups validity of the EFI scales was confirmed by the expected group differences (). Patients classified as having greater “Extent of Limitations” or higher “Quality of Life” had significantly higher scores on the EFI scales compared with those classified as having lower “Extent of Limitations” or lower reported “Quality of Life”. Effect sizes were very large for “Extent of Limitations” and moderate for “Quality of Life” and confirmed clinical relevance.

Table 4. Known-groups validity of the extremity function index (n = 702).

Convergent and divergent validity

summarizes our findings on the convergent and divergent test of EFI scales. The direction, strength and pattern of correlations are as hypothesized. We found the expected high correlations for most of the similar constructs (bold figures in the table) confirming convergent validity. Unexpected was the moderate correlation with the NMDIP “Muscle Functions” variable. We found the expected low correlations (italic figures in the table) supporting divergent validity. Unexpected were the moderate correlations with the NMDIP “Mental Functions and Pain” variable.

Table 5. Results of convergent and divergent validity of EFI total and subscales (n = 702).

Relative validity (RV)

About 40% (n = 278) of the respondents reported “low extent of limitations” (Group A) due to NMD, while 24% (n = 169) reported a “moderate extent of limitation” (Group B), and 28% (n = 197) reported a “high extent of limitation” (Group C). About 8% (n = 58) of the respondents reported a “very high extent of limitations” (Group D).

Comparisons of the RV coefficients, as summarized in , revealed that the EFI “Lower Extremity Function Index” subscale and the Extremity Function Index total scale were the most valid in discriminating between groups with an increasing “Extent of Limitation.”

Table 6. Relative validity (RV) of the EFI, disease specific, domain specific and generic measurement instruments compared, using subgroups of extent of limitations (n = 702).

We then examined the performance of the EFI in indicating the differences between extreme groups (A–D) and subgroups (A–B, B–C, C–D) regarding the physical functioning construct, as it relates to the similar constructs in the concurrent measurement instruments.

Regarding physical functioning, we found that the NMDIP “Muscle Functions” performed slightly better compared to the “Lower Extremity Function Index.” Subgroup differences (A–B, B–C and C–D) were statistically significant and clinically relevant for all EFI scales.

In summary, the EFI scales showed one small, and furthermore large effect sizes in discriminating between (sub) groups with an increasing “Extent of Limitations” compared to similar physical functioning constructs in concurrent measures.

Discussion

The Extremity Function Index (EFI) appears to be a valid and reliable instrument for evaluating disability-severity in adult patients with an NMD. The confirmed model for the EFI included a two-factor structure with two one-dimensional scales with twenty indicators in the upper extremity function domain and twenty indicators in the lower extremity function domain. The reliability of the EFI and both subscales was good. Known-groups validity was supported by statistically significant and clinically relevant differences between groups of patients with a NMD that differed in terms of “Extent of Limitations” and “Quality of Life”. Expectations regarding the direction and strengths of the convergent and divergent correlations were confirmed for most correlations. Unexpected was the moderate correlation with the “Muscle Functions” variable. Apparently loss of muscle strength is more obvious in lower extremity function than in upper extremity function. Also unexpected were the moderate correlations with the NMDIP “Mental Functions and Pain” variable. Probably, the aspect of pain in this variable caused this stronger correlation with the EFI (sub)scales than expected. Finally, compared to concurrent domain specific and generic QOL measurement instruments the EFI performed well in discriminating between groups of NMD patients with an increasing “Extent of Limitations” as indicated on the visual analog scale.

A major strength of this study lies in the large and representative study population representing the four major NMD groups according to Rowland [Citation15], which improves the generalizability of the study results. As such the EFI may be considered applicable to the broad range of NMD patients that are encountered in clinical practice

A possible study limitation should be noted: the relatively small sample size of the motor neuron disorder group compared to the sample size of the other NMD groups. However, the complete study sample showed good representation of functional limitations in NMDs in terms of the use of upper and lower extremities in daily activities.

The EFI can have important implications for multidisciplinary care, research and for patients. Clinicians now have an easy to administer and patient-friendly disability-severity measurement instrument to evaluate the differences in disability-severity between relevant subgroups of NMD patients. These differences can be seen as an indicator for the ability of this measurement instrument for detecting changes in disability over time. Researchers also can compare disability-severity between groups of NMD patients. EFI could also have implications for patient self-management. For instance, EFI can offer patients a voice in making future decisions about assistive equipment and environmental adjustments.

Further research should focus on examining the relationship between objective and subjective disease-severity, psychometric evaluation concerning stability and sensitivity to change of the EFI, and validation across other populations of neuromuscular disease patients in other cultures.

In conclusion, this study showed that the Extremity Function Index (EFI) appears to be a reliable and valid disability-severity measurement instrument for NMDs. Moreover, the measure is an easy to administer and patient-friendly instrument for clinical practice and can also support clinical trials and epidemiological studies.

Supplemental material

Isa_c_Bos_et_al_supplemental_content.zip

Download Zip (13.4 KB)

Acknowledgements

We wish to thank the NMD patients who participated in this study, and who kindly shared personal information about the consequences of their disease by taking the time to complete our questionnaires. We thank Berrie Middel (BM), PhD, for methodological advice. We also wish to thank the patients’ family members and the students who assisted in data collection: Ronald Brans, Kyra van der Beek (KvdB), Hanna Bosman (HB), Annelies Verschure, Carolien Verschure, and Marieke Verschure. The study was supported by the Department of Neurology of the University Medical Center Groningen, the Netherlands.

Disclosure statement

The authors report no conflicts of interest. The authors alone are responsible for the content and writing of this article.

Additional information

Funding

The study was supported by the Department of Neurology of the University Medical Center Groningen, the Netherlands.

References

  • Burns TM, Graham CD, Rose MR, et al. Quality of life and measures of quality of life in patients with neuromuscular disorders. Muscle Nerve. 2012;46:9–25.
  • Graham C, Rose M, Grunfeld E, et al. A systematic review of quality of life in adults with muscle disease. J Neurol. 2011;258:1581–1592.
  • Arnett ALH, Chamberlain J. Therapy for neuromuscular disorders. Curr Opin Genet Dev. 2009;19:290–297.
  • Berard C, Payan C, Hodgkinson I, MFM Collaborative Study Group, et al. A motor function measure for neuromuscular diseases. Construction and validation study. Neuromuscul Disord. 2005;15:463–470.
  • Lue YJ, Su CY, Yang RC, et al. Development and validation of a muscular dystrophy-specific functional rating scale. Clin Rehabil. 2006;20:804–817.
  • Gordon P, Miller R, Moore D. ALSFRS-R. Amyotrophic lateral sclerosis and other motor neuron sisorders. 2004;5(Suppl): 90–93.
  • Montes J, Levy G, Albert S, et al. Development and evaluation of a self-administered version of the ALSFRS-R. Neurology. 2006;67:1294–1296.
  • Vandervelde L, Van den Bergh PYK, Goemans N, et al. ACTIVLIM: a rasch-built measure of activity limitations in children and adults with neuromuscular disorders. Neuromuscular Disord. 2007;17:459–469.
  • Kurtzke JF. Rating neurologic impairment in multiple sclerosis: an expanded disability status scale (EDSS). Neurology. 1983;33:1444–1452.
  • Wynia K. Change in disability profile and quality of life in multiple sclerosis patients: a five-year longitudinal study using the multiple sclerosis impact profile (MSIP). Mul Scler J. 2012-5;18:654–661.
  • Jopson NM, Moss-Morris R. The role of illness severity and illness representations in adjusting to multiple sclerosis. J Psychosom Res. 2003;54:503.
  • Rose M, Sadjadi R, Weinman J, et al. Role of disease severity, illness perceptions, and mood on quality of life in muscle disease. Muscle Nerve. 2012;46:351–359.
  • Stratford PW, Binkley JM, Stratford DM. Development and initial validation of the upper extremity functional scale. Physiother Can Fall. 2001;53:259–267.
  • Binkley JM, Stratford PW, Lott SA, et al. The lower extremity functional scale (LEFS): Scale development, measurement properties, and clinical application. North American orthopaedic rehabilitation research network. Phys Ther. 1999;79:371–383.
  • Rowland LP, McLeod JG. Classification of neuromuscular disorders. J Neurol Sci. 1994; 07124 Suppl:109–130.
  • Lin C, Moseley A, Refshauge K, et al. The lower extremity functional scale has good clinimetric properties in people with ankle fracture. Phys Ther. 2009;89:580–588.
  • Yeung TSM, Wessel J, Stratford P, et al. Reliability, validity, and responsiveness of the lower extremity functional scale for inpatients of an orthopaedic rehabilitation ward. J Orthop Sports Phys Ther. 2009;39:468–477.
  • Gabel CP, Michener L, Burkett B, et al. The upper limb functional index: Development and determination of reliability, validity, and responsiveness. J Hand Ther. 2006;19:328–348.
  • Hefford C, Abbott JH, Arnold R, et al. The patient-specific functional scale: Validity, reliability, and responsiveness in patients with upper extremity musculoskeletal problems. J Orthop Sports Phys Ther. 2012;42:56–65.
  • Guillemin F, Bombardier C, Beaton D. Cross-cultural adaptation of health-related quality of life measures: literature review and proposed guidelines. J Clin Epidemiol. 1993;46:1417–1432.
  • Bos I, Kuks J, Wynia K. Development and testing psychometric properties of an ICF-based health measure: the neuromuscular disease impact profile. J Rehabil Med. 2015;47:445–453.
  • Ware JE, Kemp JP, Buchner DA, et al. The responsiveness of disease-specific and generic health measures to changes in the severity of asthma among adults. Qual Life Res.1998;7:235–244.
  • Kempen GI, Miedema I, Ormel J, et al. The assessment of disability with the Groningen activity restriction scale. conceptual framework and psychometric properties. Soc Sci Med. 1996;1243:1601–1610.
  • Wynia K, Middel B, van Dijk JP, et al. The multiple sclerosis impact profile (MSIP). development and testing psychometric properties of an ICF-based health measure. Disabil Rehabil. 2008;30:261–274.
  • The WHOQOL Group. Development of the world health organization WHOQOL-bref quality of life assessment. World Health Forum. 1998;28:551–558.
  • McClelland GH. Nasty data: Unrully, ill-mannered observations can ruin your data. In: Reis HT, Judd CM, editors. Handbook of research methods in social and personality psychology. New York, NY: Cambridge University Press; 2000;23:608–627.
  • Finch WH. Distribution of variables by method of outlier detection. Front Psychol. 2012;3:211. doi: 10.3389/fpsyg.2012.00211
  • Muthén LK, Muthén BO. Mplus statistical analysis with latent variables; users guide [computer program]. 7th ed. Los Angeles, CA: Muthén & Muthén; 1998–2012.
  • Flora DB, Curran PJ. An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data. Psychol Methods. 2004;9:466–491.
  • Muthen B. Mplus technical appendices (1998–2004). Los Angeles, CA: Muthen & Muthen; 2004.
  • Browne MW, Cudeck R. Alternative ways of assessing model fit. In: Bollen KA, Long JS, editors. Testing structural equation models. Newbury Park (CA): Sage; 1993.
  • Sonderen v. E. Omgaan met ontbrekende gegevens in het bijzonder bij schaal items (how to handle missing data in particular scale items). Verpleegkunde, Nederlands-Vlaam Wetenschappelijk Tijdschrift Voor Verpleegkundigen. 2000;15(2):104–111.
  • Nunnally JC. Bernstein IH., editors. Psychometric theory. 3rd ed. New York: McGRAW-HILL, INC.; 1994.
  • Clark LWD. Constructing validity: basic issues in objective scale development. Psychol Assess. 1995;7:309–319.
  • Hobart JC, Riazi A, Lamping DL, et al. Improving the evaluation of therapeutic interventions in multiple sclerosis: Development of a patient-based measure of outcome. Health Technol Assess. 2004;8:1–48.
  • Polit DF BC, editor. Nursing research: principles and methods. Philadelphia, New York, Baltimore: Lippincott; 2004.
  • Streiner DNG. Health measurement scales, a practical guide to their development and use. 4th ed. Oxford: Oxford University Press; 2008.
  • Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Hillsdale, NJ: Lawrence Erlbaum Associates; 1988.
  • Luo N, Johnson JA, Shaw JW, et al. Relative efficiency of the EQ-5D, HUI2, and HUI3 index scores in measuring health burden of chronic medical conditions in a population health survey in the united states. Med Care. 2009;47:53–60.
  • Andersen M, Johnson U, Lindwall M, Ivarsson A. To adjust or not adjust: Nonparametric effect sizes, confidence intervals, and real-world meaning. Psychol Sport Exerc. 2013;14:97–102.