1,622
Views
27
CrossRef citations to date
0
Altmetric
Research Articles

Development and initial evaluation of the SCI-FI/AT

, , , , , , , , , , , & show all

Abstract

Objectives

To describe the domain structure and calibration of the Spinal Cord Injury Functional Index for samples using Assistive Technology (SCI-FI/AT) and report the initial psychometric properties of each domain.

Design

Cross sectional survey followed by computerized adaptive test (CAT) simulations.

Setting

Inpatient and community settings.

Participants

A sample of 460 adults with traumatic spinal cord injury (SCI) stratified by level of injury, completeness of injury, and time since injury.

Interventions

None

Main outcome measure

SCI-FI/AT

Results

Confirmatory factor analysis (CFA) and Item response theory (IRT) analyses identified 4 unidimensional SCI-FI/AT domains: Basic Mobility (41 items) Self-care (71 items), Fine Motor Function (35 items), and Ambulation (29 items). High correlations of full item banks with 10-item simulated CATs indicated high accuracy of each CAT in estimating a person's function, and there was high measurement reliability for the simulated CAT scales compared with the full item bank. SCI-FI/AT item difficulties in the domains of Self-care, Fine Motor Function, and Ambulation were less difficult than the same items in the original SCI-FI item banks.

Conclusion

With the development of the SCI-FI/AT, clinicians and investigators have available multidimensional assessment scales that evaluate function for users of AT to complement the scales available in the original SCI-FI.

Introduction

Functional performance is a critical outcome of many clinical trials. Generic functional outcome measures developed for the general population often lack the sensitivity needed to detect meaningful differences in the spinal cord injury (SCI) population, and they also contain irrelevant and sometimes offensive questions that lack validity in this population.Citation1 Patient or condition-specific instruments often lack the scope of coverage that limits their utility in broad, heterogeneous samples of patients with a SCI.Citation2Citation5

In recognition of these limitations, the Spinal Cord Injury Functional Index (SCI-FI) was developed by a team from seven SCI Model Systems Centers. This successful collaboration resulted in a new comprehensive activity limitations outcome instrument (called SCI-FI) that offers unique advantages over outcome measures developed using classical test theory methods. The content of the SCI-FI is based upon an extensive review of the literature, input from patients with SCI and clinician focus groups, and grounded within the International Classification of Function, Disability, and Health (ICF) framework.Citation6 An extensive item pool was developed across several hypothesized functional domains and administered to 855 adults with a traumatic SCI. Investigators applied factor analysis and item response theory (IRT) methods to identify five distinct functional domains (Ambulation, Basic Mobility, Fine Motor Function, Self-care and Wheelchair Mobility). Computer adaptive test (CAT) and short form methodologies were employed to allow the SCI-FI to achieve measurement precision while reducing participant burden.Citation7Citation9

There is a growing body of evidence to support the psychometric propertiesCitation10 of the SCI-FI item banks.Citation10Citation12 A SCI-FI pilot study in 269 adults with recent SCI demonstrated excellent responsiveness to change during the first post-injury year. The SCI-FI achieved excellent 1–2 week test-retest reliability, with Pearson's r values ranging from 0.92 (Wheelchair Mobility) to 0.96 (Self-care).Citation10

Assessing the impact of Assistive Technology (AT) on a person's function is a significant problem that must be addressed when developing a rehabilitation outcome measure. When ability to do an activity is scored without regard to use of AT, the functional score may improve without a concomitant change in ability, thus weakening the relationship between impairment and activity.Citation13 Unfortunately, the rehabilitation field lacks a consistent approach for assessing function when AT is used. Rust and Smith analyzed 100 rehabilitation measures and categorized how these measures addressed AT use.Citation14 AT use was ignored in some measures, while other measures raised or lowered functional scores for items when the person used AT. Current SCI functional measures, such as the Spinal Cord Independence Measure, Quadriplegic Index of Function and the Functional Independence Measure account for the impact of AT using a hybrid approach where the use of AT is considered for some, but not all items. For items that consider AT, the item score is lower when AT is used than when it is not. However, because of the inconsistent consideration of AT in item scores, the summary scores derived from these measures do not fully reflect the impact of AT on a person's ability to function.Citation4,Citation5,Citation15

The original SCI-FI banks contain few items in the calibrated item pool that measure a person's performance using AT. Outside of the wheelchair mobility scale, only 17 of 219 items calibrated in the SCI-FI item pool involve the use of AT. Moreover, the instructions to examinees who complete the SCI-FI item banks instruct participants to rate each item on their ability to perform the task without the use of AT or help from another person.

Given the paucity of AT related items and the focus on completion of tasks without devices, the original SCI-FI is likely to be less sensitive to how people perform activities in their usual settings when they would typically use AT to complete a task. The SCI-FI banks are also not likely to be precise and sensitive to clinically relevant change for interventions where the focus is on function using usual and customary AT. For this reason, we developed the SCI-FI/AT to provide supplemental item content that is most appropriate for AT users.

In this manuscript, we present analysis of the structure of the SCI-FI/AT item banks, describe the calibration and development of the SCI-FI/AT items and scales in each identified domain, discuss the initial psychometric properties of the SCI-FI/AT item banks, and compare the SCI-FI/AT item level difficulties with parallel item content in the original SCI-FI.

Methods

Sample

This SCI-FI/AT sample included 460 participants with a diagnosis of traumatic SCI recruited from 9 national SCI Model Systems programs. Eligibility criteria included age of 18 or older and the ability to speak and understand English. To best test functional performance with AT, the sample was stratified to include approximately equal numbers of individuals with the following characteristics: level of injury (paraplegia vs. tetraplegia), severity of injury (complete vs. incomplete) and time since injury (<1 year, 1–3 years, >3 years). The sample was representative of the key demographic variables reported by the National Spinal Cord Injury Statistical Center (NSCISC).Citation16 The original SCI-FI sample is described in detail elsewhere.Citation7,Citation8

Item bank development

We reviewed the previously calibrated SCI-FI item banks to retain as many items as possible from the original SCI-FI version so that similar items would be administered in the SCI-FI/AT, but with different instructions about the use of AT. The original SCI-FI items in each item bank were reviewed by two investigators (Alan M. Jette, Mary D. Slavin) to determine if items could be retained in SCI-FI/AT. This review identified original SCI-FI items that included use of a specific type of AT or human assistance. These items were removed, as they were inappropriate for SCI-FI/AT instructions which ask participants to select a response based on their ability “without help from another person, but using the equipment or devices they normally use.” After inappropriate items were removed, the SCI-FI/AT item content for each domain was again reviewed to ensure that all important functional activities assessed in the original SCI-FI were also included in SCI-FI/AT. It was noted that items related to bathing and skin inspection had been removed from the Self Care domain, so we wrote two new items using the relevant item content from the original version, but removing the reference to AT.

The final item pools administered in the SCI-FI/AT calibration study included: 47 Basic Mobility items, 35 Fine Motor Function items, 78 Self-care items, and 30 Ambulation items. The Basic Mobility item bank contains items about an individual's ability to carry out activities involving changing and maintaining body position, transfers, moving and carrying objects and moving around in different locations. The Ambulation items assess the ability to engage in walking activities in different locations that vary based on speed, time and condition and the ability to manage stairs under different conditions.

Data collection procedures

All study personnel completed a 2-hour training session and successfully completed a certification program. Each site received a detailed manual of procedures and data collectors participated in bi-weekly quality control calls. To replicate the procedure used in the original SCI-FI calibration study, all SCI-FI/AT items were presented in interview format, either over the phone or in-person. Demographic information (e.g. age, sex, ethnicity, race), descriptors related to the spinal cord injury (e.g. date of injury, mechanism of injury, level of lesion, and severity) and responses to screener questions to select appropriate sets of supplemental items (e.g. mobility status, living situation, use of bowel and bladder program) were collected for all participants. Color-coded response cards were used to direct participants to the appropriate response set for each item or set of items. In contrast to the SCI-FI instructions, SCI-FI/AT participants were asked to select the response that best describes their current ability to do each activity without help from another person, but using the equipment or devices they normally use. Participants could skip an item if they were not able to determine a correct response. The data collection system automatically added supplemental items based on each participant's responses to screener questions (e.g. items about walking were only administered to ambulatory participants).

Data analytic procedures

We employed confirmatory factor analysis (CFA) to examine the unidimensionality in each domain based on weighted least squares means and variance adjusted (WLSMV) estimation. We used the χ2 test and multiple fit indices (e.g. Tucker-Lewis Index, Comparative Fit Index)Citation17,Citation18 to assess the model fit. We also examined local item dependence (LID)Citation19 by looking at the residual correlation matrix output from Mplus.Citation20 Any item pair with an absolute residual correlation greater than 0.2Citation21 was considered to exhibit unacceptable LID and one of the items was removed from the item pool.

Prior to finalizing the item banks, we used a three-step procedure to examine Differential Item Functioning (DIF) in group variables (sex and diagnosis—tetraplegia and paraplegia). First, each item response was fitted to two ordinal logistic regression models. The first model included the raw summed score as the single independent variable, while the second model included the three independent variables of raw summed score, group variable, and interaction between the raw summed score and group variable. Any item that exhibited an R-squared change of greater than 0.02Citation22 between the two models was flagged for potential DIF. For each flagged item, we examined item category frequency distributions across subgroups (i.e. female vs. male or paraplegia vs. tetraplegia); if any subgroup sample size was less than 100 or if an item category's percentage of responses was greater than 90% (i.e. a dominating response category), this item's calibration was rejected for this subgroup.Citation23,Citation24 Next, we set the non-DIF items' parameters equal across subgroups and calibrated the DIF items separately in each subgroup based on a two-group IRT model. We then examined the item parameter equality based on Wald's χ2 testCitation25 and used the Benjamini–Hochberg (BH) procedureCitation26 to adjust the P value. Finally, we applied the weighted Area Between the expected score Curves (wABC) to examine the DIF impact.Citation27 The wABC was calculated by integrating the absolute difference between the expected core functions of reference and focal samples over the latent distribution. Any DIF item with wABC > 0.4Citation27 was treated as demonstrating DIF. Since our goal was to retain as many items as possible, we either calibrated the DIF item separately in relevant subgroups if the sample size in each subgroup was greater than 100 and there was no dominating response category, or only calibrated an item in one subgroup with the expectation that these items will only be administered in this subgroup in the future. We wanted to retain the maximum number of items and obtain stable item calibrations. Therefore, we adopted the least restricted requirement for Rasch models (100 subjects). Most of the subgroup sample sizes were around 200 in the examined items, which is the minimum number requirement in 2-parameter model. The analytic model was successfully converged, items showed a reasonable hierarchical difficulty order and we examined the item fit based on calibrated item parameters.

Each item bank was calibrated with the Graded Response ModelCitation28 as implemented in IRTPRO. The Pearson χ2 test, which examines the discrepancy between the expected and actual number of subjects in each category at each summed score level (S-X2 Test) was used to assess item fit. An item with a test statistic P value less than 0.05 was eliminated from the final item bank ().

Figure 1. Algorithms for Statistical Procedures used to construct Item Pool. *Treated as the same item in subgroup; **treated as different item in subgroup.

Figure 1. Algorithms for Statistical Procedures used to construct Item Pool. *Treated as the same item in subgroup; **treated as different item in subgroup.

Subject scores on the final item banks were estimated with weighted maximum likelihood estimation. We also conducted real data simulations in SAS 9.13 (SAS Institute Inc., Cary, NC, USA) to estimate person scores on 10-item fixed-length computerized adaptive tests. To compare the differences in item parameters between common items to the SCI-FI and SCI-FI/AT, we identified 41, 68, 33 and 29 common items between two versions in Basic Mobility, Self-care, Fine Motor and Ambulation domains, respectively. One item in basic mobility, 6 items in Self-care and 5 items in Fine Motor Function calibrated separately in subgroups in SCI-FI/AT. We eliminated those items in an equality test. To test item parameter invariance in the evaluation of IRT-based DIF, we would need to have the full covariance matrix of the item parameter estimations from both versions. However, SCI-FI was calibrated in PARSCALE which doesn't provide such information. Based on the similarity of sample characteristic across two calibration samples, we re-calibrated the common items across both versions at the same time using IRTPRO (in the Ambulation scale, we only looked at the item parameters when we calibrated the items based on the subjects who walk some or all the time). We assessed invariance of the paired common items by using a Wald χ2 test (BH adjusted P value) and plotted the paired common item difficulty in each domain. Item difficulty is defined as the average value of threshold parameters in the Graded Response Model. The threshold parameter for response category x represents the point on the underlying concept at which the subject will have 50% probability of endorsing this category or higher. All the analyses were done in Mplus 6.0,Citation20 IRTPRO 2.10,Citation29 and SAS 9.13.Citation30

Results

displays the background characteristics of the SCI-FI and SCI-FI/AT samples. The demographic makeup of the two samples was quite similar: just over half of each sample with tetraplegia, and in both samples the average age was 43 years and over three quarters of participants were male and living at home. One notable difference is that there were more participants of Hispanic origin in the SCI-FI (11%) sample as compared with the SCI-FI/AT sample (5%).

Table 1. Background Characteristic of original SCI-FI and SCI-FI/AT Calibration Samples

presents the CFA results along with goodness-of-fit indices for unidimensional CFA models of the SCI-FI/AT item banks. Results replicate the original SCI-FI structure with acceptable fit statistics for distinct domains of Basic Mobility, Self-care, Fine Motor Function, and Ambulation.

Table 2. Goodness-of-fit Indices for Unidimensional CFAs for the SCI-FI/AT final Item Banks*

As seen in , not all initial items were retained in the final calibrated item banks. Of the 190 items included in the SCI-FI/AT study, 176 distinct items were retained and calibrated with IRT analysis. One misfitting item and 9 LID items were removed from the final item bank. No items exhibited DIF for sex. However, 17 identified items exhibited DIF by level of injury. Of these, 5 items were removed and the remaining 12 items with DIF by level of injury were calibrated separately by level of injury and retained in the final item banks. We ended up with 41 items in the Basic Mobility domain, 71 in the Self-care domain, 35 in the Fine Motor Function domain, and 29 in the Ambulation domain.

Table 3. SCI-FI/AT Item Bank Refinement

As seen in , the calibration dataset was used to compare the accuracy and breadth of coverage of the full bank and a 10-item simulated CAT for each SCI-FI/AT domain by level of injury. The correlation of the 10-item CATs with the full item bank exceeded 0.97 for all 4 SCI-FI/AT domains across both paraplegia and tetraplegia. For those with tetraplegia, there were minimal ceiling and floor effects for all 4 functional domains. For those with paraplegia, there were minimal to no floor effects seen for all domains and a minimal ceiling effect for the Basic Mobility domain. As expected, 35% or more of the sample with paraplegia were at the ceiling of the Fine Motor Function scale and 15–18% of this sample was at the ceiling of the Self-care function scale. , which displays the distribution of response categories/items for each functional domain, illustrates the relative paucity of response categories/items above 2 standard deviations from the mean for the SCI-FI/AT domains of Fine Motor Function and Self-care which correlates with the ceiling effects seen in participants with paraplegia.

Figure 2. Distribution of the SCI-FI/AT items/calibrations for each content domain.

Figure 2. Distribution of the SCI-FI/AT items/calibrations for each content domain.

Table 4. Accuracy, Breath of Coverage for a SCI-FI/AT 10-item CAT and Full Item Bank for Each Content Domain by Neurologic Level

presents the reliability of each domain scale for the simulated 10-item CAT and full item bank. As these data illustrate, 75–98% of subjects achieved reliability >0.90 in each content domain.

Table 5. Number (%) of subjects with reliability >0.9 for 10-item CATs and the full item bank for each content domain

Finally, A–D presents a comparison of the level of difficulty of common items in SCI-FI versus SCI-FI/AT for each content domain. While there is little difference in item difficulty in the Basic Mobility domain scale (12.5%), for the most part, SCI-FI/AT items in the domains of Self-care (90.3%), Fine Motor Function (75.0%) and Ambulation (51.7%) were less difficult than the same items in SCI-FI, which is consistent with the DIF results.

Figure 3. Comparison of Item Difficulty in SCI-FI versus SCI-FI/AT for each content domain. (A) Basic Mobility. (B) Self-care. (C) Fine Motor Function. (D) Ambulation.

Figure 3. Comparison of Item Difficulty in SCI-FI versus SCI-FI/AT for each content domain. (A) Basic Mobility. (B) Self-care. (C) Fine Motor Function. (D) Ambulation.

Discussion

The SCI-FI instruments are multidimensional activity limitation measures appropriate for persons with SCI, regardless of level or severity of injury. The original SCI-FI scales allow for the assessment of functional capacity in the domains of Basic Mobility, Ambulation, Self-care, and Fine Motor Function. With the development of the SCI-FI/AT, we now have available scales that evaluate a person's ability to function using AT in the areas of Basic Mobility, Self-care, Ambulation, and Fine Motor Function along with the previously calibrated Wheelchair domain. In this study, the SCI-FI/AT replicated the domain structure in the original SCI-FI instrument and revealed strong psychometric properties for all domain scales in a sample of adults with paraplegia or tetraplegia. High correlations of simulated 10-item CATs with full item banks indicated high accuracy of the CAT in estimating a person's underlying functional ability. Furthermore, there was high measurement reliability for the simulated CAT scales compared with the full item bank.

SCI clinical trials, efforts to enhance our understanding about the plasticity of the nervous system and potential for neurorecovery, and studies of the efficacy of current rehabilitation interventions require functional outcome instruments that are capable of evaluating not only a person's functional capability but also one's ability using whatever AT and accommodations that are employed by the patient to accomplish the desired activity.Citation31 Although they are sometimes used interchangeably, clear distinctions exist between the concept of functional capacity and functional performanceCitation32 and these distinctions must be reflected in the development and selection of outcomes instruments.Citation13 Capability refers to what a person can do without assistance while performance reflects what the person does in his or her environment using whatever means are available.Citation33 For instance, studies examining the actual neurorecovery experienced by individuals or evaluating new physical therapy interventions designed to restore underlying functional capacity would benefit from the inclusion of SCI-FI banks. In contrast, the SCI-FI/AT would be a much more important measure and potentially more sensitive measure to include in trials of new types of adaptive equipment or of occupational therapy interventions designed to improve daily functioning at a constant level of functional capacity. Furthermore, the SCI-FI could be used to justify continued rehabilitation to payers who may require documentation of continued recovery of functional capacity (SCI-FI) or functional performance (SCI-FI/AT) as a requirement for continuity of care.

The CAT methodology for administering the SCI-FI assessments is ideally suited for the complexity faced when assessing functional abilities of those with SCI and overcomes many of the conceptual and methodological limitations with traditional measurement approaches.Citation2Citation5,Citation15 A CAT can be used to assess selective functional domains with a common metric and the existence of SCI-FI and SCI-FI/AT versions allows for the investigator or clinician to assess either functional capacity or functional performance, or both. Instead of using multiple traditional instruments, one can employ filter questions to select the appropriate domains and items that reflect a person's clinical and demographic background, thus avoiding inappropriate functional domains and items.

Although the SCI-FI now provides researchers and clinicians with the ability to assess functional performance across multiple domains for persons with SCI, there are several limitations to this study that should be noted. For those with paraplegia, the breadth of coverage in the Self-care and Fine Motor Function scales could be improved by including more challenging tasks in each item pool, thus reducing some of the ceiling effects that were observed in the calibration sample. Secondly, further psychometric analyses on new samples is needed to confirm the results reported in this study. In fact, a validation study of the SCI-FI/AT instrument is underway in an independent sample that will examine its validity and responsiveness to clinically meaningful change in comparison with the SCI-FI and legacy instruments. And finally, having more participants of Hispanic origin in the original SCI-FI sample (11%) as compared with the SCI-FI/AT sample (5%) could account for some of the difference in item difficulty that were observed across the two versions of SCI-FI.

Conclusion

The findings from this study provide preliminary evidence that the SCI-FI/AT's domain structure is similar to the SCI-FI and revealed strong psychometric properties for all functional domain scales in a sample of adults with paraplegia or tetraplegia. High correlations of simulated 10-item CATs with full item banks indicated high accuracy of the CAT in estimating a person's functional ability and there was high measurement reliability for the simulated CAT scales compared with the full bank scores. SCI-FI/AT item difficulties in the domains of Self-care, Fine Motor Function, and Ambulation were less difficult than the same items in SCI-FI item banks. With the development of the SCI-FI/AT, we have available multidimensional assessment scales that evaluate function using AT in the areas of Basic Mobility, Self-care, Ambulation, Fine Motor Function, and Wheelchair Mobility to complement the domain scales already available in the SCI-FI.

Disclaimer statements

Contributors No other contributors.

Funding This work was supported by US Department of Education, National Institute of Disability and Rehabilitation Research Grant Numbers: H133N120002, H133N060014, H133N110006, H133N110020, H133N110002, H133N110021, H133N110011, H133N110007, H133N110009.

Conflicts of interest None.

Ethics approval The Institutional Review Board at each site reviewed and approved this project.

References

  • Hall KM, Cohen ME, Wright J, Call M, Werner P. Characteristics of the Functional Independence Measure in traumatic spinal cord injury. Arch Phys Med Rehabil 1999;80(11):1471–6.
  • Mills T, Holm MB, Trefler E, Schmeler M, Fitzgerald S, Boninger M. Development and consumer validation of the Functional Evaluation in a Wheelchair (FEW) instrument. Disabil Rehabil 2002;24(1–3):38–46.
  • Demers L, Weiss-Lambrou R, Ska B. Development of the Quebec User Evaluation of Satisfaction with assistive Technology (QUEST). Assist Technol 1996;8(1):3–13.
  • Gresham GE, Labi ML, Dittmar SS, Hicks JT, Joyce SZ, Stehlik MA. The Quadriplegia Index of Function (QIF): sensitivity and reliability demonstrated in a study of thirty quadriplegic patients. Paraplegia 1986;24(1):38–44.
  • Itzkovich M, Gelernter I, Biering-Sorensen F, Weeks C, Laramee MT, Craven BC, et al. The Spinal Cord Independence Measure (SCIM) version III: reliability and validity in a multi-center international study. Disabil Rehabil 2007;29:1926–33.
  • Slavin MD, Kisala PA, Jette AM, Tulsky DS. Developing a contemporary functional outcome measure for spinal cord injury research. Spinal Cord 2009;48(3):262–7.
  • Heinemann A, Dijkers M, Ni P, Tulsky D, Jette AM. Measurement properties of the spinal cord injury-functional index (SCI-FI) short forms. Arch Phys Med Rehabil 2014;95(7):1289–97.
  • Tulsky DS, Jette AM, Kisala PA, Kalpakjian C, Dijkers MP, Whiteneck G, et al. Spinal cord injury-functional index: item banks to measure physical functioning in individuals with spinal cord injury. Arch Phys Med Rehabil 2012;93(10):1722–32.
  • Jette AM, Tulsky DS, Ni P, Kisala PA, Slavin MD, Dijkers MP, et al. Development and initial evaluation of the spinal cord injury-functional index. Arch Phys Med Rehabil 2012;93(10):1733–50.
  • Kisala PA. The validation and responsiveness of the SCI-FI and SCI-QOL item banks. In Kisala P. (chair), New developments in the SCI-QOL/SCI-FI measurement system. Symposium presented at the 90th American Congress of Rehabilitation Medicine annual conference, Orlando, FL, Nov 2013.
  • Kisala PA. Spinal Cord Injury Functional Index (SCI-FI1) longitudinal study results. [ASIA abstract] Top Spinal Cord Inj Rehabil. 2014;20 (Suppl 1).
  • Tulsky DS, Kisala PA, Victorson D, Choi SW, Gershon R, Heinemann AW, et al. Methodology for the development and calibration of the SCI-QOL item banks. J Spinal Cord Med 2015;38(3):270–87.
  • Marino RJ. Domains of outcomes in spinal cord injury for clinical trials to improve neurological function. J Rehabil Res Dev 2007;44(1):113–22.
  • Rust KL, Smith RO. Assistive technology in the measurement of rehabilitation and health outcomes. Am J Phys Med Rehabil 2005;84(10):780–93.
  • Guide for the Uniform Data Set for Medical Rehabilitation (including the FIM(TM) instrument), Version 5.1. Buffalo, NY 14214–3007: State University of New York at Buffalo; 1997.
  • SCI Facts and Figures at a Glance. J Spinal Cord Med. 2014;37(2):243–4.
  • Tucker L, Lewis C. A reliability coefficient for maximum likelihood factor analysis. Psychometrika 1973;38(1):1–10.
  • Bentler PM. Comparative fit indexes in structural models. Psychol Bull 1990;107(2):238–46.
  • Steinberg L, Thissen D. Use of item response theory and the testlet concept in the measurement of psychopathology. Psychol Methods 1996;1(1):81–97.
  • Muthén LK, Muthén BO. Mplus User's Guide. 6th edn. Los Angeles, CA: Muthén & Muthén; 1998–2011.
  • Reeve BB, Hays RD, Bjorner JB, Cook KF, Crane PK, Teresi JA, et al. Psychometric evaluation and calibration of health-related quality of life item banks: plans for the Patient-Reported Outcomes Measurement Information System (PROMIS). Med Care 2007;45 (5 Suppl 1):22–31.
  • Bjorner JB, Kosinski M, Ware JE Jr. Calibration of an item pool for assessing the burden of headaches: an application of item response theory to the headache impact test (HIT). Qual Life Res 2003;12(8):913–33.
  • Linacre JM. Sample size and item calibration stability. Rasch Measurement Transactions 1994;7(4):328.
  • Forero CG, Maydeu-Olivares A. Estimation of IRT graded models for rating data: Limited vs. full information methods. Psychol Methods 2009;14(3):275–99.
  • Woods CM, Cai L, Wang M. The Langer-improved Wald test for DIF testing with multiple groups: evaluation and comparison to two-group IRT. Educ Psychol Meas 2012;73(3):532–47.
  • Thissen D, Steinberg L, Kuang D. Quick and easy implementation of the Benjamini-Hochberg procedure for controlling the false positive rate in multiple comparisons. J Educ Behav Stat 2002;27:77–83.
  • Edelen MO, Stucky BD, Chandra A. Quantifying ‘problematic’ DIF within an IRT framework: application to a cancer stigma index. Qual Life Res 2015;24(1):95–103.
  • Samejima F, van der Liden W, Hambleton R. The graded response model. Handbook of modern item response theory. New York, NY: Springer; 1996:85–100.
  • Cai L, du Toit SHC, Thissen D. IRTPRO: Flexible, multidimensional, multiple categorical IRT modeling [Computer software]. Chicago: SSI International; 2011.
  • SAS Institute Inc., SAS 9.1.3 Help and Documentation, Cary, NC: SAS Institute Inc.; 2000–2004.
  • Ditunno JF, Jr, Burns AS, Marino RJ. Neurological and functional capacity outcome measures: essential to spinal cord injury clinical trials. J Rehabil Res Dev 2005;42(3 Suppl 1):35–41.
  • WHO Global Report on Falls Prevention in Older Age. World Health Organization. Geneva, Switzerland: WHO Press; 2007.
  • Holsbeeke L, Ketelaar M, Schoemaker MM, Gorter JW. Capacity, capability, and performance: different constructs or three of a kind? Arch Phys Med Rehabil 2009;90(5):849–55.