2,609
Views
10
CrossRef citations to date
0
Altmetric
Review Article

Content and psychometric evaluations of questionnaires for assessing physical function in people with neck disorders: a systematic review of the literature

&
Pages 2227-2235 | Received 03 Nov 2016, Accepted 19 May 2017, Published online: 02 Jun 2017

Abstract

Purpose: The purpose was to investigate how physical function is assessed in people with musculoskeletal disorders (MSD) in the neck. Specifically, we aimed to determine: (1) Which questionnaires are used to assess physical function in people with MSD in the neck? (2) What do those questionnaires measure? (3) What are the measurement properties of the questionnaires?

Materials and methods: A systematic review was performed to identify questionnaires and psychometric evaluations. The content of the questionnaires was categorized according to the International Classification of Function, Disability and Health, and the psychometric properties were quality-rated using the COnsensus-based Standards for the selection of health Measurement INstruments checklist.

Results: Ten questionnaires and 32 articles evaluating measurement properties were analyzed. Most questionnaires covered only the components body functions and activity and participation, more often activity participation than body function. Internal consistency was adequate in most questionnaires, whereas responsiveness was generally low. Neck Disability Index was most evaluated, but the evaluations of all questionnaires tended to cover most properties in the checklist.

Conclusions: The questionnaires differed substantially in items and extent to which their psychometric properties had been evaluated. Focus of measurement was on activities in daily life rather than physical function as such.

    Implications for Rehabilitation

  • To provide early diagnostics and effective treatment for patients with neck disorders, valid and reliable instruments that measure relevant aspects of the disorders are needed.

  • This paper presents an overview of content and quality of questionnaires used to assess physical function in neck disorders, which may facilitate informed decisions about which measurement instruments to use when evaluating the course of neck disorders.

  • Most of the questionnaires need more testing to judge the quality, however the NDI was the most frequently tested questionnaire.

  • The COnsensus-based Standards for the selection of health Measurement INstruments checklist is a useful tool in relation to psychometric testing of questionnaires, but clear definitions of interpretation of the quality criteria in each study would enhance comparability of results.

Introduction

Musculoskeletal disorders (MSD) are common, and cause a burden to both individuals and society. World-wide prevalence rates of 33% for shoulder pain, 50% for low back pain, and 20% for widespread pain in a 1-month period have been reported [Citation1]. Similar numbers are presented for neck disorders in, e.g., the Netherlands (31% 1-year prevalence) [Citation2]. To reduce these disorders, we need early diagnostics and effective treatment/rehabilitation. In order to achieve this, valid and reliable instruments that measure relevant aspects of the disorders are required. This is of importance both for correctly diagnosing a patient's condition prior to treatment, and for evaluating intervention effects.

A cost-efficient way of assessing a patient’s condition is to use patient rated outcome measure scales, as they provide the necessary information without involving a lot of personnel to perform health examinations or interviews. One problem with using patient rated outcome measure scales, however, is the lack of consistency in selecting outcome measures, both in general pain research [Citation3,Citation4], and in research focusing on MSD [Citation5]. The Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials (IMMPACT) [Citation4] has recommended that the following four domains should be included: pain intensity, physical function, emotional function, and general improvement.

When assessing physical function, IMMPACT recommends the Brief Pain Inventory [Citation6] or the Multidimensional Pain Inventory [Citation7,Citation8]. A closer examination of the items included in the Multidimensional Pain Inventory shows that they tend to measure the consequences of disability in daily life rather than the disability itself, and it is not clear which components and aspects of function and disability are being measured according to the International Classification of Functioning, Disability and Health (ICF) [Citation9]. It may be components related to body functions and body structures, i.e., impairment level, or components related to activities and participation. In order to correctly diagnose and provide specific, targeted interventions to patients, we need to distinguish physical function per se (e.g., ability to turn the head) from the consequences of daily life (e.g., ability to dress).

Vast numbers of instruments are available for measuring physical function in MSD. Reasons for this may be differences in definitions of MSD, and that many instruments are specific to certain body regions. In this study, we focus on questionnaires intended and evaluated for measuring physical function among people with nonspecific musculoskeletal conditions in the neck, hence we exclude studies where the questionnaires have been used in populations with symptoms due to trauma, infection, systemic diseases and congenital or acquired deformities. This limitation was made to assess the utility of the questionnaires in a homogeneous sample. Besides what is being captured by the instrument, the quality of the instrument is also important when deciding which instrument to use. A useful tool for determining which properties have been evaluated for an instrument, and subsequently the validity and reliability of the instrument, is the COSMIN checklist (COnsensus-based Standards for the selection of health Measurement INstruments) [Citation10–13].

The aim of this systematic review was to investigate how physical function is assessed in people with MSD in the neck. Specifically, we aimed to determine:

  • Which questionnaires are used to assess physical function in people with MSD in the neck?

  • What do those questionnaires measure?

  • What are the measurement properties of the questionnaires?

Materials and methods

Search for questionnaires

The study was performed as a systematic literature review [Citation14,Citation15] searching the databases PubMed, Cinahl, Web of Science, and PsycInfo for studies published until 22 November 2016. contains the keywords used in the search, which were defined based on the ideas behind the PICO model [Citation16]. We selected suitable keywords for P (population), I (intervention), O (outcome), while C (comparison) was excluded as comparative studies were not the focus of this study.

Table 1. Keywords sorted in word groups.

The systematic literature review process is illustrated in a flowchart (). The literature search in PubMed resulted in 1333 articles, in Web of Science 457 articles, in PsycInfo 91 articles, and in Cinahl 41 articles. After excluding duplicates, the selection of relevant articles to include was performed by reviewing the title and abstracts first, for a crude culling of irrelevant articles. At this stage, 284 articles were included from PubMed, 139 articles from Web of Science, 0 article from Cinahl, and 0 article from PsycInfo. Thereafter, the abstracts, and when needed the full text, of the articles were reviewed to find articles presenting the original version of questionnaires for measurement of physical function in MSD. Both authors performed the review separately and then discussed the selection, to reach consensus. Excluded were questionnaires regarding general health, fibromyalgia, spinal stenosis, scoliosis, ankylosing spondylitis, ratings of functional capacity, work disability, not patient rated outcome measure scales, and questionnaires for use among specific occupational groups or adolescents/children.

Figure 1. Flowchart; systematic literature review process.

Figure 1. Flowchart; systematic literature review process.

A total of 98 different questionnaires were identified. Despite the keywords and exclusion criteria used in the search, some questionnaires had to be excluded because they were measuring pain beliefs or were physical tests to be performed by clinicians. One questionnaire, the Patient Specific Functional Scale [Citation17] was excluded at this stage, as it did not have fixed items but asked the patient to specify two examples of their disability, which made further analysis impossible. For the purpose of this review, only the questionnaires used to measure physical function among people with neck pain were selected for further analysis. They were classified according to the ICF [Citation9] to determine their focus of measurement and coverage of items. The ICF-classification was performed independently by both authors, followed by a comparative discussion to reach consensus.

Search for quality assessment of questionnaires

To identify the measurement properties that have been evaluated for the questionnaires, a second literature search in the previous databases was performed using the key words “validity OR reliability OR responsiveness AND neck AND name of questionnaire” to find articles that had quality-assessed the questionnaires.

Only articles which analyzed the original version of the questionnaire were included. Studies concerning translated versions of the questionnaires and studies where ≥50% of the sample met our previously used exclusion criteria were excluded.

Quality assessment of the questionnaires

The finally selected articles were examined with respect to relevant quality indicators of validity and reliability using the COSMIN checklist [Citation11,Citation13]. This examination was performed by both authors, first separately, then together to reach consensus regarding what properties of methodological quality had been assessed for each questionnaire. For each of the studies assessing the quality of the questionnaires, we rated the findings of the measurement properties as “adequate” or “not adequate” according to guidelines in the COSMIN checklist. Our quality criteria were inspired by the work of Schellingerhout et al. [Citation18], and are listed below.

  • Internal consistency (adequate = unidimensional (sub)scale or Cronbach’s alpha(s) ≥ 0.70; not adequate = otherwise or cannot be determined)

  • Reliability (adequate = intraclass correlation/weighted Kappa ≥0.70 or Pearson’s r ≥ 0.80; not adequate = otherwise or cannot be determined)

  • Measurement error (adequate = standard error of measurement, smallest detectable change or limits of agreement presented; not adequate = otherwise or cannot be determined)

  • Content validity (adequate = the target population was involved in the development of the questionnaire; not adequate = otherwise or cannot be determined)

  • Construct validity (adequate = factors explained at least 50% of the variance; not adequate = otherwise or cannot be determined)

  • Criterion validity (adequate = correlation with criterion instrument(s) ≥ 0.50; not adequate = otherwise or cannot be determined)

  • Responsiveness (adequate = correlation with anchor instrument ≥0.50 or area under receiver operator curve ≥0.70; not adequate = otherwise or cannot be determined)

  • Interpretability was marked “all” if the distribution of scores, floor/ceiling effects, scores for relevant groups and minimal important change or minimal important difference were reported, and “some” otherwise.

For criterion validity, only questionnaires (i.e., not single scales such as visual analog scales) were considered as criterion instruments. If an instrument (either evaluated instrument or criterion instrument) contained several scales, the methodological quality was determined by taking the lowest rating of any of the scales.

Results

From the literature search, we identified 10 questionnaires that are used to assess physical function among people with neck pain [Citation19–28]. They are listed in . When each questionnaire’s items were classified according to the ICF, we found that most questionnaires covered only the components body functions and activity and participation ( and ). The Cervical Spine Outcomes Questionnaire differed by also including items about environmental factors.

Table 2. Included neck questionnaires.

Table 3. ICF classification of the included neck questionnaire.

Table 4. ICF codes with explanation.

Within the component body functions, items tended to cover “mental functions” and “sensory functions and pain”. For mental functions, the most common items assessed were “sleep functions” (b134), “thought functions” (b160) and “emotional functions” (b152). For sensory functions and pain, item “pain” (b280) was most commonly assessed. The widest coverage was found for the ProFitMap-neck questionnaire and Cervical Spine Outcomes Questionnaire, each covering five of seven aspects of body functions ( and ). The Neck Functional Status Questionnaire did not include any items on sensory functions such as pain.

Within the component activity and participation, the questionnaires generally covered more aspects than in other components, the exception thus being the ProFitMap-neck questionnaire and Cervical Spine Outcomes questionnaire. Most items in activity and participation addressed “mobility”, while “communication” was not considered in any of the questionnaires. Items concerning the ability “to lift and carry objects” (d430), “dressing” (d540), and “recreation and leisure” (d920) were most frequently included in the questionnaires.

In the search for studies evaluating measurement properties of the 10 questionnaires, we identified 32 articles that fulfilled the inclusion criteria [Citation19–50]. From these, we extracted and categorized the properties of methodological quality that was assessed for each of the questionnaires according to the COSMIN checklist (). The results show that Neck Disability Index has been evaluated in many studies, and it also has been assessed for all analyzed properties in the checklist. Neck OutcOme Score has also been assessed for all analyzed properties in the check list, but only in few studies. While remaining questionnaires have been evaluated in fewer studies than the Neck Disability Index, the evaluations tend to cover most properties in the checklist. The exception is the Neck Pain and Functional Limitation Scale that have been assessed for three properties only. Notably, internal consistency has been assessed for each of the 10 questionnaires, but measurement error has been assessed for only five of the 10 questionnaires.

Table 5. Mapping of psychometric assessment of the included questionnaires.

The methodological quality of the questionnaires is summarized in . Internal consistency of the questionnaires clearly varied between studies. For most questionnaires, however, the internal consistency was found to be adequate in the majority of studies investigating the property. Test–retest reliability, on the other hand, was generally only adequate in questionnaires where it had been evaluated in a single study. While measurement error was reported adequately in few studies, most of these studies evaluated Neck Disability Index. Thus, several studies report on the measurement error of Neck Disability Index.

Table 6. Methodological quality of the included questionnaires.

Content validity was found to be adequate in four questionnaires only: Cervical Spine Outcomes Questionnaire, Neck OutcOme Score, Neck Pain and Functional Limitation Scale, and ProFitMap-neck. When assessing construct validity, we found that factor analysis of the construct to be measured had been performed for most questionnaires, and the majority of the studies reported adequate validity. It should be noted, however, that the findings are based on few studies. For criterion validity, the findings were rated for each of the criterion instruments listed in . The two most commonly used criterion instruments were Neck Disability Index and the Short Form 36 Item Health Survey (SF-36). For NDI the results were inconsistent. Overall, the criterion validity toward SF-36 was low, except for Neck OutcOme Score which showed adequate criterion validity toward the bodily pain subscale of SF-36, according to the authors’ hypotheses [Citation47]. The Oswestry Disability Index was used as criterion instrument in four studies. Its correlation with Cervical Spine Outcomes Questionnaire was low, and the findings with respect to Neck Pain and Disability Scale were inconsistent. Responsiveness has been evaluated for eight of the 10 questionnaires, and most results were negative. Only for Neck Disability Index, Neck Pain and Disability Scale, and ProFitMap-neck, the majority of the findings were positive. For each of the latter two questionnaires, responsiveness was only evaluated in one study.

In addition to the ratings of quality, we assessed the degree to which important characteristics relating to the instrument’s interpretability were reported in the studies. Almost all studies reported some characteristics of the instrument, but only one study evaluating ProFitMap-neck reported all characteristics considered important.

Discussion

Most questionnaires included questions about body functions as well as activity and participation. However, the questionnaires tended to have a larger focus (more items) on activity and participation than on body function. In other words, they tended to measure disability or the ability to cope in everyday life, rather than physical function as such. Two questionnaires constituted an exception; Cervical Spine Outcomes Questionnaire and ProFitMap-neck questionnaire. While the former was nearly equally balanced in items between the two components, the latter had an overweight on body function.

Reviewing the literature, we found four articles [Citation51–54] that classified the content of MSD questionnaires according to ICF. Only one of these [Citation54] investigated neck questionnaires. In that study, five of the questionnaires included in the present study (Copenhagen Neck Functional Disability Scale, Neck Bournemouth Questionnaire, Neck Disability Index, Neck Pain and Disability Scale, Northwick Park Neck Pain Questionnaire) were analyzed. Similarly to our findings, Ferreira et al. [Citation54] showed that the questionnaires contained more items about activity and participation than about body function. They concluded that the Neck Bournemouth Questionnaire, Neck Disability Index, and Neck Pain and Disability Scale have a well-balanced distribution of items over the ICF-components. In the present study, however, we found the Cervical Spine Outcomes Questionnaire to be even more equally balanced than the other questionnaires included.

For activity and participation, items regarding mobility were most commonly assessed, followed by personal care and community, social and civic life. This seems reasonable, since being able to move, take care of oneself and participate in social life are basic abilities to be self-sufficient for everyone. For body functions, mental functions and sensory functions and pain were covered by almost all questionnaires, which is relevant as depression, difficulty concentrating, and pain are common symptoms in this disorder [Citation5]. Neuromusculoskeletal function, on the other hand, was assessed in Neck Pain and Disability Scale and Neck Functional Status Questionnaire (by one item), in Cervical Spine Outcomes Questionnaire and Neck OutcOme Score (by two items), and in ProFitMap-neck (by seven items). It is somewhat surprising that items on neuromusculoskeletal functions were not included in all questionnaires, as it would appear to be a highly relevant aspect to measure in patients with neck MSD where radiating pain, sensory loss and muscle weakness are common [Citation55].

According to a previous review of psychometric evaluations performed on neck specific questionnaires, the evidence for validity and reliability of the questionnaires was mostly limited, and at least 50% of the information about measurement properties per questionnaire was lacking [Citation18]. Our results showed that most properties of the COSMIN checklist have been assessed for each of the included questionnaires, although the evaluations have generally been performed in few studies. An obvious exception is Neck Disability Index, which has been evaluated in more than twice as many studies as the other questionnaires. This is consistent with the previous review, where Neck Disability Index also was reported to be the most frequently evaluated questionnaire. Among the measurement properties investigated in the questionnaires, measurement error was the least assessed, and should be considered in future studies.

Unfortunately, the instruments in present study that covered most aspects of body function were only evaluated in few studies. Three studies were found for the Cervical Spine Outcomes Questionnaire [Citation19,Citation29,Citation30] and two for the ProFitMap-neck questionnaire [Citation28,Citation50]. However, as psychometric evaluations continue to be made, more studies on the included questionnaires may already be underway.

Making comparisons to previous reviews on methodological assessment of questionnaires comes with some practical problems. We found two reviews that appeared to be comparable; one by Schellingerhout et al. [Citation18] and another by Leahy et al. [Citation56]. Methodologically these studies are comparable to the present, as all are based on the COSMIN checklist. However, the interpretation of the included quality criteria slightly differs. Furthermore, they used different criteria for inclusion and exclusion of samples and questionnaires. For example, Schellingerhout et al. [Citation18] also included people with whiplash and traumatic neck injuries in their sample, and Leahy et al. [Citation56] included questionnaires concerning pain anywhere in the spine region. As a consequence, comparisons can only be made to Schellingerhout et al. [Citation18], and only for some of the included questionnaires. The comparable questionnaires included in both studies are Neck Disability Index, Neck Pain and Disability Scale, Neck Bournemouth Questionnaire, and Copenhagen Neck Functional Disability Scale.

Similar to Schellingerhout et al. [Citation18], we found Neck Disability Index to be the most frequently assessed questionnaire. Its internal consistency was confirmed in slightly more than half of the studies evaluating it. Our findings of questionable reliability and adequate responsiveness are in agreement with Schellingerhout et al. [Citation18], whereas we did not find evidence of adequate content validity in the included studies. Also, the measurement error of Neck Disability Index was reported in several studies, albeit not with minimal important change which was required in Schellingerhout et al. [Citation18].

Compared to Neck Disability Index, few studies have evaluated Neck Pain and Disability Scale, Neck Bournemouth Questionnaire, and Copenhagen Neck Functional Disability Scale. Previous reports of limited positive evidence of responsiveness for Neck Pain and Disability Scale [Citation18] were confirmed in this study. In addition, our results show poor content validity and adequate construct validity in the studies reviewed. For Neck Bournemouth Questionnaire, the findings of internal consistency were inconsistent. Contrary to Schellingerhout et al. [Citation18], we found poor responsiveness of the questionnaire. Due to few studies evaluating Copenhagen Neck Functional Disability Scale, no firm conclusions can be drawn about its reliability or validity.

As indicated previously, it is difficult to interpret and compare results from different reviews, depending on differences in inclusion/exclusion criteria, differences in ways of determining the quality criteria in the COSMIN checklist, and differences in the ways the results are presented. Reasons for low inter-rater reliability may be differences in interpretation of checklist items [Citation57]. This was evident in the review by Leahy et al. [Citation56] where content validity was judged excellent for the Extended Aberdeen Back Pain Scale, based on the results of a study using qualitative content analysis to compare interviews with people with neck pain and the content of questionnaires [Citation5]. That study does not make any psychometric assessment of the included questionnaires. Moreover, it only notes that only Extended Aberdeen Back Pain Scale and ProFitMap-neck have used the experiences of the affected in development, and the correspondence between experiences and questionnaires was mostly low. In the present study, we were methodologically stringent by including a homogenous sample, only neck questionnaires, and clearly defining and describing our use of the quality criteria.

To conclude, we found that questionnaires for assessing physical function in people with neck disorders tend to focus more on the ability to engage in activities of daily life than on physical function per se. There are some exceptions, but the measurement properties of these questionnaires have been evaluated in few studies.

As we have experienced, the results from quality assessments differ between studies, even though the same checklist has been used. The COSMIN checklist [Citation11,Citation13] is a useful contribution in quality assessment of patient rated outcome measure scales, but clear descriptions of the interpretation of these quality criteria in each study would enhance the comparability between studies. A homogenous sample would also facilitate comparisons.

When investigating neck disorders, it is vital to measure aspects that are of importance for the individual wellbeing and ability to function in work and daily life. Likewise, including the most relevant aspects of physical function in research studies will increase the validity of the results. Therefore, we recommend that more psychometric testing is performed of the questionnaires, especially with respect to content validity and measurement error, and particularly of questionnaires that cover important aspects of the ICF categories.

Disclosure statement

The authors declare that they have no conflict of interest.

References

  • McBeth J, Jones K. Epidemiology of chronic musculoskeletal pain. Best Pract Res Clin Rheumatol. 2007;21:403–425.
  • Picavet HSJ, Schouten JSAG. Musculoskeletal pain in the Netherlands: prevalences, consequences and risk groups, the DMC3-study. Pain. 2003;102:167–178.
  • SBU. Methods for long term pain treatment. Stockholm: The Swedish Council on Technology Assessment in Health Care; 2006.
  • IMMPACT. Initiative on methods, measurement, and pain assessment in clinical trials 2009; [cited 2009 Jan 22]. Available from: http://www.immpact.org/index.html
  • Wiitavaara B, Björklund M, Brulin C, et al. How well do questionnaires on symptoms in neck-shoulder disorders capture the experiences of those who suffer from the disorders? A content analysis of questionnaires and interviews. BMC Musculoskelet Disord. 2009;10:30.
  • Cleeland CS, Ryan KM. Pain assessment: global use of the brief pain inventory. Ann Acad Med Singap. 1994;23:129–138.
  • Kerns RD, Turk DC, Rudy TE. The West Haven-Yale Multidimensional Pain Inventory (WHYMPI). Pain. 1985;23:345–356.
  • Bergström KG, Jensen IB, Linton SJ, et al. A psychometric evaluation of the Swedish version of the Multidimensional Pain Inventory (MPI-S): a gender differentiated evaluation. Eur J Pain (London, UK). 1999;3:261–273.
  • WHO. International Classification of Functioning, Disability and Health: ICF. Short version [Internet]; 2001. Available from: http://www.who.int/classifications/icf/en/
  • Mokkink LB, Terwee CB, Patrick DL, et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual Life Res. 2010;19:539–549.
  • Mokkink LB, Terwee CB, Knol DL, et al. The COSMIN checklist for evaluating the methodological quality of studies on measurement properties: a clarification of its content. BMC Med Res Methodol. 2010;10:22.
  • Mokkink LB, Terwee CB, Patrick DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63:737–745.
  • Terwee CB, Mokkink LB, Knol DL, et al. Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist. Qual Life Res. 2012;21:651–657.
  • Grant MJ, Booth A. A typology of reviews: an analysis of 14 review types and associated methodologies. Health Inform Libr J. 2009;26:91–108.
  • Fink A. Conducting research literature reviews. 3rd ed. Thousand Oaks (CA): SAGE Publications Inc.; 2010.
  • Cooke A, Smith D, Booth A. Beyond PICO: the SPIDER tool for qualitative evidence synthesis. Qual Health Res. 2012;22:1435–1443.
  • Westaway MD, Stratford PW, Binkley JM. The patient-specific functional scale: validation of its use in persons with neck dysfunction. J Orthop Sports Phys Ther. 1998;27:331–338.
  • Schellingerhout JM, Verhagen AP, Heymans MW, et al. Measurement properties of disease-specific questionnaires in patients with neck pain: a systematic review. Qual Life Res. 2012;21:659–670.
  • BenDebba M, Heller J, Ducker TB, et al. Cervical spine outcomes questionnaire: its development and psychometric properties. Spine. 2002;27:2116–2123.
  • Jordan A, Manniche C, Mosdal C, et al. The Copenhagen Neck Functional Disability Scale: a study of reliability and validity. J Manipulat Physiol Ther. 1998;21:520–527.
  • Bolton JE, Humphreys BK. The Bournemouth Questionnaire: a short-form comprehensive outcome measure. II. Psychometric properties in neck pain patients. J Manipulat Physiol Ther. 2002;25:141–148.
  • Vernon H, Mior S. The Neck Disability Index: a study of reliability and validity. J Manipulat Physiol Ther. 1991;14:409–415.
  • Wheeler AH, Goolkasian P, Baird AC, et al. Development of the Neck Pain and Disability Scale. Item analysis, face, and criterion-related validity. Spine. 1999;24:1290–1294.
  • Leonard JH, Choo CP, Manaf MR, et al. Development and evaluation of ‘neck pain and functional limitation scale’: a validation study in the Asian context. Indian J Med Sci. 2009;63:445–454.
  • Wang YC, Cook KF, Deutscher D, et al. The Development and Psychometric Properties of the Patient Self-Report Neck Functional Status Questionnaire (NFSQ). J Orthop Sports Phys Ther. 2015;45:683–692.
  • Juul T, Sogaard K, Roos EM, et al. Development of a patient-reported outcome: the Neck OutcOme Score (NOOS)–Content and construct validity. J Rehabil Med. 2015;47:844–853.
  • Leak AM, Cooper J, Dyer S, et al. The Northwick Park Neck Pain Questionnaire, devised to measure neck pain and disability. Br J Rheumatol. 1994;33:469–474.
  • Bjorklund M, Hamberg J, Heiden M, et al. The ProFitMap-neck–reliability and validity of a questionnaire for measuring symptoms and functional limitations in neck pain. Disabil Rehabil. 2012;34:1096–1107.
  • Skolasky RL, Riley LH, Albert TJ. Psychometric properties of the Cervical Spine Outcomes Questionnaire and its relationship to standard assessment tools used in spine research. Spine J. 2007;7:174–179.
  • Skolasky RL, Albert TJ, Vaccaro AR, et al. Patient satisfaction in the cervical spine research society outcomes study: relationship to improved clinical outcome. Spine J. 2009;9:232–239.
  • Pickering PM, Osmotherly PG, Attia JR, et al. An examination of outcome measures for pain and dysfunction in the cervical spine: a factor analysis. Spine. 2011;36:581–588.
  • Gay RE, Madson TJ, Cieslak KR. Comparison of the Neck Disability Index and the Neck Bournemouth Questionnaire in a sample of patients with chronic uncomplicated neck pain. J Manipulat Physiol Ther. 2007;30:259–262.
  • Geri T, Piscitelli D, Meroni R, et al. Rasch analysis of the Neck Bournemouth Questionnaire to measure disability related to chronic neck pain. J Rehabil Med. 2015;47:836–843.
  • Bolton JE. Sensitivity and specificity of outcome measures in patients with neck pain: detecting clinically significant improvement. Spine. 2004;29:2410–2417.
  • Hurst H, Bolton J. Assessing the clinical significance of change scores recorded on subjective outcome measures. J Manipulat Physiol Ther. 2004;27:26–35.
  • Hains F, Waalen J, Mior S. Psychometric properties of the neck disability index. J Manipulat Physiol Ther. 1998;21:75–80.
  • McCarthy MJH, Grevitt MP, Silcocks P, et al. The reliability of the Vernon and Mior neck disability index, and its validity compared with the short form-36 health survey questionnaire. Eur Spine J. 2007;16:2111–2117.
  • van der Velde G, Beaton D, Hogg-Johnston S, et al. Rasch analysis provides new insights into the measurement properties of the neck disability index. Arthritis Rheum. 2009;61:544–551.
  • Hung M, Cheng C, Hon SD, et al. Challenging the norm: further psychometric investigation of the neck disability index. Spine J. 2015;15:2440–2445.
  • Cleland JA, Fritz JM, Whitman JM, et al. The reliability and construct validity of the Neck Disability Index and patient specific functional scale in patients with cervical radiculopathy. Spine. 2006;31:598–602.
  • Cleland JA, Childs JA, Whitman JM. Psychometric properties of the neck disability index and numeric pain rating scale in patients with mechanical neck pain. Arch Phys Med Rehabil. 2008;89:69–74.
  • Young BA, Walker MJ, Strunce JB, et al. Responsiveness of the Neck Disability Index in patients with mechanical neck disorders. Spine J. 2009;9:802–808.
  • Young IA, Cleland JA, Michener LA, et al. Reliability, construct validity, and responsiveness of the neck disability index, patient-specific functional scale, and numeric pain rating scale in patients with cervical radiculopathy. Am J Phys Med Rehabil/Assoc Acad Physiatr. 2010;89:831–839.
  • Riddle DL, Stratford PW. Use of generic versus region-specific functional status measures on patients with cervical spine disorders. Phys Ther. 1998;78:951–963.
  • En MCC, Clair DA, Edmondston SJ. Validity of the Neck Disability Index and Neck Pain and Disability Scale for measuring disability associated with chronic, non-traumatic neck pain. Manual Ther. 2009;14:433–438.
  • Gabel CP, Cuesta-Vargas AI, Osborne JW, et al. Confirmatory factory analysis of the Neck Disability Index in a general problematic neck population indicates a one-factor model. Spine J. 2014;14:1410–1416.
  • Juul T, Sogaard K, Davis AM, et al. Psychometric properties of the Neck OutcOme Score, Neck Disability Index, and Short Form-36 were evaluated in patients with neck pain. J Clin Epidemiol. 2016;79:31–40.
  • Goolkasian P, Wheeler AH, Gretz SS. The neck pain and disability scale: test–retest reliability and construct validity. Clin J Pain. 2002;18:245–250.
  • Sim J, Jordan K, Lewis M, et al. Sensitivity to change and internal consistency of the Northwick Park Neck Pain Questionnaire and derivation of a minimal clinically important difference. Clin J Pain. 2006;22:820–826.
  • Bjorklund M, Wiitavaara B, Heiden M. Responsiveness and minimal important change for the ProFitMap-neck questionnaire and the Neck Disability Index in women with neck-shoulder pain. Qual Life Res. 2016;26:161–170.
  • Oner FC, Jacobs WC, Lehr AM, et al. Toward the development of a universal outcome instrument for spine trauma: a systematic review and content comparison of outcome measures used in spine trauma research using the ICF as reference. Spine. 2016;41:358–367.
  • Grotle M, Brox JI, VNK. Functional status and disability questionnaires: what do they assess? A systematic review of back-specific outcome questionnaires. Spine. 2005;30:130–140.
  • Ibsen CS, Christensen B, Melchiorsen H, et al. Do patient-reported outcome measures describe functioning in patients with low back pain, using the brief international classification of functioning, disability and health core set as a reference? J Rehabil Med. 2016;48:618–624.
  • Ferreira ML, Borges BM, Rezende IL, et al. Are neck pain scales and questionnaires compatible with the International Classification of Functioning, Disability and Health? A systematic review. Disabil Rehabil. 2010;32:1539–1546.
  • Wiitavaara B, Fahlstrom M, Djupsjobacka M. Prevalence, diagnostics and management of musculoskeletal disorders in primary health care in Sweden: an investigation of 2000 randomly selected patient records. J Eval Clin Pract. 2016;23:325–332.
  • Leahy E, Davidson M, Benjamin D, et al. Patient-Reported Outcome (PRO) questionnaires for people with pain in any spine region. A systematic review. Manual Ther. 2016;22:22–30.
  • Mokkink LB, Terwee CB, Gibbons E, et al. Inter-rater agreement and reliability of the COSMIN (COnsensus-based Standards for the selection of health status Measurement Instruments) checklist. BMC Med Res Methodol. 2010;10:82.