475
Views
4
CrossRef citations to date
0
Altmetric
Respiratory

Co-calibrating quality-of-life scores from three pulmonary disorders: implications for comparative-effectiveness research

, &
Pages 596-603 | Received 30 Nov 2015, Accepted 27 Jan 2016, Published online: 19 Feb 2016

Abstract

Background Efficient use of health resources requires accurate outcome assessment. Disease-specific patient-reported outcome (PRO) measures are designed to be highly relevant to patients with a specific disease. They have advantages over generic PROs that lack relevance to patient groups and miss crucial impacts of illness. It is thought that disease-specific measurement cannot be used in comparative effectiveness research (CER). The present study provides further evidence of the value of disease-specific measures in making valid comparisons across diseases.

Methods The Asthma Life Impact Scale (ALIS, 22 items), Living with Chronic Obstructive Pulmonary Disease (LCOPD, 22 items) scale, and Cambridge Pulmonary Hypertension Outcome Review (CAMPHOR, 25 items) were completed by 140, 162, and 91 patients, respectively. The three samples were analyzed for fit to the Rasch model, then combined into a scale consisting of 58 unique items and re-analyzed. Raw scores on the three measures were co-calibrated and a transformation table produced.

Results The scales fit the Rasch model individually (ALIS Chi2 probability value (p-Chi2) = 0.05; LCOPD p-Chi2 =0.38; CAMPHOR p-Chi2 =0.92). The combined data also fit the Rasch model (p-Chi2 =0.22). There was no differential item functioning related to age, gender, or disease. The co-calibrated scales successfully distinguished between perceived severity groups (p < 0.001).

Limitations The samples were drawn from different sources. For scales to be co-calibrated using a common item design, they must be based on the same theoretical construct, be unidimensional, and have overlapping items.

Conclusions The results showed that it is possible to co-calibrate scores from disease-specific PRO measures. This will permit more accurate and sensitive outcome measurement to be incorporated into CER. The co-calibration of needs-based disease-specific measures allows the calculation of γ scores that can be used to compare directly the impact of any type of interventions on any diseases included in the co-calibration.

Introduction

As healthcare resources become more strained by increasing longevity and rates of chronic disease, the ability to evaluate health outcomes accurately becomes more crucial. One approach to the problem is comparative effectiveness research (CER). The central question of CER is which treatment works best, for whom, and in which circumstancesCitation1. While such research is generally limited to alternative interventions for a single disease, CER would be equally valuable where decisions need to be taken about whether to treat Disease A or Disease B in terms of health gains. In the UK, the National Institute of Clinical Excellence (NICE) recommend the use of generic patient-reported outcomes (PRO)—particularly the EQ-5DCitation2—for such researchCitation3. However, the psychometric limitations of this PRO mean that it proves difficult to show differences between the effectiveness of alternative interventionsCitation4–6. Recently, NICE reversed its position and postponed implementation of value-based assessmentCitation7. Similarly, in the US, the Patient Protection and Affordable Care Act specifically ruled out employing Quality Adjusted Life Years using the EQ-5D. Despite this reversal, CER remains crucial to comparing the outcome of different packages of interventions. NICE has now recognized that, for CER to be a worthwhile alternative, sensitive and valid PRO measures are required that take into account the wider impact of diseaseCitation7.

Available generic PROs generally have poor psychometric properties, including problems with response functioning, item functioning, and measurement rangeCitation8–10. When generic PROs are used to combine patients with different diseases, items in the scales have exhibited disease-related differential item functioning (DIF)Citation11–13. The DIF arises because the same question may well be interpreted differently by patients with different diseases. Furthermore, generic measures have relatively poor sensitivity, making it questionable whether generic PROs are suitable for use in CER studies.

Modern PRO indicators focus on disease-specific outcomesCitation14. Because the indicators can ask highly relevant questions and avoid irrelevant issues they are more valid and reliable than generic outcome descriptors. Evidence also demonstrates that such disease-specific measures are more sensitive to change than generic indicatorsCitation15–20. As disease-specific measures are relevant to only one disease it would be expected that they could not be used in CER. The purpose of this study was to assess whether PRO measures developed using Rasch analysis can overcome this shortcoming.

Most PROs were developed using classical test theory (CTT), also known as true score theoryCitation21. CTT assumes that the observed score for each individual is comprised of their true ability and a random error componentCitation22. CTT has been widely applied in PRO development, possibly due to its weak underlying assumptions that require little mathematical knowledge, making it simple to useCitation23. However, there are several limitations of CTT. The method relies on correlational analyses such as internal consistency and factor analysis. Furthermore, the application of CTT in PRO development produces numbers that constitute only ordinal level data. As a result, the requirements for parametric assessment are not met. Non-parametric analyses are less powerful than parametric analyses and do not justify the calculation of change scores in clinical trialsCitation24.

In contrast, Rasch analysisCitation25 gives greater emphasis to item-level information and provides superior detail on the plausible measurement properties of scalesCitation26–28. The Rasch model has particularly strong mathematical characteristics compared with other item response theory (IRT) approaches. When data fit the Rasch model fundamental measurement is achieved and ordinal scores are transformed into interval-level variablesCitation24,Citation29. Rasch analysis also allows responses from different disease-specific PRO measures to be co-calibrated onto the same measurement scale, allowing the impact of different diseases to be combined and compared. Such co-calibration has been widely used in educational settings to equate tests of varying difficulty levelsCitation30, but is increasingly also being applied in health researchCitation31–33.

The aim of this study was to determine whether it is possible to co-calibrate scores on three previously validated, disease-specific, needs-based PROs; the Asthma Life Impact Scale (ALIS)Citation34, the Living with Chronic Obstructive Pulmonary Disease (LCOPD) scaleCitation35, and the Cambridge Pulmonary Hypertension Outcome Review (CAMPHOR)Citation36 to allow scores from patients with different respiratory conditions to be combined and compared validly. The secondary aim was to assess whether the PROs could be used in respiratory CER studies.

The study used a common item design to co-calibrate the three scales in which respondents only complete one scale, but the scales share common item content. This design has been employed successfully previously in dermatologyCitation31. In order to co-calibrate scores from different scales using a common item design, the scales must meet strict criteria. They must be based on the same theoretical outcome construct, fit the Rasch model (be unidimensional) and have overlapping items. The three scales used in the study satisfied these requirementsCitation37.

Patients and methods

Participants

The asthma (n = 140) and chronic obstructive pulmonary disease (COPD) samples (n = 162) were recruited from Peterborough District Hospital and patient support groupsCitation34,Citation35. The pulmonary hypertension (PH) sample (n = 91) was recruited from Papworth Hospital, CambridgeCitation36. Ethics committee approval was obtained for each sample separately.

Measures

The CAMPHOR is a suite of three measures, including a quality of life (QoL) scale. For the present study, only the QoL scale was employed. The ALIS and LCOPD are disease-specific outcome measures of QoL. All three scales were developed with the needs-based model of QoL as the theoretical basis. The model theorizes that QoL is determined by the ability and capacity of an individual to satisfy particular human needs. QoL is high when these needs are satisfiedCitation37.

The ALIS and LCOPD consist of 22 items and the CAMPHOR QoL scale has 25. All response options are dichotomous (‘True’ or ‘Not True’) and overall scores are calculated by summing the individual item scores. High scores on the measures signify poor QoL.

All three scales were developed using the Rasch model.

Procedure

Three stages were involved in the analysis:

1.

Data from the three measures were analyzed individually to confirm their fit to the Rasch model.

2.

Data from the measures were combined onto the same measurement scale using a common item design. Rasch analysis was applied to assess fit statistics for the combined scale.

3.

The validity of the co-calibrated scale was investigated by relating Rasch scores to disease type, perceived general health, and illness duration.

Rasch analysis

The Rasch model is a type of IRT model that can be applied to scales to assess their measurement properties. The response patterns achieved from a set of items are tested against what is expected by the Rasch model. The Rasch model is a probabilistic version of the Guttman scaleCitation38. According to the Rasch model, the probability of an individual affirming a particular item is a logistic function of the relative distance between the person location parameter and the item location parameter. In other words, the probability of an individual affirming an item is dependent only on the individual’s level of the construct (e.g., QoL) and the level of the construct assessed by the item. The Rasch model places items in hierarchical order of severity on the same underlying metric. The mathematical unit of the Rasch model is the logit. The mean location of the items on the metric is anchored at ‘0’ and item locations range either side of this fixed point. Within the context of PROs, milder items are represented by negative logit values and more severe items by positive logit values.

Because the three scales had two possible response options for each item, the dichotomous Rasch model was employed. Several analyses were used to assess fit to the Rasch model. Internal reliability was analyzed using the Person Separation Index (PSI). A PSI score of 0.70 is considered the minimum acceptable levelCitation39. Mean item and person fit residual statistics provide information on overall fit to the model. The statistics measure the extent to which observed item and person estimates deviate from model expectations. The co-calibrated scale’s overall fit to the model was analyzed and reported as a Chi2 statistic. A Chi2 probability value (p-Chi2) of less than 0.05 (Bonferroni corrections applied) means that the scale has significantly deviated from the expected pattern and misfits the Rasch model.

Fit statistics for individual items were also investigated. Item fit residuals should lie within ±2.5 if all individuals respond as anticipated. High negative item fit residuals suggest item redundancy, whereas high positive item fit residuals denote multidimensionality. Chi2 statistics were employed to assess whether observed scores deviated from model expectations. Probability values below 0.05 (Bonferroni corrections applied) indicate that observed scores for an item deviate from those expected by the Rasch model.

A requirement of the Rasch model is item invariance across groups, which is tested by examining differential item functioning (DIF) for different sample groups. DIF occurs when different groups (e.g., males and females) respond in a different pattern to an individual item, despite having equal levels of the construct being measured (e.g., QoL)Citation26. An analysis of variance (ANOVA) of standardized residuals was used to examine DIF by gender and age group. DIF by disease was investigated for the items common to more than one scale. An ANOVA p-value of below 0.05 (Bonferroni corrections applied) indicates presence of DIF. DIF can be uniform or non-uniform.

Targeting of items to the QoL of respondents was assessed by examining person-item distribution graphs. These show the ordering of both persons and items on the logit scale and indicate whether the items in the scale are well matched to the respondents.

A further test of the unidimensionality of scales is also calculatedCitation40. In this test, two sub-sets of responses are identified that are highly divergent (based on items loading at either extremes of the first factor of the residual principal components analysis). The two sub-sets of responses are used to create separate estimates for each individual. Paired t-tests are carried out to assess whether the two test sub-sets produced significantly different estimates for each respondent. Where fewer than 5% of the tests are statistically significant the scale is considered unidimensional. A 95% binomial confidence interval is also provided.

The Rasch Unidimensional Measurement Model (RUMM) 2030Citation41 program was used in the analyses.

Co-calibration of the ALIS, LCOPD, and CAMPHOR

There are 10 common items linking the three scales, all of which were used for the co-calibration. Five of these items are common to the ALIS and LCOPD only, two to the ALIS and CAMPHOR only, and two to the LCOPD and CAMPHOR. One item is common to all three measures, although this is not necessary for co-calibration. As participants only completed one of the measures, their responses are linked via the common or ‘anchor items’. The analyses of fit to the Rasch model (described above) were conducted.

Analysis of the co-calibrated scale

The Rasch location of all items on the combined measurement scale was linearly transformed to allow each item to be expressed as a standardized score that can range from 0–100. This is referred to as the gamma (γ) score below.

Further analyses were conducted to see how the γ scores performed. Patient scores for asthma, COPD, and PH were compared using an ANOVA In addition, γ scores were combined and related to perceived severity (for ALIS and LCOPD) and New York Heart Association (NYHA) Functional Classification (for CAMPHOR), using an ANOVA and t-test respectively. P-values of < 0.01 were used to indicate statistically significant differences between disease groups.

Results

Demographic details of the samples are shown in . Perceived illness severity was recorded for asthma and COPD samples. For the PH sample the NYHA Functional Classification was used as an indicator of disease severity.

Table 1. Sample characteristics.

Rasch analysis of individual scales

shows overall fit statistics for the ALIS, LCOPD and CAMPHOR. All three measures fitted the Rasch model (p-Chi2, p > 0. 05) initially. The PSIs were high, indicating that the measures had appropriate levels of reliability.

Table 2. Overall fit statistics.

Item 12 of the LCOPD (‘My illness frightens me’) had a high fit residual (2.9), indicating misfit to the Rasch model. However, the item characteristic curve showed no pronounced deviation between observed and expected scores. After applying Bonferroni corrections, none of the items in any of the three scales showed DIF by age group (below and above median) or gender.

As the fit indicators did not support the removal of items, all items were retained and carried forward to the combined analysis.

Rasch analysis of combined samples

Item and person fit residual statistics for the combined scales are also shown in . Overall fit to the model was achieved (p-Chi2 >0.05) and reliability was good (PSI > 0.85). The LCOPD item (‘My illness frightens me’) continued to have a high fit residual, but was retained as other fit indicators did not support its removal. There was no DIF found for age group (below and above median), gender, or disease.

Targeting of the items to the respondents is displayed in . The graph shows that the items had a good spread and were well matched to the person locations on the logit scale. However, there was some lack of coverage at the mild end of the scale. As shown in the figure, the items were well spread across the logit scale, indicating minimal item redundancy. displays the person and item locations for the combined scale. The most severe item in the combined scale was ‘I don’t want to talk to anybody’. The mildest item was ‘I feel dependent on my treatment’. The logit coverage (the distance between the most severe and mildest items) of the combined sample was 7.0.

Figure 1. Targeting of items for the combined scale.

Figure 1. Targeting of items for the combined scale.

Figure 2. Item map showing the item and person locations for the combined scale.

Figure 2. Item map showing the item and person locations for the combined scale.

The logit positions of the common linking items are shown in . None of the items displayed DIF by gender, age group or disease after Bonferroni corrections were applied. The logit coverage of the common items was 4.4. The most severe common item was ‘My illness affects my close relationships’, whereas the mildest was ‘I have to pace myself’.

Table 3. Logit positions of the common linking items.

shows example transformed γ scores for the three measures. The mean γ scores (standard error) for the individual scales were 43.63 (4.18) for the ALIS, 46.93 (4.12) for the LCOPD, and 54.71 (4.25) for the CAMPHOR. As stated in the Method section, an individual’s score is the number of items they affirmed. A raw score of 10 would equate to a γ score of 42 on ALIS, 45.2 on LCOPD, and 49.4 on CAMPHOR.

Table 4. Example γ scores for the ALIS, LCOPD, and CAMPHOR.

Analysis of the co-calibrated scale

shows mean γ scores and corresponding 95% confidence intervals based on the standard error of the mean for disease groups, severity groups, and NYHA classifications. High γ scores indicate worse QoL. Comparison of samples showed that γ scores were able to distinguish between disease (F = 30.58, p < 0.001) and disease severity for both asthma (F = 38.16, p < 0.001) and COPD patients (F = 27.58, p < 0.001). PH patients with a NYHA classification of III/IV scored significantly higher than those with a classification of I/II (t = −4.67, p < 0.001). Comparison of samples showed that asthma patients had significantly lower γ scores than patients with COPD (t = −7.38, p < 0.001) and PH (t = −5.07, p < 0.001). However, there were no significant differences in γ scores between COPD and PH patients (t = 1.37, p = 0.17). It should be noted that the samples were not matched for disease severity.

Figure 3. Mean γ scores and 95% confidence intervals by disease, severity group, and NYHA classification.

Figure 3. Mean γ scores and 95% confidence intervals by disease, severity group, and NYHA classification.

Discussion

This study successfully co-calibrated three different disease-specific respiratory PRO measures onto the same measurement scale, using Rasch analysis. The combined scale fit the Rasch model and no adjustments to the scale were required. There was no DIF related to age, gender, or disease. The validity of the γ scores generated from the combined scale was assessed by comparing scores by perceived disease severity (for ALIS and LCOPD) and NYHA classification (for CAMPHOR). The scale was able to distinguish between severity groups, providing evidence of its construct validity. The current study supports previous research that has successfully combined two dermatology disease-specific PRO measuresCitation31.

A common item design was employed to combine the scales due to the overlapping content of the three measures. All three scales were based on the same theoretical construct and developed using the needs-based model of QoL. The needs-based model postulates that QoL is determined by the ability of an individual to satisfy their needs; QoL is high when most needs are fulfilledCitation37. Since its development in 1992, the needs-based model of QoL has been used as the theoretical basis in the development of 20 disease-specific PRO measures. Disease-specific measures include the PRIMUS (multiple sclerosis)Citation42, CLIQ (Crohn’s disease)Citation43, RAQoL (rheumatoid arthritis)Citation44 and ASQoL (ankylosing spondylitis)Citation45. Because needs are universal there are commonly overlapping items in the PRO measures. For example, a loss of autonomy occurs in a number of chronic illnesses. Indeed, overlapping content is a requirement for co-calibration.

A clear application for co-calibrated needs-based measures is in CER. NICE was established in 1999 to produce guidelines on the value (including costs and benefits) of various interventions. The guidelines are used to make decisions about which treatment options to resourceCitation46. There are many demands on health service resources, but budgets will always be limited. It is, therefore, crucial that decision-makers are able to tell clearly which therapeutic interventions are having the biggest effect. NICE is in the process of considering other outcomes that could be included in value-based assessments and has acknowledged that the wider impact of the disease should be consideredCitation7. PRO measures arising from the needs-based model of QoL are well suited for this purpose as they assess how a disease affects patients’ lives and limits their ability to meet their needs.

The availability of γ scores would have additional advantages. They would provide a standardized index of outcome that is common across situations. For example, treating asthma with Drug A may result in an average gain of 3 γ. Treating COPD with Drug B may lead to an average improvement of 2 γ. The relative value of exercise programs could be evaluated using the same indices. For example, exercise programs are currently being evaluated for the three respiratory diseasesCitation47–49. Combining disease-specific PRO measures onto the same measurement continuum would allow the effectiveness of such programs on different respiratory conditions to be directly comparable.

Further research is required to establish the validity of such γ scores. However, they could provide the basis for a new outcome technology system. Following co-calibration of the 20 needs-based disease-specific measures, patient scores on any of them could be converted into γ scores. The γ scores would allow direct, valid comparisons across a wide range of diseases. New needs-based instruments could then be added to the system.

Limitations of the study should be noted. The samples were drawn from different sources. In order to co-calibrate scores from different scales using a common item design, the scales must meet strict criteria. They must employ the same theoretical outcome construct, fit the Rasch model (be unidimensional), and have overlapping items. All outcomes scales should have such qualities. However, other than the needs-based measures, few PRO scales fit the Rasch model or have been developed using a consistent theoretical basis. Therefore, the method described in the current study is not applicable with most available PROs.

Conclusion

The impact of different interventions for asthma, COPD and PH can be compared directly in CER using the co-calibration approach demonstrated in this article. Utilizing disease-specific measures for comparing different diseases has advantages over relying on generic PROs in terms of validity and responsiveness.

Transparency

Declaration of funding

No funding was provided for this study.

Declaration of financial/other relationships

The authors are employees of Galen Research Ltd, a research company that develops and evaluates disease-specific patient-reported outcome measures. JME peer reviewers on this manuscript have no relevant financial or other relationships to disclose.

Acknowledgments

No assistance in the preparation of this article is to be declared.

References

  • Chalkidou K, Anderson G. Comparative effectiveness research: International experiences and implications for the United States. Washington DC: Academy Health, 2009 http://www.nihcm.org/pdf/CER_International_Experience_09.pdf. Accessed October 30, 2015
  • Group EQ. EuroQol - a new facility for the measurement of health-related quality of life. Health Policy 1990;16:199-208
  • National Institute for Health and Care Excellence. Guide to the methods of technology appraisal. London: National Institute for Health and Care Excellence, 2008
  • van Boven JF, Stuurman-Bieze AG, Hiddink EG, et al. Effects of targeting disease and medication management interventions towards patients with COPD. Curr Med Res Opin 2016;32:229-39
  • Schofield DJ. How should we measure the impact of chronic pain? Pain 2014;155:1918-19
  • Torrance N, Lawson KD, Afolabi E, et al. Estimating the burden of disease in chronic pain with and without neuropathic characteristics: does the choice between the EQ-5D and SF-6D matter? Pain 2014;155:1996-2004
  • Kusel J. Why has value based assessment been abandoned by NICE in the UK? Value & Outcomes Spotlight 2015;1:22-5
  • Taylor WJ, McPherson KM. Using Rasch analysis to compare the psychometric properties of the Short Form 36 physical function score and the Health Assessment Questionnaire disability index in patients with psoriatic arthritis and rheumatoid arthritis. Arthritis Rheum 2007;57:723-9
  • Jenkinson C, Fitzpatrick R, Garratt A, et al. Can item response theory reduce patient burden when measuring health status in neurological disorders? Results from Rasch analysis of the SF-36 physical functioning scale (PF-10). J Neurol Neurosurg Psychiatry 2001;71:220-4
  • Twiss J, Meads DM, Preston EP, et al. Can we rely on the Dermatology Life Quality Index (DLQI) as a measure of the impact of psoriasis or atopic dermatitis? J Invest Dermatol 2012;132:76-84
  • Tang K. Disease-related differential item functioning in the work instability scale for rheumatoid arthritis: converging results from three methods. Arthritis Arthritis Care Res (Hoboken) Res 2011;63:1159-69
  • Lundgren-Nilsson A, Tennant A, Grimby G, et al. Cross-diagnostic validity in a generic instrument: an example from the Functional Independence Measure in Scandinavia. Health Qual Life Outcomes 2006;4:55-63
  • Wann-Hansson C, Klevsgård R, Hagell P. Cross-diagnostic validity of the Nottingham Health Profile index of distress (NHPD). Health Qual Life Outcomes 2008;6:47-60
  • McKenna SP. Measuring patient-reported outcomes: moving beyond misplaced common sense to hard science. BMC Med 2011;9:86-98
  • Velanovich V. Comparison of generic (SF-36) vs. disease-specific (GERD-HRQL) quality-of-life scales for gastroesophageal reflux disease. J Gastrointest Surg 1998;2:141-5
  • Bessette L, Sangha O, Kuntz KM, et al. Comparative responsiveness of generic versus disease-specific and weighted versus unweighted health status measures in carpal tunnel syndrome. Med Care 1998;36:491-502
  • Jenkinson C, Gray A, Doll H, et al. Evaluation of index and profile measures of health status in a randomized controlled trial: comparison of the medical outcomes study 36-Item Short Form Health Survey, EuroQoL, and disease specific measures. Med Care 1997;35:1109-18
  • Weldam SWM, Schuurmans MR, Lui R, et al. Evaluation of Quality of Life instruments for use in COPD care and research: a systematic review. Int J Nurs Stud 2013;50:688-707
  • Puhan MA, Guyatt GH, Goldstein R, et al. Relative responsiveness of the Chronic Respiratory Questionnaire, St George’s Respiratory Questionnaire and four other health-related quality of life instruments for patients with chronic lung disease. Resp Med 2007;101:308-16
  • Wiebe S, Guyatt G, Weaver B, et al. Comparative responsiveness of generic and specific quality-of-life instruments. J Clin Epidemiol 2003;56:52-60
  • Allen MJ, Yen WM. Introduction to measurement theory. Long Grove, IL: Waveland Press, 2002
  • Kline TJB. Psychological testing: a practical approach to design and evaluation. California: SAGE Publications, Inc, 2005
  • De Champlain AF. A primer on classical test theory and item response theory for assessments in medical education. Med Educ 2010;44:109-17
  • Tennant A, McKenna SP, Hagell P. Application of Rasch analysis in the development and application of quality of life instruments. Value Health 2004;7:S22-6
  • Rasch G. Probabilistic models for some intelligence and attainment tests. Chicago, IL: University of Chicago Press, 1960 (Reprinted 1980)
  • Tennant A, Conaghan PG. The Rasch measurement model in rheumatology: what is it and why use it? When should it be applied, and what should one look for in a Rasch paper? Arthritis Rheum 2007;57:1358-62
  • Andrich D. Rasch models for measurement: Sage University Paper Series on quality application in the social sciences. Newbury Park, CA: Sage Publications Inc, 1988
  • Bond TG, Fox CM. Applying the Rasch model: fundamental measurement for the human sciences. Mahwah, NJ: Erlbaum, 2001
  • Wright BD. Fundamental measurement. Rasch Meas Tansact 1997;11:558
  • Qadir MJ, Gilani IG, Hameed A. Rasch calibration of general science test at Grade-VIII in Pakistan. J Educ Vocat Res 2012;3:98-106
  • Twiss J, McKenna SP. Comparing the impact of psoriasis and atopic dermatitis on quality of life: Co-calibration of the PSORIQoL and QoLIAD. Qual Life Res 2015;24:105-13
  • Latimer S, Covic T, Tennant A. Co-calibration of deliberate self-harm (DSH) behaviours: towards a common measurement metric. Psychiatry Res 2012;200:26-34
  • Bezruczko N, Perkins K. Multi-factor scale consolidation when theory is weak. J Appl Meas 2012;13:77-96
  • Meads DM, McKenna SP, Doward LC, et al. Development and validation of the Asthma Life Impact Scale (ALIS). Respir Med Med 2010;104:633-43
  • McKenna SP, Meads DM, Doward LC, et al. Development and validation of the living with chronic obstructive pulmonary disease questionnaire. Qual Life Res 2011;20:1043-52
  • McKenna SP, Doughty N, Meads DM, et al. The Cambridge Pulmonary Hypertension Outcome review (CAMPHOR): a measure of health-related quality of life and quality of life for patients with pulmonary hypertension. Qual Life Res 2006;15:103-1
  • Hunt SM, McKenna SP. The QLDS: a scale for the measurement of quality of life in depression. Health Policy 1992;22:307-19
  • Guttman L. The basis for scalogram analysis. In: Stouffer SA, ed. Measurement and prediction. New York: Wiley, 1950
  • Ayele DG, Zewotir T, Mwambi H. Using Rasch modeling to re-evaluate rapid malaria diagnosis test analyses. Int J Environ Res Public Health 2014;11:6681-91
  • Smith EV, Jr. Detecting and evaluating the impact of multidimensionality using item fit statistics and principal component analysis of residuals. J Appl Meas 2002;3:205-31
  • Andrich D, Sheridan B, Ludo G. Rasch unidimensional measurement models: a Windows-based item analysis program employing Rasch models (RUMM2020). Perth, Australia: Murdoch University, 2005
  • Doward LC, McKenna SP, Meads DM, et al. The development of patient-reported outcome indices for multiple sclerosis (PRIMUS). Mult Scler 2009;15:1092-102
  • Wilburn J, McKenna SP, Twiss J, et al. Assessing quality of life in Crohn's disease: development and validation of the Crohn's Life Impact Questionnaire (CLIQ). Qual Life Res 2015;24:2279-88
  • Whalley D, McKenna SP, de Jong Z, et al. Quality of life in rheumatoid arthritis. Br J Rheumatol 1997;36:884-8
  • Doward LC, Spoorenberg A, Cook SA, et al. Development of the ASQoL: a quality of life instrument specific to ankylosing spondylitis. Ann Rheum Dis 2003;62:20-6
  • Slutsky JR, Clancy CM. AHRQ's Effective Health Care Program: why comparative effectiveness matters. Effective Care: comparative matters. Am J Med Qual 2009;24:67-70
  • Mereles D, Ehlken N, Kreuscher S, et al. Exercise and respiratory training improves exercise capacity and quality of life in patients with severe chronic pulmonary hypertension. Circulation. 2006;114:1482-9
  • Gloeckl R, Marinov B, Pitta F. Practical recommendations for exercise training in patients with COPD. 2013;22:178-86
  • Meyer A, Günther S, Volmer T. A 12-month, moderate-intensity exercise training program improves fitness and quality of life in adults with asthma: a controlled trial. BMC Pulm Med 2015;15:56

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.