3,327
Views
8
CrossRef citations to date
0
Altmetric
Original Articles: Survivorship

Self-reported questionnaires for lymphoedema: a systematic review of measurement properties using COSMIN framework

ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Pages 379-391 | Received 26 Aug 2020, Accepted 05 Dec 2020, Published online: 21 Jan 2021

Abstract

Introduction

Lymphoedema may develop as a result of numerous genetic and traumatic causes; however, treatment for cancer is the most common cause of its development in more economically developed nations. This systematic review critically appraised, compared and summarised the measurement properties of lymphoedema-specific self-reported questionnaires (SRQs) measuring various patient-reported outcomes including quality of life (QOL), function, morbidity, and symptoms.

Methods

Seven databases were searched to identify studies of the measurement properties of SRQs. Two review teams independently evaluated the quality of the individual studies using the risk of bias tool from the Consensus-based Standards for the Selection of Health Measurement Instruments (COSMIN). Measurement properties of the SRQs presented in the studies were then rated. Study level ratings were summarised for an SRQ if they were reported in multiple studies, and their overall quality of the evidence were then graded.

Results

Forty articles, reporting on 19 SRQs were identified from 8615 records. The focus of the 19 SRQs included eight on QOL, four on symptoms, two on function, and two on impairment. The other three SRQs were on illness perception, self-efficacy, and patient-relevant treatment benefit, respectively. Eight and three SRQs were upper limb and lower limb-specific, respectively, whereas seven questionnaires were for both upper and lower limb lymphoedema. One SRQ was developed for head and neck lymphoedema. According to the COSMIN framework, none of the SRQs reviewed had sufficient evidence to support all nine measurement properties. In lower limb questionnaires, the LYMQOL-leg has sufficient content, structural, and construct validity as well as internal consistency and reliability. For upper limb lymphoedema questionnaires, the Lymph-ICF-UL had sufficient content and construct validity as well as reliability.

Conclusion

LYMQOL-leg SRQ is recommended with confidence for evaluation of QOL of people with lower limb lymphoedema while the Lymph-ICF-UL is recommended for evaluation of the QOL of the breast cancer-related lymphoedema with some confidence. In view of the high level of the indeterminate ratings of the measurement properties of the existing SRQs, further research is desirable.

Background

Lymphoedema is a progressive and disabling condition, characterised by an accumulation of protein-rich fluid in the interstitial spaces of the affected body part. This condition is broadly classified into two categories: (1) primary lymphoedema caused by congenital or hereditary abnormalities of the lymphatic system and (2) secondary lymphoedema caused by cancer treatments, infections, or trauma damaging the lymphatic system. The treatment for cancer is the most common cause of secondary lymphoedema in developed countries [Citation1]. Lymphoedema is a particularly feared consequence of the treatment for cancer as, if not well managed, it may impact negatively on the physical and psychosocial functions as well as the quality of life (QOL) of the people with lymphoedema [Citation2].

Evaluation of lymphoedema should include self-reported questionnaires (SRQs) to capture patient-reported outcomes [Citation3,Citation4], such as symptoms, physical and psychosocial functions as well as QOL. These patient-reported outcomes, apart from objective lymphoedema measures, are increasingly recognised as an integral part of patient-centred care [Citation5,Citation6]. However, to evaluate patient-reported outcomes accurately, the SRQs must have good measurement properties, which include domains like validity, reliability, and responsiveness [Citation5–7]. Validity is the extent to which an SRQ measures the outcome it intends to measure. Reliability is the extent to which the SRQ is free from measurement error. Responsiveness is the ability of an SRQ to detect change over time in the outcome it measures [Citation8]. There are several lymphoedema-specific SRQs currently available to evaluate similar patient-reported outcomes. To inform the selection of an SRQ with good measurement properties, a comprehensive systematic review that evaluates the measurement properties of all lymphoedema-specific SRQs is required [Citation7].

A few systematic reviews with limited scope have evaluated measurement properties of lymphoedema-specific SRQs [Citation9–12]. For example, a recent systematic review reported on the number of domains and symptoms included in SRQs that evaluated QOL of breast cancer-related lymphoedema (BCRL) [Citation10]. However, the review was limited to BCRL QOL SRQs and did not report on the measurement properties. Three other systematic reviews examined studies that had reported the utilisation of the SRQs in the evaluation of QOL of BCRL and cancer-related lower limb lymphoedema [Citation9,Citation11,Citation12]. These reviews reported on the commonly used SRQs, noting the lack of consensus regarding which SRQ to use. To achieve the consensus of use of SRQs, they argued for the need of a comprehensive systematic review of the measurement properties of the lymphoedema specific SRQs using a standard method such as the Consensus-based Standards for the Selection of Health Measurement Instruments (COSMIN) [Citation9]. The current comprehensive systematic review was, therefore, conducted to critically appraise, compare and summarise the measurement properties of lymphoedema-specific SRQs measuring various patient-reported outcomes (constructs) such as QOL, function, morbidity, and symptoms.

Methods

This systematic review was registered prospectively with the International Prospective Register of Systematic Reviews (PROSPERO reference no: CRD42017064645) and conducted based on the COSMIN guideline for systematic reviews [Citation7,Citation13–15].

Database search

The following electronic databases were searched from their date of inception: MEDLINE, Cochrane Central Register of Controlled Trials (CENTRAL), CINAHL, AMED, EMBASE, Scopus, and PsycINFO initially in May 2017, with updated searches in March 2019 and April 2020 using the comprehensive search strategy. The search was not restricted for any criteria such as English language or sex, although only studies published in English language were included in this review. The search strategy was developed using keywords from patient-reported outcomes of interest (QOL, function and symptom experience), target population (lymphoedema), and measurement properties identified in COSMIN taxonomy (Supplementary material 1) [Citation8,Citation16]. This search strategy was refined based on the results of a pilot search, and the documentation of the database search for a database (Medline) is given in Supplementary material 2.

Study selection

After removing duplicates, articles were screened by two teams of reviewers (team one: VP and SK; team two: ED and MJ) independently from each other for eligibility using the selection criteria. The titles and abstracts were first screened for eligibility, followed by full-text articles that were not rejected in the initial phase. Any discrepancies in the study selection between reviewers were resolved through discussion. Endnote (ClarivateTM, Philadelphia, PA, USA) and Covidence (Melbourne, Australia) were both used for study review and selection.

Selection criteria

Studies were included for this review if they: reported on the development of lymphoedema-specific SRQs; evaluated the measurement properties of lymphoedema-specific SRQs identified within the COSMIN taxonomy (Supplementary material 1) [Citation8]; were lymphoedema-specific SRQs that evaluated any one patient-reported outcome including the QOL, function, morbidity, and symptom experience; and included participants aged 18 or above with primary or secondary lymphoedema, and written in English. Articles were excluded if they were: reviews; studies that used SRQs only as an outcome measurement tool; studies in which only clinician or proxy-reported questionnaires were used; SRQs developed for screening or diagnostic purpose only; studies that included participants with oedema other than lymphoedema; or were not published in English. However, studies that included participants with and without lymphoedema to evaluate the known group validity were included.

Data extraction

Data extraction forms were based on COSMIN Methodology for Systematic Reviews of Patient-Reported Outcome Measures (PROMs) user manual [Citation17]. Two teams of reviewers (team one: VP and SK; team two: ED and MJ) independently extracted the data from each article. Discrepancies in extracted data were resolved through discussion. Data extraction included details related to: (1) development and testing of the SRQs including the patient-reported outcomes and population, recall period, subscales, response options, scoring details, target language, and the completion time; (2) participants’ details, details of their lymphoedema and information related to the SRQs administration; and (3) the design of the measurement property studies, the requirements of statistical tests performed and the results for each assessed measurement property [Citation8].

Risk of bias assessment

Two teams of reviewers (team one: VP and SK; team two: ED and MJ) independently assessed risk of bias for each measurement property reported in the study using the COSMIN Risk of Bias checklist [Citation14]. Any discrepancies in the assessment between the teams of reviewers were resolved through consensus. Each measurement property was rated as ‘very good’, ‘adequate’, ‘doubtful’, or ‘inadequate’ based on standards specific to that measurement property. These standards include requirements for both the design and statistical tests used to assess the measurement property. The lowest rating from the several standards that comprised the measurement property was its overall rating [Citation14]. Each measurement property included an item which asked if there were any other flaws; no additional flaws were identified and so these were uniformly scored ‘very good’. In addition, for evaluating ‘test–retest reliability’, a time interval between tests of one to two weeks was determined by the team to receive a rating of ‘very good’.

Rating of measurement property results

Each of the measurement properties reported in a study were rated using the updated criteria for good measurement properties by two independent teams of reviewers (team one: VP and SK; team two: ED and MJ) [Citation7]. Disagreement was resolved through discussion. A measurement property was rated as ‘sufficient (+)’ if it met the criteria, ‘insufficient (–)’ if it failed to meet the criteria, or ‘indeterminate (?)’ if there was lack of adequate information to determine whether it was sufficient or insufficient [Citation18,Citation19].

Summarising the data

When more than one study reported on the same measurement property for an SRQ, the results from all studies were qualitatively summarised. Using the COSMIN criteria for good measurement properties (V2, 2016 [Citation19]), an overall rating of ‘sufficient’, ‘insufficient’, or ‘indeterminate’ was determined for each measurement property for each SRQ [Citation7,Citation18,Citation19]. The results of the development and content validity studies were also qualitatively summarised, with the overall rating for these areas also determined for each SRQ [Citation15].

Grading the quality of evidence

The overall quality of the evidence was graded as ‘high’, ‘moderate’, ‘low’, or ‘very low’ using the modified Grading of Recommendation, Assessment, Development, and Evaluation (GRADE) criteria [Citation7]. The quality of the evidence of each reported measurement property was initially graded as ‘high’ and the level was downgraded step-wise based on following factors: (1) the methodological quality of the studies (risk of bias); (2) imprecision due to small sample size; and (3) indirectness of the result (i.e., evidence from the studies conducted at different setting and different populations than the interest of the review). For ease of interpretation, the quality of the evidence was dichotomised to good and poor-quality evidence; good-quality evidence denotes ‘moderate’ to ‘high’ gradings and poor-quality evidence denotes ‘low’ to ‘very low’ gradings.

Results

Study selection

The electronic databases search identified 8573 articles and the scoping of the reference lists of the relevant studies and other reviews identified 42 articles. After removing the duplicates and screening the titles and abstracts, 96 articles were selected for full-text review. Out of these 96 full-text articles, 40 articles, reporting on 19 different SRQs (Supplementary material 3), were selected for final review (). Articles that were excluded at full-article review are listed in Supplementary material 4, with the reason why they were excluded.

Figure 1. PRISMA flowchart of study selection.

Figure 1. PRISMA flowchart of study selection.

Characteristics of SRQs

Included SRQs were developed for measuring different patient-reported outcomes (). FLQA-LS, ULLQoL, LyQLI, SLQOLI, LYMQOL-arm, LYMQOL-leg, FLQA-L, and ULL 27 SRQs were for the measurement of the QOL [Citation20–34]. LSIDS-LL, LISIDS-A, BCLE-SEI, and LSIDS-H&N SRQs were for the measurement of the lymphoedema symptoms [Citation35–39]. Lymph-ICF-LL and Lymph-ICF-UL SRQs were for the evaluation of the Functioning, Disability and Health based on The International Classification of Functioning, Disability and Health (ICF) [Citation40–49]. LLIS Version 1 and 2 SRQs were for the evaluation of impairments [Citation50–55]. IPQ-R BCRL, EBS, and PBI-L SRQs were for the evaluation of illness perception, self-efficacy, and patient-relevant treatment benefit, respectively [Citation56–59].

Table 1. Characteristics of the included SRQs.

Included SRQs were developed either for a mixed population or a specific patient population. The questionnaires developed for a mixed population, including primary and secondary lymphoedema, were the FLQA-LS, LyQLI, SLQOLI, LYMQOL-leg, FLQA-L, LSIDS-LL, Lymph-ICF-LL, LLIS-V2, LLIS-V1, and PBI-L [Citation20,Citation22–24,Citation26,Citation27,Citation29–32,Citation35,Citation46–55,Citation58,Citation59]. The questionnaires developed exclusively for secondary lymphoedema were the ULLQoL, LYMQOL-arm, ULL 27, LSIDS-A, BCLE-SEI, LSIDs-H&N, Lymph-ICF-UL, IPQ-R BCRL, and EBS [Citation21,Citation26–28,Citation34,Citation36–42,Citation45,Citation56,Citation57]. These questionnaires were developed for cancer-related lymphoedema of the upper limb, such as BCRL, except the LSIDs-H&N and Exercise barriers self-efficacy questionnaires, which were for head and neck lymphoedema and any cancer-related lymphoedema, respectively [Citation39,Citation57]. Although IPQ-R BCRL included only women who are at-risk of BCRL in the validation study, this SRQ was included in this review ().

Other characteristics of the SRQs, such as recall period, number of subscales, number of items, response options, and language availability varied among the reviewed SRQs. The recall period ranged from the present [Citation56,Citation57] to the past four weeks [Citation22–24,Citation33,Citation34]. However, the recall period for three SRQs either varied or was not explicitly presented for some items [Citation25–31,Citation58,Citation59]. Except for the Exercise Barrier Self-Efficacy SRQ [Citation57], which had no subscales, other SRQs had between two [Citation21,Citation37,Citation38] and eight subscales [Citation35]. The number of items in the SRQs ranged from 5 to 188 items [Citation24,Citation57]. Combinations of scales, such as the Likert, visual analogue scale, or dichotomous (yes or no), were used in 10 SRQs [Citation20–23,Citation26–28,Citation30,Citation31,Citation35–39,Citation55]. Likert type scales only were used in six SRQs [Citation32–34,Citation50–55,Citation58,Citation59], and visual analogue scales only were used in three SRQs [Citation40–49,Citation57]. With inclusion of the translated questionnaires, 10 SRQs were in English [Citation21,Citation26,Citation35,Citation36,Citation38,Citation39,Citation52,Citation55], and 17 SRQs were in other languages including: German [Citation20,Citation32,Citation59]; Dutch [Citation29,Citation33,Citation40,Citation42,Citation45,Citation49]; Turkish [Citation25,Citation28,Citation30,Citation31,Citation44,Citation47,Citation51,Citation53,Citation58]; Chinese [Citation37,Citation46,Citation56]; Brazilian Portuguese [Citation48]; French [Citation34,Citation41]; Danish [Citation43]; Arabic [Citation50]; Persian [Citation54]; and Swedish [Citation27]. Other characteristics and available translations are presented in .

Characteristics of participants

Overall, 5833 participants were included in the 40 studies that evaluated the measurement properties of the 19 SRQs () [Citation20–59]. Participants were predominantly women (89.9%) with a mean age ranging from 48 to 64 years. Sample size varied among the included studies from 10 [Citation48] to 446 [Citation54]. Eighteen studies were conducted exclusively on women [Citation25,Citation28,Citation33,Citation34,Citation36–38,Citation40–46,Citation48,Citation51,Citation56,Citation57]. The severity and duration of lymphoedema were not reported in all studies, and those reported did not report them uniformly (). Other details regarding the instrument administration are presented in .

Table 2. Characteristics of the included study populations.

Summary of findings

Seven of the nine measurement properties identified in the COSMIN taxonomy were rated and graded for the SRQs included in this review (Supplementary material 3 and ). Although eight SRQs were translated into different languages, none of them could be rated for cross-cultural adaptation because they did not assess measurement invariance [Citation24,Citation25,Citation27–31,Citation33,Citation37,Citation41,Citation43,Citation44,Citation46–48,Citation50,Citation51,Citation53,Citation54,Citation58]. Similarly, despite two of the included SRQs reporting on criterion validity, this finding was not rated due to lack of a gold standard SRQ as a criterion with which they could be compared [Citation26,Citation37].

Table 3. Summary of findings of the measurement properties of the included SRQs.

From the 19 SRQs reviewed, the ULLQoL, LYMQOL-arm, LYMQOL-leg, LSIDs-H&N, Lymph-ICF-UL, Lymph-ICF-LL, LLIS Version 1 SRQs were rated as sufficient for content validity with good-quality evidence () [Citation21,Citation27,Citation28,Citation31,Citation36,Citation39,Citation45,Citation49,Citation52,Citation55,Citation59]. The LYMQOL-leg SRQ was also rated sufficient for structural validity, internal consistency, and hypothesis testing with good-quality evidence [Citation26,Citation27,Citation29–31]. Only content validity was reported for the LSIDs-H&N SRQ. The LYMQOL-arm, Lymph-ICF-LL, and LLIS Version 1 were rated insufficient for structural validity based on good-quality evidence. The structural validity of the ULLQoL was not graded due to indeterminate rating, and the structural validity for the Lymph-ICF-UL was not reported. Although high Cronbach’s alphas for internal consistency were reported for 13 SRQs, internal consistencies were rated indeterminate due to lack of evidence for their structural validity [Citation21,Citation25–28,Citation40–49,Citation53–55]. The hypothesis testing for the ULLQoL and Lymph-ICF-UL were rated sufficient based on good-quality evidence. The responsiveness for the ULLQoL and the reliability for the Lymph-ICF-UL were rated sufficient based on good-quality evidence. Quality grading of only IPQ-R SRQ [Citation56] was downgraded since the developers included only at-risk women in the one available study rather than women with BCRL. Overall, only 3% of the reported measurement properties were rated as insufficient based on good-quality evidence ().

Table 4. Decision matrix for the interpretation of the results.

Discussion

Nineteen lymphoedema-specific SRQs were identified, and their measurement properties critically appraised and summarised. According to the COSMIN framework, none of the SRQs was rated as sufficient based on good-quality evidence for all nine measurement properties. However, LYMQOL-leg and Lymph-ICF-UL SRQs were rated as sufficient with respect to content validity, reliability, and construct validity based on good-quality evidence [Citation26,Citation27,Citation29–31,Citation40–45]. Each of the 17 other SRQs had one or more measurement properties that were rated as sufficient and supported by good quality evidence. Overall, only 3% of the reported measurement properties were rated as insufficient based on good-quality evidence and therefore not acceptable. However, in all SRQs, the criteria for at least one or more measurement properties were found to be indeterminate (33%) or not reported on at all (33%). Reasons why criteria were indeterminate were related to lacking details in the reporting of the methods, whereas for others, it may reflect lack of knowledge in the developers on the steps involved in establishing the acceptability of measurement properties.

The rating of ‘sufficient’ for a measurement property is given when studies have met the criteria outlined by COSMIN’s updated criteria for good measurement properties [Citation7]. However, the extent to which risk of bias is minimised in these studies determines the quality of evidence. Good-quality evidence implies that all steps taken to study that measurement property have used appropriate methods, sample size, and participants [Citation14]. It is the combination of the rating of meeting the required criteria as well as the quality of evidence supporting it that determines whether a measurement property of an SRQ is or is not fully addressed (). A measurement property is not addressed if it is rated as ‘insufficient’ and based on good-quality evidence. It is questionable whether further research would address these significant issues. For all other combinations, further evidence through either improved documentation or further research is required to indicate that the measurement property is fully addressed. In the current review, and as highlighted in , the majority of the measurement properties of most SRQs required further evidence.

Inadequate reporting contributed to a measurement property receiving an indeterminate rating, a common occurrence identified both by the developers of COSMIN framework [Citation14,Citation15] as well in others who have reviewed SRQs using COSMIN methodology [Citation60,Citation61]. This inadequacy may be related to lack of guidelines for reporting of studies that develop or evaluate the measurement properties of SRQs [Citation62,Citation63]. The implementation of guidelines has improved reporting of other types of studies such as the CONSORT guidelines used to report randomised controlled trials [Citation64,Citation65]. The lack of guidelines for reporting on the development of SRQs is currently being addressed, with a guideline currently under development [Citation66]. Meanwhile, evaluation frameworks, such as COSMIN, can be used by developers of an SRQs to report against criteria for each measurement property they have addressed [Citation7,Citation13,Citation14,Citation62,Citation67].

The first step in determining the usability of an SRQ is in establishing its content validity as this step determines that it is relevant, complete and understandable for the intended population [Citation7,Citation68]. Without addressing this measurement property, investigation of other measurement properties is not warranted [Citation15,Citation68]. In this review, only seven SRQs had acceptable content validity: LYMQOL-leg, ULLQoL, LYMQOL-arm, LSIDs-H&N, Lymph-ICF-UL, Lymph-ICF-LL, and LLIS Version 1 SRQs [Citation21,Citation25–31,Citation39–55]. The other SRQs did not meet the criteria for content validity as they did not involve the target population in content development or validation [Citation35,Citation56] or did not describe their methods in enough detail to meet the criteria of COSMIN [Citation20,Citation22–24,Citation26,Citation32–34,Citation36–38,Citation57–59]. One reason why some SRQs may not have addressed the criteria was that the COSMIN criteria were only published in 2018; the majority (65%) of the studies in this review were conducted prior to 2018. Since none of the SRQ was rated insufficient for content validity, further evaluation is warranted of this measurement property in consultation with the content validity criteria may improve the content validity of the existing SRQs.

Another step that was not adhered to in developing the SRQs was related to the assessment of internal consistency. Internal consistency is, in part, conditional on first establishing the acceptability of structural validity [Citation7,Citation69]. Structural validity ensures the score of the SRQ is an adequate reflection of the construct (outcome) and dimensions of the construct (subscales), whereas internal consistency ensures that interrelatedness of the items in the scale or subscales [Citation7]. Of the seven SRQs in which content validity was addressed, only the LYMQOL-leg was addressed for internal consistency [Citation26,Citation27,Citation29–31]. This was the only SRQ that had undergone confirmatory factor analysis to confirm the subdomains within the SRQ. The other SRQs did not undertake this analysis, although they did report high Cronbach’s alphas, i.e., internal consistency, for their SRQs [Citation21,Citation26–28,Citation31,Citation39–53,Citation55]. Establishment of the subdomains using confirmatory factor analysis for these other SRQs would establish both their structural validity and internal consistency.

For SRQs to be useful in clinical practice, it is important to ensure that they are reliable, with low measurement error, and are responsive to changes in the condition for which they were designed. Although test–retest reliability and measurement error are interrelated measurement properties, measurement error was less reported than test–retest reliability. Of the seven SRQs that had the acceptable content validity, only LYMQOL-leg and Lymph-ICF-UL had acceptable reliability whereas both of them did not have acceptable measurement error [Citation26,Citation27,Citation29–31,Citation40–45]. Small sample sizes of the retest were a common problem contributing to poor quality evidence for both test–retest reliability and the measurement error in the included studies [Citation45,Citation49]. Studies with adequate retest sample size are, therefore, required to strengthen the evidence of test–retest reliability and measurement error existing SRQs. In addition, none of the reviewed SRQs had acceptable responsiveness. Lack of acceptable responsiveness of the SRQs was due to the low methodological quality of the studies and low sample size [Citation27,Citation29,Citation31,Citation36,Citation46,Citation49]. Since longitudinal studies are required for the evaluation of the responsiveness, longitudinal observational or intervention trials in which these SRQs are used may be an efficient approach to addressing this measurement property.

Cross-cultural adaptation of existing SRQs assists in comparing outcomes across different cultures [Citation70,Citation71]. This cross-cultural adaptation process consists of two steps: (1) translation of an existing SRQ followed by ascertainment of the translated version’s equivalence with the original SRQ and (2) validation of the translated questionnaire [Citation71]. All 11 cross-culturally adapted SRQs followed acceptable translation and equivalence processes and reported various measurement properties that were evaluated during the validation process [Citation24,Citation25,Citation29,Citation30,Citation33,Citation37,Citation43,Citation44,Citation46–48,Citation54]. However, none of these cross-culturally adapted SRQs analysed the measurement invariance, an essential step in validation for cross-cultural validity [Citation14,Citation19]. In brief, measurement invariance compares the responses to items from two cohorts that are similar in respect to the population characteristics other than the language [Citation7,Citation14,Citation19,Citation72]. Omission of this step may be due to a lack of awareness of this critical step. Evaluation of the measurement invariance of the translated SRQs is, therefore, required.

Overall, it was disappointing that so few SRQs could be supported in this review. One issue why this occurred is related to how the score for the measurement property is determined within the COSMIN framework. The score is based solely on the lowest score across all of the criteria underpinning that property, which range from three criteria for criterion validity to eight criteria for reliability. This stringent approach led to harsh ratings for methodological quality that may simply have reflected lack of sufficient documentation around a study design, an issue also identified by others [Citation62,Citation63]. Based on the scoring for COSMIN risk of bias tool, several reviews have concluded that most of the included studies in their reviews were of poor methodological quality, with inconclusive evidence to support any of the reviewed SRQs [Citation60–62,Citation73]. In our current review, this issue was overcome, in part for LYMQOL-leg, and Lymph-ICF-UL and LLIS SRQs, when multiple studies evaluated the measurement properties of the same SRQs [Citation26,Citation27,Citation29–31,Citation40–45]. Additional studies can be undertaken by others to address gaps in the evidence for specific measurement properties. Furthermore, the reporting guidelines currently in development [Citation66] should improve documentation of the steps taken to investigate the measurement property.

In this review, every effort was made to include all the studies that evaluated the measurement properties of lymphoedema specific SRQs. One limitation of our study is that we only included studies that were reported in English. As a consequence, one SRQ in which the evaluation was published in Japanese was excluded [Citation74]. On the other hand, one of the strengths of this review is its comprehensiveness and adherence to the Prisma protocol (Supplementary material 5) [Citation75]. We included all lymphoedema-specific SRQs, unlike previous systematic reviews which restricted their inclusion to either breast or gynaecological cancer related lymphoedema-specific SRQs only [Citation10–12]. Another strength of this review is that we used the COSMIN methodology to conduct the systematic review, including the new COSMIN risk of bias tool, the updated criteria for measurement properties and the modified GRADE approach to grade the overall quality of evidence [Citation7,Citation8,Citation14,Citation15,Citation19]. Use of a modified GRADE approach facilitated the process of recommendation based on the overall quality of evidence synthesised from different studies similar to the systematic reviews of effectiveness trials.

Conclusion

In summary, this systematic review revealed that out of 19 SRQs included in this review, the LYMQOL-leg SRQ could be recommended with confidence for evaluation of QOL of people with lower limb lymphoedema. Furthermore, the Lymph-ICF-UL can be recommended for evaluation of the QOL of the BCRL with some confidence. In view of the high level of the indeterminate ratings and lack of reporting some measurement properties of all existing SRQs, further validation studies are desirable.

Supplemental material

Supplemental Material

Download Zip (120 KB)

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

  • Rockson SG, Keeley V, Kilbreath S, et al. Cancer-associated secondary lymphoedema. Nat Rev Dis Primers. 2019;5(1):22.
  • Fu MR, Ridner SH, Hu SH, et al. Psychosocial impact of lymphedema: a systematic review of literature from 2004 to 2011. Psychooncology. 2013;22(7):1466–1484.
  • McLaughlin SA, Staley AC, Vicini F, et al. Considerations for clinicians in the diagnosis, prevention, and treatment of breast cancer-related lymphedema: recommendations from a Multidisciplinary Expert ASBrS Panel: Part 1: definitions, assessments, education, and future directions. Ann Surg Oncol. 2017;24(10):2818–2826.
  • Levenhagen K, Davies C, Perdomo M, et al. Diagnosis of upper quadrant lymphedema secondary to cancer: clinical practice guideline from the Oncology Section of the American Physical Therapy Association. Phys Ther. 2017;97(7):729–745.
  • Tamura MK. Introduction to patient-reported outcomes perspectives series. Clin J Am Soc Nephrol. 2017;12(11):1881–1881.
  • Wu AW, Kharrazi H, Boulware LE, et al. Measure once, cut twice-adding patient-reported outcome measures to the electronic health record for comparative effectiveness research. J Clin Epidemiol. 2013;66(8 Suppl.):S12–S20.
  • Prinsen CAC, Mokkink LB, Bouter LM, et al. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27(5):1147–1157.
  • Mokkink LB, Terwee CB, Patrick DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63(7):737–745.
  • Coriddi M, Dayan J, Sobti N, et al. Systematic review of patient-reported outcomes following surgical treatment of lymphedema. Cancers (Basel). 2020;12(3):565.
  • Cornelissen AJM, Kool M, Keuter XHA, et al. Quality of life questionnaires in breast cancer-related lymphedema patients: review of the literature. Lymphat Res Biol. 2018;16(2):134–139.
  • Pusic AL, Cemal Y, Albornoz C, et al. Quality of life among breast cancer patients with lymphedema: a systematic review of patient-reported outcome instruments and outcomes. J Cancer Surviv. 2013;7(1):83–92.
  • Cemal Y, Jewell S, Albornoz CR, et al. Systematic review of quality of life and patient reported outcomes in patients with oncologic related lower extremity lymphedema. Lymphat Res Biol. 2013;11(1):14–19.
  • Mokkink LB, Prinsen CA, Patrick DL, et al. COSMIN methodology for systematic reviews of patient-reported outcome measures (PROMs); user manual; 2018. Available from: https://www.cosmin.nl/tools/checklists-assessing-methodological-study-qualities/checklists-assessing-methodological-study-qualities/
  • Mokkink LB, de Vet HCW, Prinsen CAC, et al. COSMIN risk of bias checklist for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27(5):1171–1179.
  • Terwee CB, Prinsen CAC, Chiarotto A, et al. COSMIN methodology for evaluating the content validity of patient-reported outcome measures: a Delphi study. Qual Life Res. 2018;27(5):1159–1170.
  • Kleinlugtenbelt YV, Nienhuis RW, Bhandari M, et al. Are validated outcome measures used in distal radial fractures truly valid? A critical assessment using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Bone Joint Res. 2016;5(4):153–161.
  • Terwee CB, Mokkink LB, Knol DL, et al. Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist. Qual Life Res. 2012;21(4):651–657.
  • Terwee CB, Bot SDM, de Boer MR, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60(1):34–42.
  • Prinsen CA, Vohra S, Rose MR, et al. How to select outcome measurement instruments for outcomes included in a "Core Outcome Set" – a practical guideline. Trials. 2016;17(1):449.
  • Augustin M, Conde Montero E, Hagenström K, et al. Validation of a short-form of the Freiburg Life Quality Assessment for lymphoedema (FLQA-LS) instrument. Br J Dermatol. 2018;179(6):1329–1333.
  • Williams AE, Rapport F, Russell IT, et al. Psychometric development of the Upper Limb Lymphedema Quality of Life Questionnaire demonstrated the patient-reported outcome measure to be a robust measure for breast cancer-related lymphedema. J Clin Epidemiol. 2018;100:61–70.
  • Klernäs P, Johnsson A, Boyages J, et al. Test of responsiveness and sensitivity of the questionnaire "Lymphedema Quality of Life Inventory". Lymphat Res Biol. 2018;16(3):300–308.
  • Klernas P, Johnsson A, Horstmann V, et al. Lymphedema Quality of Life Inventory (LyQLI)—development and investigation of validity and reliability. Qual Life Res. 2015;24(2):427–439.
  • Klernas P, Kristjanson LJ, Johansson K. Assessment of quality of life in lymphedema patients: validity and reliability of the Swedish version of the Lymphedema Quality of Life Inventory (LQOLI). Lymphology. 2010;43(3):135–145.
  • Borman P, Yaman A, Denizli M, et al. The reliability and validity of Lymphedema Quality of Life Questionnaire-Arm in Turkish patients with upper limb lymphedema related with breast cancer. Turk J Phys Med Rehab. 2018;64(3):205–212.
  • Keeley V, Crooks S, Locke J, et al. A quality of life measure for limb lymphoedema (LYMQOL). J Lymphoedema. 2010;5(1):26–37.
  • Wedin M, Fredrikson M, Ahlner E, et al. Validation of the Lymphoedema Quality of Life Questionnaire (LYMQOL) in Swedish cancer patients. Acta Oncol. 2020;59(3):365–371.
  • Karayurt O, Deveci Z, Eyigor S, et al. Adaptation of quality of life measure for limb lymphedema-arm in Turkish women with breast cancer-related lymphedema. Cancer Nurs. 2019;28:2828.
  • van de Pas CB, Biemans AA, Boonen RS, et al. Validation of the Lymphoedema Quality-of-Life Questionnaire (LYMQOL) in Dutch patients diagnosed with lymphoedema of the lower limbs. Phlebology. 2016;31(4):257–263.
  • Bakar Y, Tuğral A. Translation, reliability, and validation of the Turkish version of the lymphedema quality-of-life tool in Turkish-speaking patients with lower limb lymphedema. J Vasc Nurs. 2019.
  • Borman P, Yaman A, Denizli M, et al. The reliability and validity of lymphedema quality of life questionnaire-leg in Turkish patients with lower limb lymphedema. Lymphat Res Biol. 2020;18(1):42–48.
  • Augustin M, Bross F, Foldi E, et al. Development, validation and clinical use of the FLQA-I, a disease-specific quality of life questionnaire for patients with lymphedema. Vasa. 2005;34(1):31–35.
  • Viehoff PB, van Genderen FR, Wittink H. Upper limb lymphedema 27 (ULL27): Dutch translation and validation of an illness-specific health-related quality of life questionnaire for patients with upper limb lymphedema. Lymphology. 2008;41(3):131–138.
  • Launois R, Megnigbeto AC, Pocquet K, et al. A specific quality of life scale in upper limb lymphedema: the ULL-27 questionnaire. Lymphology. 2002;35(1-760):181–187.
  • Ridner SH, Doersam JK, Stolldorf DP, et al. Development and validation of the lymphedema symptom intensity and distress survey-lower limb. Lymphat Res Biol. 2018;16(6):538–546.
  • Ridner SH, Dietrich MS. Development and validation of the lymphedema symptom and intensity survey-arm. Support Care Cancer. 2015;23(10):3103–3112.
  • Shi S, Lu Q, Fu MR, et al. Psychometric properties of the breast cancer and lymphedema symptom experience index: the Chinese version. Eur J Oncol Nurs. 2016;20:10–16.
  • Fu MR, Axelrod D, Guth AA, et al. mHealth self-care interventions: managing symptoms following breast cancer treatment. mHealth. 2016;2:28.
  • Deng J, Ridner SH, Murphy BA, et al. Preliminary development of a lymphedema symptom assessment scale for patients with head and neck cancer. Support Care Cancer. 2012;20(8):1911–1918.
  • De Vrieze T, Gebruers N, Nevelsteen I, et al. Responsiveness of the lymphedema functioning, disability, and health questionnaire for upper limb lymphedema in patients with breast cancer-related lymphedema. Lymphat Res Biol. 2020;18(4):365–373.
  • De Vrieze T, Frippiat J, Deltombe T, et al. Cross-cultural validation of the French version of the lymphedema functioning, disability and health questionnaire for upper limb lymphedema (Lymph-ICF-UL). Disabil Rehabil. 2020:1–8. DOI:10.1080/09638288.2020.1716271
  • De Vrieze T, Vos L, Gebruers N, et al. Revision of the lymphedema functioning, disability and health questionnaire for upper limb lymphedema (Lymph-ICF-UL): reliability and validity. Lymphat Res Biol. 2019;17(3):347–355.
  • Grarup KR, Devoogdt N, Strand LI. The Danish version of lymphoedema functioning, disability and health questionnaire (Lymph-ICF) for breast cancer survivors: translation and cultural adaptation followed by validity and reliability testing. Physiother Theory Pract. 2019;35(4):327–340.
  • Kostanoglu A, Hosbay Z, Tarakci E. Lymphoedema functioning, disability and health questionnaire Turkish version: translation, cross-cultural adaptation and validation. J Phys Ther Sci. 2016;28(6):1728–1732.
  • Devoogdt N, Van Kampen M, Geraerts I, et al. Lymphoedema functioning, disability and health questionnaire (Lymph-ICF): reliability and validity. Phys Ther. 2011;91(6):944–957.
  • Wang C-M, Lee S-Y, Hsu K-F, et al. Psychometric evaluation of a Chinese version of lymphoedema functioning, disability and health questionnaire for lower limb lymphoedema in women with gynaecological cancer surgery. Eur J Cancer Care (Engl). 2018;27(6):1–11.
  • Kostanoğlu A, Mbata GB, Gökmen GY, et al. The lymphedema functioning, disability, and health questionnaire for lower limb lymphedema: translation, reliability, and validation study of the Turkish version. Turk Gogus Kalp Dama. 2017;25(4):586–591.
  • Ferreira KR, Carvalho RBM, de Andrade MFC, et al. Translation and cross-cultural adaptation of the lymphoedema functioning, disability and health questionnaire for lower limb lymphoedema into Portuguese language. Rev Bras Ginecol Obstet. 2016;38(2):88–96.
  • Devoogdt N, De Groef A, Hendrickx A, et al. Lymphoedema functioning, disability and health questionnaire for lower limb lymphoedema (Lymph-ICF-LL): reliability and validity. Phys Ther. 2014;94(5):705–721.
  • Abu Sharour L. Psychometric evaluation of the Arabic version of the lymphedema life impact scale in breast cancer patients. Breast J. 2020;26(3):563–565.
  • Orhan C, Uzelpasaci E, Baran E, et al. The reliability and validity of the Turkish version of the lymphedema life impact scale in patients with breast cancer-related lymphedema. Cancer Nurs. 2020;43(5):375–383.
  • Weiss J, Daniel T. Validation of the lymphedema life impact scale version 2: a condition-specific measurement tool for persons with lymphedema. Rehabil Oncol. 2018;36(1):28–36.
  • Degirmenci B, Tuzun S, Of NS, et al. Reliability and validity of Turkish version of lymphedema life impact scale. Turk J Phys Med Rehab. 2019;65(2):147–153.
  • Haghighat S, Montazeri A, Zayeri F, et al. Psychometric evaluation of the Persian version of the lymphedema life impact scale (LLIS, version 1) in breast cancer patients. Health Qual Life Outcomes. 2018;16(1).
  • Weiss J, Daniel T. Validation of the lymphedema life impact scale (LLIS): a condition-specific measurement tool for persons with lymphedema. Lymphology. 2015;48(3):128–138.
  • Huang W, Zhang L, Yan J. Psychometric evaluation of the Chinese version of the revised illness perception questionnaire for breast cancer-related lymphedema. Eur J Cancer Care (Engl). 2019;28(1):e12900.
  • Buchan J, Janda M, Box R, et al. Exercise barriers self-efficacy: development and validation of a subcale for individuals with cancer-related lymphedema. Health Qual Life Outcomes. 2015;13(1):37.
  • Duygu E, Bakar Y, Keser I. An important tool in lymphedema management: validation of Turkish version of the patient benefit index-lymphedema. Lymphat Res Biol. 2020;18(1):49–55.
  • Blome C, Augustin M, Heyer K, et al. Evaluation of patient-relevant outcomes of lymphedema and lipedema treatment: development and validation of a new benefit tool. Eur J Vasc Endovasc Surg. 2014;47(1):100–107.
  • Rotenberg S, Ruthralingam M, Hnatiw B, et al. Measurement properties of the multiple errands test: a systematic review. Arch Phys Med Rehabil. 2020;101(9):1628–1642.
  • van Zijl F, Mokkink LB, Haagsma JA, et al. Evaluation of measurement properties of patient-reported outcome measures after rhinoplasty: a systematic review. JAMA Facial Plast Surg. 2019;21(2):152–162.
  • Speyer R, Kim JH, Doma K, et al. Measurement properties of self-report questionnaires on health-related quality of life and functional health status in dysphonia: a systematic review using the COSMIN taxonomy. Qual Life Res. 2019;28(2):283–296.
  • Crudgington H, Rogers M, Morris H, et al. Epilepsy-specific patient-reported outcome measures of children's health-related quality of life: a systematic review of measurement properties. Epilepsia. 2020;61(2):230–248.
  • Moher D, Hopewell S, Schulz KF, et al. CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials. Int J Surg. 2012;10(1):28–55.
  • Schulz KF, Altman DG, Moher D. CONSORT 2010 statement: updated guidelines for reporting parallel group randomised trials. BMJ. 2010;340:c332.
  • COSMIN. Research & publications COSMIN; 2020; [cited 2020 Apr 6]. Available from: https://www.cosmin.nl/finding-right-tool/conducting-study-measurement-properties/
  • Terwee CB, Prinsen CA, Chiarotto A, et al. COSMIN methodology for evaluating the content validity of patient-reported outcome measures: a Delphi study. Qual Life Res. 2018;27(5):1159–1170.
  • U.S. Department of Health and Human Services Food and Drug Administration (FDA) CfDEaRC, Center for Biologics Evaluation and Research (CBER), Center for Devices and Radiological Health (CDRH). Guidance for industry patient-reported outcome measures: use in medical product development to support labeling claims. Rockville (MD); 2009; [cited 2019 Dec 23]. Available from: https://www.fda.gov/media/77832/download
  • Mokkink LB, Prinsen CAC, Bouter LM, et al. The COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) and how to select an outcome measurement instrument. Braz J Phys Ther. 2016;20(2):105–113.
  • International Test Commission. The ITC guidelines for translating and adapting tests (2nd ed.). IJT. 2018;18(2):101–134.
  • Borsa JC, Damásio BF, Bandeira DR. Cross-cultural adaptation and validation of psychological instruments: some considerations. Paidéia (Ribeirão Preto). 2012;22(53):423–432.
  • Boateng GO, Neilands TB, Frongillo EA, et al. Best practices for developing and validating scales for health, social, and behavioral research: a primer. Front Public Health. 2018;6:149.
  • Sharawi N, Klima L, Shah R, et al. Evaluation of patient-reported outcome measures of functional recovery following caesarean section: a systematic review using the consensus-based standards for the selection of health measurement instruments (COSMIN) checklist. Anaesthesia. 2019;74(11):1439–1455.
  • Tsuchiya M. Development of the Arm Lymphoedema Physical Discomfort Scale: validity and reliability testing with Japanese breast cancer survivors. Japan J Health Psychol. 2012;25(1):74–82.
  • Moher D, Liberati A, Tetzlaff J, et al. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. 2009;6(7):e1000097.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.