ORIGINAL RESEARCH

Measurement Properties and Optimal Cutoff Point of the WHO-5 Among Chinese Healthcare Students

Pages 5141-5158 | Received 27 Aug 2023, Accepted 26 Nov 2023, Published online: 21 Dec 2023

Abstract

Purpose

The World Health Organization-Five Well-Being Index (WHO-5) is widely used to assess subjective well-being. Nevertheless, measurement invariance and optimal cutoff point of the WHO-5 have not been examined in Chinese samples. We aimed to assess measurement properties of the Chinese version of the WHO-5 (WHO-5-C) among healthcare students.

Patients and Methods

A two-wave longitudinal assessment was conducted among 343 Chinese healthcare students from September to November 2022. Measurement properties of the WHO-5-C were assessed through structural validity using confirmatory factor analysis (CFA), measurement invariance using multigroup CFA (MGCFA) and longitudinal CFA (LCFA), convergent validity using correlation analysis with the Self-Rated Health Questionnaire (SRHQ) and Patient Health Questionnaire-4 (PHQ-4), reliability using internal consistency and test–retest reliability, and optimal cutoff point using receiver operating characteristic (ROC) analysis.

Results

The WHO-5-C demonstrated satisfactory structural validity with comparative fit index (CFI) of 0.968 at baseline and 0.980 at follow-up, and adequate measurement invariance across sociodemographic variables at baseline (gender, age, major, home location, being only child, monthly household income, part-time job, physical exercise, hobby, frequency of visiting home, and stress coping strategy) (CFI changes [ΔCFI] = −0.009–0.003) and over a week (ΔCFI = −0.006–0.000). The WHO-5-C also had good internal consistency (Cronbach’s α = 0.907–0.934; McDonald’s ω = 0.908–0.935) and test–retest reliability (intraclass correlation coefficient [ICC] = 0.803). Convergent validity was supported by moderate correlations of the WHO-5-C with the SRHQ and PHQ-4. The optimal cutoff point of the WHO-5-C was found to be 50, with an area under the ROC curve of 0.882 at baseline and with sensitivity of 0.803 and specificity of 0.762 at follow-up.

Conclusion

The WHO-5-C demonstrated adequate measurement properties, especially concerning cross-sectional and longitudinal measurement invariance, with a recommended optimal cutoff point of ≥ 50 for assessing adequate level of psychological well-being in healthcare students.

Introduction

Subjective well-being (SWB) refers to an individual’s ability to develop their potential, to work productively and creatively, to build strong and positive relationships with others, and to contribute to their community.Citation1 SWB involves multiple dimensions and contributes to all aspects of human life, eg, optimistic attitude, positive affect, psychological resilience, and happiness.Citation2 Positive SWB is recognized as having crucial consequences for better physical outcomes in both healthy populations and patients suffering from various diseases, such as fewer self-perceived symptoms, fewer pain sensations, and longer survival.Citation3–5 As such, positive SWB has been regarded as a conducive health asset given its explicit associations with salutary outcomes of mental health,Citation6 behaviors,Citation7 and disease progression and rehabilitation.Citation8 Therefore, identifying SWB carries clinical significance for public and psychosomatic health.

There has been increasing evidence showing that college students are exposed to excessive academic pressures that may affect their academic performance, lifestyle, and physical health, and may even lead to psychosomatic disorders.Citation9–11 Healthcare students are expected to provide professional support in health services for individuals and societies in the futureCitation12 and are at risk of physical and mental disorders.Citation13,Citation14 Worldwide, approximately one-third of healthcare students suffer from negative emotions, a proportion notably higher than in the general population and among non-healthcare students.Citation15,Citation16 A meta-analysis of mental health problems among healthcare students in China showed that the prevalence of depression, anxiety, and suicidal ideation was 29%, 21%, and 11%, respectively, underlining the severity of the issue and the urgency of better understanding their well-being.Citation17 Therefore, instruments with a positive focus can suggest to respondents that such programs are for individuals with pleased SWB and in this way support, rather than detract from, these initiatives.

There are numerous approaches to explaining and assessing well-being, and several instruments have been developed to monitor well-being.Citation18 To date, the popular instruments for assessing SWB state are mostly symptom-based, eg, stress, anxiety, depression and bipolar disorders.Citation19–21 Existing instruments developed to evaluate well-being are mostly based on a single perspective.Citation18 For instance, one of the most widely used instruments for evaluating SWB is the Warwick-Edinburgh Mental Well-Being Scale (WEMWBS), a 14-item instrument developed to assess hedonic and eudaimonic elements of mental health.Citation22 Although previous research has endorsed the WEMWBS as having appropriate capacity to assess mental well-being in Chinese samples, it does not consider purpose in physical condition, which is an essential component of overall well-being.Citation23,Citation24 Likewise, other instruments introduced into China to determine SWB include the Personal Wellbeing Index (PWI),Citation25 Psychological Well-Being Scale (PWBS),Citation26 etc., and most only assess satisfaction level in terms of health-related quality of life. It is necessary to identify an instrument that can capture a comprehensive conception of SWB, including affective-emotional aspects and physiological feelings.

The positively worded World Health Organization-Five Well-Being Index (WHO-5) is one of the measures from a positive perspective, aiming to measure overall wellness and cover healthy elements psychologically and physically.Citation27 As a multidimensional screening instrument, the WHO-5 initially comprised twenty-eight items and was progressively shortened to five positive items to produce a brief and comprehensive scale of well-being.Citation27,Citation28 To date, there are more than thirty official language versions of the WHO-5 that have been endorsed by the World Health Organization (WHO).Citation27 The scale has been adapted and validated across numerous cultures, including Arabic,Citation29 Brazilian,Citation30 Chinese,Citation31 German,Citation32 Icelandic,Citation33 Japanese,Citation34 Malay,Citation35 Polish,Citation36 Portuguese,Citation30 Sinhala,Citation37 Spanish,Citation38 Swahili,Citation39 Swedish,Citation40 and Turkish.Citation41 The original English version of the WHO-5,Citation42 as well as the JapaneseCitation34 and PolishCitation36 versions, had a single-factor structure with excellent reliability and validity in adolescents with type 1 diabetes, and a cutoff point below 50 was established to identify worsened well-being. Moreover, the WHO-5 has revealed correlations with adverse health conditions among different samples of diverse cultures, including Spanish, Swedish, and Turkish.Citation38,Citation40,Citation41 The WHO-5, overall, has sufficient validity to measure SWB in patients and general populations across study fields, and the scale has been extensively applied in endocrinology, psychiatry, and clinical as well as positive psychology.Citation28,Citation37 Therefore, the WHO-5 has demonstrated adequate effectiveness as both an outcome measure in clinical trials and a screening instrument in community settings.

The WHO-5 has been adapted and validated in Chinese university studentsCitation43 and medical educators.Citation44 Measurement properties of the Chinese version of the WHO-5 (WHO-5-C) were assessed in a recent multi-site study among a sample of university students and reported a stable one-factor structure and adequate internal consistency, factorial validity, construct validity and concurrent validity.Citation43 According to the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) guideline,Citation45 it would be integral to further ascertain test–retest reliability, measurement invariance, measurement error, and recommended cutoff point of the WHO-5-C, intending to select the most appropriate outcome measurement instruments in research and clinical practice. This study aimed to achieve the goal of validating the WHO-5-C as a screening and monitoring instrument of SWB using data collected from healthcare students. This two-wave longitudinal study specifically assessed structural validity, measurement invariance, convergent validity and reliability; identified the optimal cutoff point by conducting receiver operating characteristic (ROC) analysis; and determined cross-sectional and longitudinal measurement invariance with respect to sociodemographic variables and approximately one-week interval, respectively. We hypothesized that the WHO-5 can serve as both an assessment and a screening instrument for routine monitoring of SWB in Chinese samples.

Materials and Methods

Participants and Procedures

This study was a two-wave longitudinal assessment. A convenience sample from a university in Hangzhou, China, participated in a paper-based survey in simplified Chinese from September to November 2022 after providing informed consent. We recruited healthcare students who were able to read and write simplified Chinese and had the ability to complete the response process independently. Respondents were excluded if they: 1) had difficulty understanding or writing Chinese; 2) were on leave or out of school; or 3) declined to participate in the study. The self-report paper-and-pencil survey was administered during breaks between classes or evening self-study and took about 10 minutes to complete. Trained investigators were responsible for implementing the assessment and ensuring onsite quality control. All participants were required to complete two-wave longitudinal measurements with an average interval of 7 days + 3 hours, given that 1) the appropriate interval varies from an hour to a year depending on the task, but generally speaking, a retest interval of 2 to 14 days is usual;Citation46 and 2) reproducibility of health status measures intended for longitudinal use may best be measured at intervals of 1 to 2 weeks.Citation47 Respondents were compensated with 2 CNY (1 CNY ≈ 0.150 US dollars) each. The number of valid questionnaires in this study (N = 343) reached the recommended sample size for factor analyses: 1) a sample size of 300 is considered good;Citation48 2) the recommended minimum sample size ranges from 3 to 20 times the number of variables, and a sample size above 200 is suggested when the variables-to-factors ratio is six.Citation49

The current study was conducted in accordance with the Declaration of HelsinkiCitation50 and approved by the Institutional Review Board of the School of Public Health, Hangzhou Normal University, China (Reference No. 20210014). All healthcare students freely consented to respond to the questionnaires and provided their informed consents before inclusion into the assessments. The authors confirmed full respect and protection of individual privacy rights before, during, and after the data collection and processing.

Measures

Sociodemographic Variables

We collected sociodemographic variables, including gender (male, female), age, major (clinical medicine, preventive medicine), home location (urban, rural, suburban), being only child (yes, no), monthly household income (< 10,000 CNY, ≥ 10,000 CNY), part-time job (yes, no), physical exercise [exercise goals to improve health (yes, no)], hobby (yes, no), frequency of visiting home [frequently (once per week, twice per week, and once per month); occasionally (once per quarter, once per semester, and once per academic year)], and self-reported preferred stress coping strategy (emotion-focused, solution-focused, avoidance coping).

World Health Organization-Five Well-Being Index (WHO-5)

The WHO-5, developed by the WHO, is a brief self-report instrument used to assess SWB. The WHO-5 was translated into simplified Chinese in 2007 and is available on the WHO-5 official website.Citation27,Citation51 Respondents were asked how well each of the five statements applied to them over the last two weeks. The five items of the scale cover positive emotions (in good spirits, feeling relaxed), vitality (feeling active, waking up refreshed and rested), and being interested in things.Citation27 Each item was scored on a six-point Likert scale, ranging from 0 (at no time) to 5 (all the time). Multiplying the raw score by four is usually recommended because quality-of-life-related scales are often converted to a percentage scale.Citation27,Citation28 The final score ranges from 0, representing the worst imaginable well-being, to 100, representing the best imaginable well-being. All items of the WHO-5-C load on a single latent factor with adequate reliability (Cronbach’s α = 0.810–0.850, McDonald’s ω = 0.820–0.860) and validity (comparative fit index [CFI] = 0.974, Tucker-Lewis index [TLI] = 0.947, and root mean square error of approximation [RMSEA] = 0.080).Citation43
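The scoring rule described above can be sketched in a few lines. This is a minimal illustration rather than the authors' scoring script; the function name `who5_percentage_score` and the example responses are hypothetical.

```python
def who5_percentage_score(item_scores):
    """Convert five WHO-5 item responses (each 0-5) to the 0-100 percentage score."""
    if len(item_scores) != 5:
        raise ValueError("The WHO-5 has exactly five items.")
    if any(not 0 <= score <= 5 for score in item_scores):
        raise ValueError("Each item must be scored from 0 to 5.")
    raw_score = sum(item_scores)  # raw score ranges from 0 to 25
    return raw_score * 4          # percentage score ranges from 0 to 100


# Hypothetical respondent: raw score 13, percentage score 52
example_score = who5_percentage_score([3, 2, 3, 2, 3])
print(example_score)
```

On this scale, a raw score of 13 corresponds to a percentage score of 52, so a cutoff expressed as 50 on the percentage scale sits between raw scores of 12 and 13.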

Patient Health Questionnaire-4 (PHQ-4)

The PHQ was designed to facilitate recognition and diagnosis of the most common mental disorders in primary care patients.Citation52 The PHQ-4 is a validated instrument for detecting anxiety and depression two weeks prior to assessment administration. The PHQ-4 consists of two ultra-short scales, one is the Generalized Anxiety Disorder-2 (GAD-2) for anxiety detectionCitation53 and another is the PHQ-2 that reflects depression.Citation54 Each item of the PHQ-4 is scored on a four-point Likert scale, ranging from 0 (not at all) to 3 (nearly every day), with higher scores indicating more severe symptom levels.Citation55,Citation56 The Chinese version of the PHQ-4 (PHQ-4-C) is publicly available and has shown good internal consistency and test–retest reliability (Cronbach’s α = 0.870–0.904, McDonald’s ω = 0.894–0.904, intraclass correlation coefficient [ICC] = 0.697).Citation57,Citation58

Self-Rated Health Questionnaire (SRHQ)

A simple two-item questionnaire was applied to assess the self-perceived health of all participants, with one item estimating their physical health and one assessing their mental health. Each item was rated on a five-point Likert scale, from 1 (excellent) to 5 (extremely poor), with higher scores indicating worse overall health.Citation59,Citation60 The SRHQ was demonstrated to have satisfactory internal consistency and test–retest reliability in our previous studies (Cronbach’s α = 0.706–0.857, ICC = 0.565–0.710).Citation59–61

Statistical Analysis

Data analysis was performed using R (version 4.1.3) and JASP (version 0.16.1). The R packages used in this analysis were “lavaan (0.6–11)”,Citation62 “MBESS (4.9.2)”,Citation63 “irr (0.84.1)”,Citation64 “semTools (0.5–6)”,Citation65 and “pROC (1.18.2)”.Citation66 We matched data according to student ID, and missingness in the present study ranged from 0.29% to 1.46% (< 5%). We used mean imputation for continuous variables and median imputation for categorical variables.Citation67 Means, standard deviations (SDs), skewness, and kurtosis were used to assess multivariate normality. We selected the maximum likelihood estimation (MLE) method to evaluate all confirmatory factor analysis (CFA) results given that 1) the WHO-5-C data did not depart severely from normality (skew < 2, kurtosis < 7), and 2) the number of ordered categories is more than five.Citation68 All measurement properties of the WHO-5-C were assessed in accordance with the requirements of the COSMIN guideline.Citation69
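The imputation step can be illustrated with a short sketch in Python (the study itself used R and JASP). The function name `impute`, the use of `None` for missing values, and the example data are assumptions made only for illustration.

```python
from statistics import mean, median


def impute(values, kind):
    """Fill missing entries (None) with the mean (continuous) or median (categorical codes)."""
    observed = [v for v in values if v is not None]
    fill_value = mean(observed) if kind == "continuous" else median(observed)
    return [fill_value if v is None else v for v in values]


# Hypothetical data: ages as a continuous variable, coded categories as a categorical one
ages_imputed = impute([19, 20, None, 21], "continuous")
codes_imputed = impute([1, 2, None, 2, 3], "categorical")
print(ages_imputed, codes_imputed)
```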

Structural Validity

We assessed structural validity of the WHO-5-C by CFA to evaluate the extent to which the scores reflected underlying dimensions. Since the WHO-5 has been widely recognized as a single-factor scale, we evaluated structural validity of the scale based on a single-factor model. A satisfactory model fit was indicated for CFI and TLI ≥ 0.900, RMSEA ≤ 0.080, and standardized root mean square residual (SRMR) ≤ 0.080.Citation70,Citation71
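The fit criteria above amount to a simple threshold check, sketched below. The helper name `check_fit` is hypothetical; the example values are the baseline CFI, TLI, and SRMR reported in the Results text (the baseline RMSEA is reported only in the table and is therefore omitted here).

```python
# Cutoffs stated in the text: CFI and TLI >= 0.900; RMSEA and SRMR <= 0.080
THRESHOLDS = {
    "CFI": (">=", 0.900),
    "TLI": (">=", 0.900),
    "RMSEA": ("<=", 0.080),
    "SRMR": ("<=", 0.080),
}


def check_fit(indices):
    """Return, for each supplied fit index, whether it meets its cutoff."""
    results = {}
    for name, value in indices.items():
        direction, cutoff = THRESHOLDS[name]
        results[name] = value >= cutoff if direction == ">=" else value <= cutoff
    return results


baseline_fit = check_fit({"CFI": 0.968, "TLI": 0.937, "SRMR": 0.028})
print(baseline_fit)
```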

Measurement Invariance

To assess measurement invariance of the WHO-5-C with regard to demographic variables and time, we conducted a series of multi-group CFA (MGCFA) and longitudinal CFA (LCFA). This method involved progressively constraining parameters to be equal between subgroups (sociodemographic variables) and across time intervals, and then comparing changes in fit indices to determine whether the relationship between observed variables and underlying traits was equivalent.Citation72 Herein, a series of nested models with increasing constraints were established, including the configural (same pattern of factors), metric (same pattern of factors and loadings), scalar (same pattern of factors, loadings, and item thresholds), and strict (same pattern of factors, loadings, item thresholds, and residual variances) models. Measurement invariance was defined as the fit statistic not significantly changing in an iterative procedure of progressively strict constraints. We considered the change in CFI (ΔCFI) to be an applicable metric for measurement invariance, with a change ≤ 0.010 indicating an appropriate measurement invariance.Citation73,Citation74
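The ΔCFI decision rule can be sketched as follows. The CFI values are hypothetical and are not taken from the study's tables, and the sketch assumes that only a drop in CFI of more than 0.010 between adjacent nested models counts against invariance; the function name `delta_cfi_steps` is likewise an assumption.

```python
MODEL_ORDER = ["configural", "metric", "scalar", "strict"]


def delta_cfi_steps(cfi_by_model, tolerance=0.010):
    """Compare each nested model with the previous one and flag CFI drops above the tolerance."""
    steps = []
    for previous, current in zip(MODEL_ORDER, MODEL_ORDER[1:]):
        delta = round(cfi_by_model[current] - cfi_by_model[previous], 3)
        invariant = delta >= -tolerance  # assumption: only a decrease in CFI is penalized
        steps.append((current, delta, invariant))
    return steps


# Hypothetical CFI values for one grouping variable
hypothetical_cfi = {"configural": 0.980, "metric": 0.978, "scalar": 0.974, "strict": 0.975}
steps = delta_cfi_steps(hypothetical_cfi)
for model, delta, invariant in steps:
    print(model, delta, invariant)
```

In the study, the models themselves were fitted in R; this sketch covers only the comparison of fit indices once those models have been estimated.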

Convergent Validity

To evaluate convergent validity of the WHO-5-C, we calculated Pearson correlation coefficients of the WHO-5-C with the SRHQ and PHQ-4-C, as these instruments are developed to assess an individual’s subjective feelings, considering mental and physical health often influence and accompany each other.Citation75 We additionally analyzed inter-item and item-total correlations. We hypothesized that the WHO-5-C would be moderately correlated with the PHQ-4-C and SRHQ in expected directions, given that these instruments measure interrelated constructs. The absolute magnitude of correlation coefficient was categorized as very strong correlation (r > 0.900), strong correlation (r = 0.700–0.900), moderate correlation (r = 0.400–0.700), and weak correlation (r < 0.400).Citation76
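The categorization of correlation magnitude can be written as a small helper. The function name `correlation_strength` is hypothetical, and the assignment of boundary values (exactly 0.400 or 0.700) to the higher band is an assumption, since the text does not specify it. The example values are the Time 1 correlations reported in the Results.

```python
def correlation_strength(r):
    """Classify the absolute magnitude of a correlation coefficient."""
    magnitude = abs(r)
    if magnitude > 0.900:
        return "very strong"
    if magnitude >= 0.700:
        return "strong"
    if magnitude >= 0.400:
        return "moderate"
    return "weak"


srhq_label = correlation_strength(-0.561)  # WHO-5-C with SRHQ at Time 1
phq4_label = correlation_strength(-0.691)  # WHO-5-C with PHQ-4-C at Time 1
print(srhq_label, phq4_label)
```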

Reliability

We assessed internal consistency of the WHO-5-C using Cronbach’s α and McDonald’s ω coefficients, with coefficients ≥ 0.700 considered satisfactory.Citation77 To measure test–retest reliability, we used the ICC between the two measurement occasions, with coefficients below 0.400, between 0.400 and 0.590, between 0.600 and 0.740, and of 0.750 or greater indicating poor, fair, good, and excellent reliability, respectively.Citation78 Moreover, the standard error of measurement (SEM) was calculated as additional evidence of measurement accuracy when evaluating test–retest reliability.Citation79
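A minimal sketch of the ICC interpretation bands and an SEM calculation follows. The text does not state which SEM formula was used; the sketch assumes the common form SEM = SD × √(1 − ICC). The function names and the SD of 20 are hypothetical, while the ICC of 0.803 is the value reported for the WHO-5-C.

```python
import math


def icc_category(icc):
    """Map an ICC to the poor / fair / good / excellent bands described in the text."""
    if icc < 0.400:
        return "poor"
    if icc < 0.600:
        return "fair"
    if icc < 0.750:
        return "good"
    return "excellent"


def standard_error_of_measurement(sd, icc):
    """Assumed formula: SEM = SD * sqrt(1 - ICC)."""
    return sd * math.sqrt(1.0 - icc)


icc_label = icc_category(0.803)                    # reported ICC of the WHO-5-C
sem = standard_error_of_measurement(20.0, 0.803)   # SD of 20 is a hypothetical value
print(icc_label, round(sem, 2))
```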

Sensitivity and Specificity

To evaluate the WHO-5-C as a screening scale for SWB, the area under the ROC curve (AUC), optimal cutoff point, sensitivity, and specificity were calculated using ROC analysis. As the PHQ-2 is an ultra-brief and useful screening instrument for depression with excellent operating characteristics, a PHQ-2 score ≥ 3 was used as the criterion to determine the optimal cutoff point of the WHO-5-C.Citation52 The optimal cutoff point was taken as the point closest to the top left-hand corner of the ROC curve in the baseline data and was then examined in the follow-up data. The AUC value ranges from 0.500 to 1.000, with higher values indicating better prediction. A value greater than or equal to 0.800 was regarded as good discrimination.Citation80,Citation81
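The "closest to the top left-hand corner" criterion can be sketched in plain Python (the study used the pROC package in R). The data below are made-up toy values. The sketch also assumes one particular framing: respondents with PHQ-2 < 3 form the reference group with adequate well-being, and a WHO-5-C score at or above the cutoff counts as a positive screen. The function name `closest_to_top_left` is hypothetical.

```python
import math


def closest_to_top_left(who5_scores, phq2_scores, phq2_threshold=3):
    """Pick the WHO-5 cutoff whose ROC point lies nearest to (FPR = 0, TPR = 1)."""
    well = [p < phq2_threshold for p in phq2_scores]  # assumed reference standard
    n_well = sum(well)
    n_not_well = len(well) - n_well
    best = None
    for cutoff in sorted(set(who5_scores)):
        true_pos = sum(1 for s, w in zip(who5_scores, well) if w and s >= cutoff)
        true_neg = sum(1 for s, w in zip(who5_scores, well) if not w and s < cutoff)
        sensitivity = true_pos / n_well
        specificity = true_neg / n_not_well
        distance = math.hypot(1 - sensitivity, 1 - specificity)
        if best is None or distance < best["distance"]:
            best = {"cutoff": cutoff, "sensitivity": sensitivity,
                    "specificity": specificity, "distance": distance}
    return best


# Toy data only: ten hypothetical respondents
toy_who5 = [20, 28, 36, 44, 48, 52, 56, 64, 72, 80]
toy_phq2 = [5, 4, 3, 1, 3, 2, 1, 0, 3, 0]
best_point = closest_to_top_left(toy_who5, toy_phq2)
print(best_point)
```

A cutoff chosen this way on baseline data would then be applied unchanged to the follow-up data to check how its sensitivity and specificity hold up, as described above.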

Results

Sociodemographic Variables

The sample consisted of 216 females (62.974%) with an average age of 19.650 (SD = 1.414) years, ranging from 17 to 23 years. Tables 1 and S1 summarize the participants’ sociodemographic variables and scores of the measured scales at baseline and follow-up.

Table 1 Sociodemographic Variables (N = 343)

Structural Validity

Structural validity of the WHO-5-C was explored using CFA based on a one-factor model. As shown in Table 2, the fit indices of both baseline (CFI = 0.968, TLI = 0.937, SRMR = 0.028) and follow-up (CFI = 0.980, TLI = 0.961, SRMR = 0.020) analyses indicated that the WHO-5-C had a satisfactory fit with a single-factor structure.

Table 2 Model Fit Indices of the Single-Factor Model for the WHO-5-C (N = 343)

Measurement Invariance

Cross-sectional measurement invariance of the WHO-5-C was analyzed across healthcare students’ sociodemographic variables. Tables 3 and S2 show that the WHO-5-C fit well in the four nested models among all subgroups, with all CFI and TLI values greater than 0.900 and SRMR values below 0.080. Meanwhile, MGCFA tests demonstrated that all ΔCFI values (ΔCFI = −0.009–0.003) were within the recommended range, indicating that the WHO-5-C had acceptable measurement invariance across different sociodemographic variables.

Table 3 Cross-Sectional Measurement Invariances of the WHO-5-C (N = 343)

In LCFA tests of the WHO-5-C, all fit indices were in line with the proposed thresholds (CFI = 0.974–0.980, TLI = 0.968–0.976, RMSEA = 0.070–0.081, SRMR = 0.022–0.027), and there were no substantial CFI changes in each nested model (ΔCFI = −0.006–0.000), supporting that the WHO-5-C had good longitudinal measurement invariance (Table 4).

Table 4 Longitudinal Measurement Invariances of the WHO-5-C (N = 343)

Convergent Validity

As shown in Figure 1, inter-item coefficients of the WHO-5-C ranged from 0.535 to 0.844 and item-total coefficients ranged from 0.806 to 0.935, indicating moderate to very strong correlations. Moderate correlations were observed between the WHO-5-C and SRHQ (Time 1: r = −0.561, Time 2: r = −0.573), as well as the PHQ-4-C (Time 1: r = −0.691, Time 2: r = −0.675), demonstrating adequate convergent validity.

Figure 1 Inter-item and item-total correlations between the Chinese WHO-5, PHQ-4, and SRHQ (N=343).

Abbreviations: WHO-5, World Health Organization-Five Well-Being Index; WHO01-05, item 01–05 of the WHO-5; PHQ-2, Patient Health Questionnaire-2; GAD-2, Generalized Anxiety Disorder-2; PHQ-4, Patient Health Questionnaire-4; SRHQ, Self-Rated Health Questionnaire; T1, Time 1; T2, Time 2.

Reliability

Cronbach’s α and McDonald’s ω coefficients of the WHO-5-C at baseline were 0.907 (range of α-if-item-deleted = 0.865–0.905) and 0.908 (range of α-if-item-deleted = 0.865–0.905), respectively; and at follow-up were 0.934 (range of α-if-item-deleted = 0.906–0.930) and 0.935 (range of α-if-item-deleted = 0.904–0.931), respectively. The WHO-5-C demonstrated excellent test–retest reliability, with an ICC of 0.803 (range of ICC-if-item-deleted = 0.785–0.799), and stability of the scale was also demonstrated by SEM indices. No significant increase in indices was observed when any item was deleted. Similarly, the SRHQ and PHQ-4-C showed good internal consistency and test–retest reliability (Table 5).

Table 5 Internal Consistency and Test–Retest Reliability of the WHO-5-C, SRHQ, and PHQ-4-C (N = 343)

Sensitivity and Specificity

Based on baseline data, ROC curve analysis indicated that the WHO-5-C had an AUC value of 0.882, and the optimal cutoff point of ≥ 50 had sensitivity of 0.782 and specificity of 0.857 for screening satisfactory SWB in healthcare students (Table 6 and Figure 2). Sensitivity and specificity of the WHO-5-C were 0.803 and 0.762, respectively, in follow-up data when the cutoff point of 50 was applied as the threshold for predicting SWB (Table 6).

Table 6 Sensitivity and Specificity of the WHO-5-C for Identifying Well-Being in Healthcare Students (N = 343)

Figure 2 ROC curve of the Chinese WHO-5 for well-being (N=343).

Abbreviations: WHO-5, World Health Organization-Five Well-Being Index; ROC, receiver operating characteristic; AUC, area under the curve.

Discussion

The adaptation of existing well-being instruments in Chinese populations mostly focuses on construct validity and reliability, but rarely addresses measurement invariance and clinical diagnostic capability. As an example, the WEMWBS is an instrument of well-being considering positive aspects of mental health, and although the Chinese version shows a stable factor structure and reliability across populations, its measurement invariance and cutoff value are not available.Citation23,Citation82,Citation83 Herein, the current study comprehensively evaluated measurement properties of the WHO-5-C in healthcare students, intending to introduce a well-being measure supported by comprehensive evidence. To our knowledge, this is the first study to explore measurement invariance of the WHO-5-C and determine a cutoff point for SWB identification in healthcare students. We provide strong evidence of a one-factor model with adequate validity and reliability, as well as measurement invariance, supporting its stability across different sociodemographic variables and time intervals. ROC analysis supports the WHO-5-C as a sensitive and specific instrument for assessing positive mental and physical well-being, with a recommended cutoff point of ≥ 50 indicating adequate SWB for Chinese healthcare students. Overall, our findings highlight the effectiveness of the WHO-5-C and its validity in assessing SWB among healthcare students.

Structural Validity

CFA results corroborated general consensus of previous studies that the WHO-5 had a stable single-factor structure, which has also been found in most language versions: Brazilian,Citation30 Japanese,Citation34 Polish,Citation36 Sinhala,Citation37 Spanish,Citation38 Swedish,Citation40 and Turkish.Citation41 The WHO-5-C had adequate structural validity, and these findings were also found in another study in multiple samples.Citation43 In summary, our study adds to the growing body of evidence supporting structural validity of the WHO-5.

Measurement Invariance

Our study has validated cross-cultural validity of the WHO-5-C by demonstrating adequate measurement invariance across sociodemographic variables and time intervals. Adequate measurement invariance ensured that the scale had consistent measurement properties in a heterogeneous group and that the targeted construct was stably assessed by the scale.Citation84 Although some fit indices of the MGCFA tests did not meet expectations, likely due to small degrees of freedom,Citation85,Citation86 ΔCFI, which is recognized as the most pivotal indicator for assessing measurement invariance of a scale, was consistently within the adequate range across the four progressively nested models.Citation74 Only limited evidence for the WHO-5 measurement invariance was found in extant literature. The Sinhala version supported measurement invariance across gender in all four nested models.Citation37 Our findings extend previous ones in supporting satisfactory measurement invariance of the WHO-5-C in healthcare students both across different sociodemographic variables and over time.

Convergent Validity

Our findings showed that the WHO-5-C had moderate negative correlations with both the SRHQ and PHQ-4-C, indicating sufficient convergent validity. Associations of comparable strength with related constructs of mental and physical health have been reported in community and patient samples,Citation33,Citation87,Citation88 and in special populations such as healthcare workers in multi-national studies.Citation89 Our evidence for construct validity of the WHO-5 in a Chinese population adds support to its cross-cultural applicability as a valid instrument for measuring SWB in diverse domains and groups.

Reliability

Consistent with previous studies of other language versions of the WHO-5,Citation29,Citation30,Citation33,Citation37–39,Citation41,Citation90 Cronbach’s α, McDonald’s ω and ICC of the WHO-5-C across two separate time points were greater than 0.800 in our study, illustrating its high internal consistency and test–retest reliability. Despite the brevity of the WHO-5-C, our data further support coherence across items and stability over time in the Chinese version.

Sensitivity and Specificity

Our study showed that the WHO-5-C had a reasonable ability to distinguish respondents with positive psychological components when the cutoff point was ≥ 50 (AUC = 0.882), suggesting that the scale is a suitable screening instrument for measuring adequate SWB in Chinese healthcare students. Consistent with other studies in Asia, a standard cutoff point of ≥ 13 had an excellent sensitivity/specificity trade-off in the Arabic and Japanese versions, which meant that respondents with a positive mindset can be detected by this cutoff criterion.Citation29,Citation34 In the same vein, in the Brazilian version a reduction of 50% indicates reduced wellness, and further clinical diagnosis for depression is recommended.Citation30 Other cutoff points for different purposes and samples have been identified in other language versions of the WHO-5. The German version of the WHO-5 was applied to screen depression in an elderly population (AUC = 0.886), and a standard cutoff point of ≥ 16 was sufficiently sensitive and specific in predicting a status of optimal well-being.Citation32 Such differences may arise because the definition and feeling of wellness differ notably across stages of life owing to gains and losses in health assets, social networks, economic resources and family structures.Citation91,Citation92 A systematic review of the WHO-5 concluded that the scale had been applied successfully as a generic scale for well-being across numerous fields, and a score below 50 was defined as an obvious reduction in well-being.Citation28 In brief, the WHO-5 has appropriate validity both as a screening instrument for subjective psychosomatic well-being and as an outcome measurement in clinical trials.

Strengths and Limitations

The present study contributes to literature on utilization of the WHO-5 in Chinese healthcare students by providing empirical findings on its measurement properties and diagnostic performance. The study supports the application of the WHO-5-C in young adults with adequate validity and reliability, laying a solid foundation for its utilization in China. To our knowledge, our study is the first to verify cutoff point and to comprehensively examine measurement invariance of the WHO-5-C among healthcare students in terms of sociological characteristics and measurement times. As such, we provide a precise threshold for evaluating SWB state for clinical and research purposes in young adults, a vulnerable group for onset of psychosomatic health problems and developmental challenges.

This study is admittedly limited in that the participants were all young healthcare students, making it difficult to generalize the findings to other age groups or disciplines. Given the promising findings this study offers, it would be highly valuable to further investigate psychometric properties of the WHO-5-C in various samples or a nationally representative sample and to validate the optimal cutoff point for different purposes and in external samples. Despite widespread utilization of the PHQ-4-C, an instrument of positive SWB should be considered as a complementary measure to capture the comprehensive and multidimensional construct of wellness.Citation1 Moreover, although the WHO-5-C is freely available, it is essential to examine its content validity with a view to providing credible evidence for the sustainable implementation of the scale in China.

Conclusion

In conclusion, this study supports the WHO-5-C as a reliable and valid instrument for capturing SWB in healthcare students, with satisfactory measurement invariance across different sociodemographic variables and over time, and a cutoff point of ≥ 50 for identifying an adequate SWB state. Together with the family of studies on the WHO-5 worldwide, our evidence enables research and clinical communities to apply the scale for screening well-being in public health and primary care settings.

Data Sharing Statement

Not available.

Author Contributions

All authors made a significant contribution to the work reported, whether that is in the conception, study design, execution, acquisition of data, analysis and interpretation, or in all these areas; took part in drafting, revising or critically reviewing the article; gave final approval of the version to be published; have agreed on the journal to which the article has been submitted; and agree to be accountable for all aspects of the work.

Disclosure

The authors report no conflicts of interest in this work.

Acknowledgments

The authors thank the study participants and the research assistants for their time. The authors are indebted to three anonymous reviewers and a handling editor for their insightful views and constructive comments.

Additional information

Funding

This study was supported by the Medical Research Fund of Zhejiang Province, Grant No. 2023RC073 and the Research Initiation Fund of Hangzhou Normal University, Grant No. RWSK20201003.

References

  • Beddington J, Cooper CL, Field J, et al. The mental wealth of nations. Nature. 2008;455(7216):1057–1060. doi:10.1038/4551057a
  • Faruk MO, Alam F, Chowdhury KUA, Soron TR. Validation of the Bangla WHO-5 Well-being Index. Glob Ment Health. 2021;8:e26. doi:10.1017/gmh.2021.26
  • Finan PH, Garland EL. The role of positive affect in pain and its treatment. Clin J Pain. 2015;31(2):177–187. doi:10.1097/ajp.0000000000000092
  • Rasmussen HN, Scheier MF, Greenhouse JB. Optimism and physical health: a meta-analytic review. Ann Behav Med. 2009;37(3):239–256. doi:10.1007/s12160-009-9111-x
  • Chida Y, Steptoe A. Positive psychological well-being and mortality: a quantitative review of prospective observational studies. Psychosom Med. 2008;70(7):741–756. doi:10.1097/PSY.0b013e31818105ba
  • Bolier L, Haverman M, Westerhof GJ, Riper H, Smit F, Bohlmeijer E. Positive psychology interventions: a meta-analysis of randomized controlled studies. BMC Public Health. 2013;13:119. doi:10.1186/1471-2458-13-119
  • Kubzansky LD, Huffman JC, Boehm JK, et al. Positive psychological well-being and cardiovascular disease: JACC Health Promotion Series. J Am Coll Cardiol. 2018;72(12):1382–1396. doi:10.1016/j.jacc.2018.07.042
  • Howell RT, Kern ML, Lyubomirsky S. Health benefits: meta-analytically determining the impact of well-being on objective health outcomes. Health Psychol Rev. 2007;1(1):83–136. doi:10.1080/17437190701492486
  • Liu X, Ping S, Gao W. Changes in undergraduate students’ psychological well-being as they experience university life. Int J Environ Res Public Health. 2019;16(16):2864. doi:10.3390/ijerph16162864
  • Holm-Hadulla RM, Klimov M, Juche T, Möltner A, Herpertz SC. Well-being and mental health of students during the COVID-19 pandemic. Psychopathology. 2021;54(6):291–297. doi:10.1159/000519366
  • Sheldon E, Simmonds-Buckley M, Bone C, et al. Prevalence and risk factors for mental health problems in university undergraduate students: a systematic review with meta-analysis. J Affect Disord. 2021;287:282–292. doi:10.1016/j.jad.2021.03.054
  • Sattar K, Yusoff MSB, Arifin WN, Yasin MAM, Nor MZM. A scoping review on the relationship between mental wellbeing and medical professionalism. Med Educ Online. 2023;28(1):2165892. doi:10.1080/10872981.2023.2165892
  • Wang CY, Pan RY, Wan XY, et al. Immediate psychological responses and associated factors during the initial stage of the 2019 coronavirus disease (COVID-19) epidemic among the general population in China. Int J Environ Res Public Health. 2020;17(5):1729. doi:10.3390/ijerph17051729
  • Cao WJ, Fang ZW, Hou GQ, et al. The psychological impact of the COVID-19 epidemic on college students in China. Psychiatry Res. 2020;287:112934. doi:10.1016/j.psychres.2020.112934
  • Moreira de Sousa J, Moreira CA, Telles-Correia D. Anxiety, depression and academic performance: a study amongst Portuguese medical students versus non-medical students. Acta Med Port. 2018;31(9):454–462. doi:10.20344/amp.9996
  • Quek TT, Tam WW, Tran BX, et al. The global prevalence of anxiety among medical students: a meta-analysis. Int J Environ Res Public Health. 2019;16(15):2735. doi:10.3390/ijerph16152735
  • Zeng W, Chen R, Wang X, Zhang Q, Deng W. Prevalence of mental health problems among medical students in China: a meta-analysis. Medicine. 2019;98(18):e15337. doi:10.1097/md.0000000000015337
  • Cooke PJ, Melchert TP, Connor K. Measuring well-being: a review of instruments. Couns Psychol. 2016;44(5):730–757. doi:10.1177/0011000016633507
  • Zigmond AS, Snaith RP. The Hospital Anxiety and Depression Scale. Acta Psychiatr Scand. 1983;67(6):361–370. doi:10.1111/j.1600-0447.1983.tb09716.x
  • Melzack R. The short-form McGill Pain Questionnaire. Pain. 1987;30(2):191–197. doi:10.1016/0304-3959(87)91074-8
  • McIntyre RS, Alda M, Baldessarini RJ, et al. The clinical characterization of the adult patient with bipolar disorder aimed at personalization of management. World Psychiatry. 2022;21(3):364–387. doi:10.1002/wps.20997
  • Tennant R, Hiller L, Fishwick R, et al. The Warwick-Edinburgh Mental Well-being Scale (WEMWBS): development and UK validation. Health Qual Life Out. 2007;5(1):63. doi:10.1186/1477-7525-5-63
  • Taggart F, Friede T, Weich S, Clarke A, Johnson M, Stewart-Brown S. Cross cultural evaluation of the Warwick-Edinburgh Mental Well-being Scale (WEMWBS)-a mixed methods study. Health Qual Life Out. 2013;11:27. doi:10.1186/1477-7525-11-27
  • Diener E. Subjective well-being: the science of happiness and a proposal for a national index. Am Psychol. 2000;55(1):34–43. doi:10.1037/0003-066X.55.1.34
  • Smyth R, Nielsen I, Zhai Q. Personal Well-being in urban China. Soc Indic Res. 2010;95(2):231–251. doi:10.1007/s11205-009-9457-2
  • Chan DW, Chan L-K, Sun X. Developing a brief version of Ryff’s scale to assess the psychological well-being of adolescents in Hong Kong. Eur J Psychol Assess. 2019;35(3):414–422. doi:10.1027/1015-5759/a000403
  • Wellbeing measures in primary health care/the DepCare Project: report on a WHO meeting: Stockholm, Sweden, 12–13 February 1998. World Health Organization. Regional Office for Europe; 1998. Available from: https://iris.who.int/handle/10665/349766. Accessed January 30, 2021.
  • Topp CW, Østergaard SD, Søndergaard S, Bech P. The WHO-5 Well-Being Index: a systematic review of the literature. Psychother Psychosom. 2015;84(3):167–176. doi:10.1159/000376585
  • Sibai AM, Chaaya M, Tohme RA, Mahfoud Z, Al-Amin H. Validation of the Arabic version of the 5-item WHO Well Being Index in elderly population. Int J Geriatr Psychiatry. 2009;24(1):106–107. doi:10.1002/gps.2079
  • de Souza CM, Hidalgo MP. World Health Organization 5-item well-being index: validation of the Brazilian Portuguese version. Eur Arch Psychiatry Clin Neurosci. 2012;262(3):239–244. doi:10.1007/s00406-011-0255-x
  • Lee GL, Fan GK, Chan SW. Validation of Chinese and English versions of the Holistic Well-being Scale in patients with cancer. Support Care Cancer. 2015;23(12):3563–3571. doi:10.1007/s00520-015-2736-3
  • Bonsignore M, Barkow K, Jessen F, Heun R. Validity of the five-item WHO Well-Being Index (WHO-5) in an elderly population. Eur Arch Psychiatry Clin Neurosci. 2001;251(Suppl 2):27–31. doi:10.1007/bf03035123
  • Guðmundsdóttir HB, Olason DP, Guðmundsdóttir DG, Sigurðsson JF. A psychometric evaluation of the Icelandic version of the WHO-5. Scand J Psychol. 2014;55(6):567–572. doi:10.1111/sjop.12156
  • Awata S, Bech P, Yoshida S, et al. Reliability and validity of the Japanese version of the World Health Organization-Five Well-Being Index in the context of detecting depression in diabetic patients. Psychiatry Clin Neurosci. 2007;61(1):112–119. doi:10.1111/j.1440-1819.2007.01619.x
  • Suhaimi AF, Makki SM, Tan KA, Silim UA, Ibrahim N. Translation and validation of the Malay version of the WHO-5 Well-Being Index: reliability and validity evidence from a sample of type 2 diabetes mellitus patients. Int J Environ Res Public Health. 2022;19(7):4415. doi:10.3390/ijerph19074415
  • Cichoń E, Kiejna A, Kokoszka A, et al. Validation of the Polish version of WHO-5 as a screening instrument for depression in adults with diabetes. Diabetes Res Clin Pract. 2020;159:107970. doi:10.1016/j.diabres.2019.107970
  • Perera BPR, Jayasuriya R, Caldera A, Wickremasinghe AR. Assessing mental well-being in a Sinhala speaking Sri Lankan population: validation of the WHO-5 well-being index. Health Qual Life Out. 2020;18(1):305. doi:10.1186/s12955-020-01532-8
  • Lucas-Carrasco R. Reliability and validity of the Spanish version of the World Health Organization-Five Well-Being Index in elderly. Psychiatry Clin Neurosci. 2012;66(6):508–513. doi:10.1111/j.1440-1819.2012.02387.x
  • Chongwo E, Ssewanyana D, Nasambu C, et al. Validation of a Swahili version of the World Health Organization 5-item well-being index among adults living with HIV and epilepsy in rural coastal Kenya. Glob Health Res Policy. 2018;3:26. doi:10.1186/s41256-018-0081-z
  • Löve J, Andersson L, Moore CD, Hensing G. Psychometric analysis of the Swedish translation of the WHO well-being index. Qual Life Res. 2014;23(1):293–297. doi:10.1007/s11136-013-0447-0
  • Eser E, Çevik C, Baydur H, et al. Reliability and validity of the Turkish version of the WHO-5, in adults and older adults for its use in primary care settings. Prim Health Care Res Dev. 2019;20:e100. doi:10.1017/s1463423619000343
  • de Wit M, Pouwer F, Gemke RJ. Validation of the WHO-5 Well-Being Index in adolescents with type 1 diabetes. Diabetes Care. 2007;30(8):2003–2006. doi:10.2337/dc07-0447
  • Fung SF, Kong CYW, Liu YM, et al. Validity and psychometric evaluation of the Chinese version of the 5-Item WHO Well-Being Index. Front Public Health. 2022;10:872436. doi:10.3389/fpubh.2022.872436
  • Chan L, Liu RKW, Lam TP, Chen JY, Tipoe GL, Ganotice FA. Validation of the World Health Organization Well-Being Index (WHO-5) among medical educators in Hong Kong: a confirmatory factor analysis. Med Educ Online. 2022;27(1):2044635. doi:10.1080/10872981.2022.2044635
  • Saracci R. The World Health Organisation needs to reconsider its definition of health. BMJ. 1997;314(7091):1409–1410. doi:10.1136/bmj.314.7091.1409
  • Streiner DL, Norman GR, Cairney J. Health Measurement Scales: A Practical Guide to Their Development and Use. Oxford University Press; 2014.
  • Deyo RA, Diehr P, Patrick DL. Reproducibility and responsiveness of health status measures. Statistics and strategies for evaluation. Control Clin Trials. 1991;12(4 Suppl):142S–158S. doi:10.1016/s0197-2456(05)80019-4
  • Kline P. An Easy Guide to Factor Analysis. New York: Routledge; 1994.
  • Mundfrom DJ, Shaw DG, Ke TL. Minimum sample size recommendations for conducting factor analyses. Int J Test. 2005;5(2):159–168. doi:10.1207/s15327574ijt0502_4
  • World Medical Association. World Medical Association Declaration of Helsinki: ethical principles for medical research involving human subjects. JAMA. 2013;310(20):2191–2194. doi:10.1001/jama.2013.281053
  • The Chinese version of the WHO (Five) in characters used in PR China. Psychiatric Research Unit, WHO Collaborating Center for Mental Health. Available from: https://www.psykiatri-regionh.dk/who-5/Documents/WHO5_Chinese_PR.pdf. Accessed January 30, 2021.
  • Spitzer RL, Williams JBW, Kroenke K. Instructions for Patient Health Questionnaire (PHQ) and GAD-7 Measures. Available from: https://www.phqscreeners.com/. Accessed January 30, 2021.
  • Sapra A, Bhandari P, Sharma S, Chanpura T, Lopp L. Using Generalized Anxiety Disorder-2 (GAD-2) and GAD-7 in a Primary Care Setting. Cureus. 2020;12(5):e8224. doi:10.7759/cureus.8224
  • Kroenke K, Spitzer RL, Williams JB. The Patient Health Questionnaire-2: validity of a two-item depression screener. Med Care. 2003;41(11):1284–1292. doi:10.1097/01.Mlr.0000093487.78664.3c
  • Löwe B, Wahl I, Rose M, et al. A 4-item measure of depression and anxiety: validation and standardization of the Patient Health Questionnaire-4 (PHQ-4) in the general population. J Affect Disord. 2010;122(1–2):86–95. doi:10.1016/j.jad.2009.06.019
  • Kroenke K, Spitzer RL, Williams JB, Löwe B. An ultra-brief screening scale for anxiety and depression: the PHQ-4. Psychosomatics. 2009;50(6):613–621. doi:10.1176/appi.psy.50.6.613
  • Luo Y, Fei S, Gong B, Sun T, Meng R. Understanding the mediating role of anxiety and depression on the relationship between perceived stress and sleep quality among health care workers in the COVID-19 response. Nat Sci Sleep. 2021;13:1747–1758. doi:10.2147/nss.S313258
  • Meng R, Dong L, Dzierzewski JM, et al. The RU_SATED as a measure of sleep health: cross-cultural adaptation and validation in Chinese healthcare students. BMC Psychology. 2023;11(1):200. doi:10.1186/s40359-023-01203-5
  • Zhu Y, Jiang C, Yang Y, et al. Depression and anxiety mediate the association between sleep quality and self-rated health in healthcare students. Behav Sci. 2023;13(2):82. doi:10.3390/bs13020082
  • Jiang C, Ma H, Luo Y, et al. Validation of the Chinese version of the Perceived Stress Scale-10 integrating exploratory graph analysis and confirmatory factor analysis. Gen Hosp Psychiatry. 2023;84:194–202. doi:10.1016/j.genhosppsych.2023.07.008
  • Jiang C, Zhu Y, Luo Y, et al. Validation of the Chinese version of the Rosenberg Self-Esteem Scale: evidence from a three-wave longitudinal study. BMC Psychology. 2023;11(1):345. doi:10.1186/s40359-023-01293-1
  • Rosseel Y. Lavaan: an R package for structural equation modeling. J Stat Softw. 2012;48:1–36. doi:10.18637/jss.v048.i02
  • Kelley K. MBESS: the MBESS R package. 2022. Available from: https://CRAN.R-project.org/package=MBESS. Accessed January 30, 2023.
  • Gamer M, Lemon J, Fellows I, Singh P. irr: various coefficients of interrater reliability and agreement. 2019. Available from: https://CRAN.R-project.org/package=irr. Accessed January 30, 2023.
  • Jorgensen TD, Pornprasertmanit S, Schoemann AM, Rosseel Y. Useful tools for structural equation modeling. 2022. Available from: https://CRAN.R-project.org/package=semTools. Accessed January 30, 2023.
  • Sachs MC. plotROC: a tool for plotting ROC curves. J Stat Softw. 2017;79(2):1–19. doi:10.18637/jss.v079.c02
  • Dziura JD, Post LA, Zhao Q, Fu Z, Peduzzi P. Strategies for dealing with missing data in clinical trials: from design to analysis. Yale J Biol Med. 2013;86(3):343–358.
  • Finney SJ, DiStefano C. Nonnormal and Categorical Data in Structural Equation Models. In: Structural Equation Modeling: A Second Course. Greenwich, CT: Information Age; 2006.
  • Mokkink LB, Terwee CB, Patrick DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63(7):737–745. doi:10.1016/j.jclinepi.2010.02.006
  • Hair JF, Black WC, Babin BJ, Anderson RE. Multivariate Data Analysis: Pearson New International Edition. 7th ed. London, UK: Pearson Higher Education; 2014.
  • Kline RB. Principles and Practice of Structural Equation Modeling. 4th ed. New York, NY, USA: Guilford Publications; 2016.
  • Edwards MC, Houts CR, Wirth RJ. Measurement invariance, the lack thereof, and modeling change. Qual Life Res. 2018;27(7):1735–1743. doi:10.1007/s11136-017-1673-7
  • Cheung GW, Rensvold RB. Evaluating goodness-of-fit indexes for testing measurement invariance. Struct Equ Modeling. 2002;9(2):233–255. doi:10.1207/S15328007SEM0902_5
  • Putnick DL, Bornstein MH. Measurement invariance conventions and reporting: the state of the art and future directions for psychological research. Dev Rev. 2016;41:71–90. doi:10.1016/j.dr.2016.06.004
  • Doan T, Ha V, Strazdins L, Chateau D. Healthy minds live in healthy bodies - effect of physical health on mental health: evidence from Australian longitudinal data. Curr Psychol. 2022;42:18702–18713. doi:10.1007/s12144-022-03053-7
  • Schober P, Boer C, Schwarte LA. Correlation coefficients: appropriate use and interpretation. Anesth Analg. 2018;126(5):1763–1768. doi:10.1213/ane.0000000000002864
  • DeVon HA, Block ME, Moyle-Wright P, et al. A psychometric toolbox for testing validity and reliability. J Nurs Scholarsh. 2007;39(2):155–164. doi:10.1111/j.1547-5069.2007.00161.x
  • Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15(2):155–163. doi:10.1016/j.jcm.2016.02.012
  • Weir JP. Quantifying test-retest reliability using the intraclass correlation coefficient and the SEM. J Strength Cond Res. 2005;19(1):231–240. doi:10.1519/15184.1
  • Walter SD. The partial area under the summary ROC curve. Stat Med. 2005;24(13):2025–2040. doi:10.1002/sim.2103
  • Zweig MH, Campbell G. Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clin Chem. 1993;39(4):561–577. doi:10.1093/clinchem/39.4.561
  • Ng SS, Lo AW, Leung TK, et al. Translation and validation of the Chinese version of the short Warwick-Edinburgh Mental Well-being Scale for patients with mental illness in Hong Kong. East Asian Arch Psychiatry. 2014;24(1):3–9.
  • Dong A, Chen X, Zhu L, et al. Translation and validation of a Chinese version of the Warwick–Edinburgh Mental Well-being Scale with undergraduate nursing trainees. J Psychiatr Ment Health Nurs. 2016;23(9–10):554–560. doi:10.1111/jpm.12344
  • Chen FF. Sensitivity of goodness of fit indexes to lack of measurement invariance. Struct Equ Modeling. 2007;14(3):464–504. doi:10.1080/10705510701301834
  • Lai K, Green SB. The problem with having two watches: assessment of fit when RMSEA and CFI disagree. Multivariate Behav Res. 2016;51(2–3):220–239. doi:10.1080/00273171.2015.1134306
  • Shi D, DiStefano C, Maydeu-Olivares A, Lee T. Evaluating SEM model fit with small degrees of freedom. Multivariate Behav Res. 2022;57(2–3):179–207. doi:10.1080/00273171.2020.1868965
  • Hansen CP, Amiri M. Combined detection of depression and anxiety in epilepsy patients using the Neurological Disorders Depression Inventory for Epilepsy and the World Health Organization well-being index. Seizure. 2015;33:41–45. doi:10.1016/j.seizure.2015.10.008
  • Mergl R, Seidscheck I, Allgaier AK, Möller HJ, Hegerl U, Henkel V. Depressive, anxiety, and somatoform disorders in primary care: prevalence and recognition. Depress Anxiety. 2007;24(3):185–195. doi:10.1002/da.20192
  • Lara-Cabrera ML, Betancort M, Muñoz-Rubilar A, Rodríguez-Novo N, Bjerkeset O, De Las Cuevas C. Psychometric properties of the WHO-5 Well-Being Index among nurses during the COVID-19 pandemic: a cross-sectional study in three countries. Int J Environ Res Public Health. 2022;19(16):10106. doi:10.3390/ijerph191610106
  • Awata S, Bech P, Koizumi Y, et al. Validity and utility of the Japanese version of the WHO-Five Well-Being Index in the context of detecting suicidal ideation in elderly community residents. Int Psychogeriatr. 2007;19(1):77–88. doi:10.1017/s1041610206004212
  • Mirowsky J, Ross CE. Age and depression. J Health Soc Behav. 1992;33(3):187–205. doi:10.2307/2137349
  • Stone AA, Schwartz JE, Broderick JE, Deaton A. A snapshot of the age distribution of psychological well-being in the United States. PNAS. 2010;107(22):9985–9990. doi:10.1073/pnas.1003744107