Search in:

Neuropsychiatric Disease and Treatment Volume 12, 2016 - Issue

Submit an article Journal homepage

Open access

304

Views

CrossRef citations to date

Altmetric

Listen

Original Research

Psychometric properties of the Spanish version of the Clinical Outcomes in Routine Evaluation – Outcome Measure

Adriana Trujillo1 Department of Personality, Assessment and Psychological Treatments, Faculty of Psychology;2 Institute for Brain, Cognition and Behaviour, University of Barcelona

Guillem Feixas1 Department of Personality, Assessment and Psychological Treatments, Faculty of Psychology;2 Institute for Brain, Cognition and Behaviour, University of BarcelonaCorrespondence[email protected]

Arturo Bados1 Department of Personality, Assessment and Psychological Treatments, Faculty of Psychology

Eugeni García-Grau1 Department of Personality, Assessment and Psychological Treatments, Faculty of Psychology

Marta Salla1 Department of Personality, Assessment and Psychological Treatments, Faculty of Psychology

Joan Carles Medina1 Department of Personality, Assessment and Psychological Treatments, Faculty of Psychology

Adrián Montesano1 Department of Personality, Assessment and Psychological Treatments, Faculty of Psychology;2 Institute for Brain, Cognition and Behaviour, University of Barcelona

José Soriano3 Hospital of the Holy Cross and Saint Paul

Leticia Medeiros-Ferreira4 Nou Barris Mental Health Center, Barcelona

Josep Cañete5 Hospital of Mataró, Sanitary Consortium of Maresme, Mataró

Sergi Corbella6 FPCEE, Blanquerna, Universitat Ramon Llull

Antoni Grau7 Institute of Eating Disorders, Barcelona

Fernando Lana8 MAR Health Park, CAEMIL, Santa Coloma de Gramenet, Spain

Chris Evans9 East London NHS Foundation Trust, NPDDNet, London, UK

show all

Pages 1457-1466 | Published online: 21 Jun 2016

Cite this article
CrossMark

In this article

Introduction
Method
Results
Discussion
References

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF

Abstract

Objective

The objective of this paper is to assess the reliability and validity of the Spanish translation of the Clinical Outcomes in Routine Evaluation – Outcome Measure, a 34-item self-report questionnaire that measures the client’s status in the domains of Subjective well-being, Problems/Symptoms, Life functioning, and Risk.

Method

Six hundred and forty-four adult participants were included in two samples: the clinical sample (n=192) from different mental health and primary care centers; and the nonclinical sample (n=452), which included a student and a community sample.

Results

The questionnaire showed good acceptability and internal consistency, appropriate test–retest reliability, and acceptable convergent validity. Strong differentiation between clinical and nonclinical samples was found. As expected, the Risk domain had different characteristics than other domains, but all findings were comparable with the UK referential data. Cutoff scores were calculated for clinical significant change assessment.

Conclusion

The Spanish version of the Clinical Outcomes in Routine Evaluation – Outcome Measure showed acceptable psychometric properties, providing support for using the questionnaire for monitoring the progress of Spanish-speaking psychotherapy clients.

Keywords:

CORE-OM
outcome measure
reliability
validity
psychometric validation

Introduction

This paper reports the psychometric properties of the Spanish version of Clinical Outcomes in Routine Evaluation – Outcome Measure (CORE-OM). The CORE-OM was designed mainly for practice-based evidence (a complement to evidence-based practice).^Citation1 We expected that the Spanish translation of the CORE-OM would be a useful, reliable, and valid instrument suitable to be widely used for research and practice in Spain and in some countries in which Spanish is spoken in similar form to that used in Spain. The translation should also prove a useful base, with the original English version, for countries where local Spanish usage is sufficiently different from that in Spain that a somewhat different translation will be needed.

From its origin, the measure was designed to be pan-theoretical (not associated with a school of therapy) and pan-diagnostic (not focused on a single presenting problem), and was driven by what practitioners and clients considered to be the most important generic aspects of psychological well-being, and change in therapies, to be measured. It is recommended to be used before and at the end of therapy.

The CORE-OM measure is copyleft; that is, it can be reproduced without payment of any license fee if it is not changed in any way.^Citation2 Translations were done following the CORE System Trust (CST) protocol, and with the supervision and guidance of Chris Evans (CE). Copyright violations are illegal, but CST and CE welcome collaboration on new translations to the protocol. All CORE instruments are available to download from,^Citation3 which provides more information about the system, instruments, and translation protocol. Information focused on the CORE-OM in Spanish is at.^Citation4

There are many fields in which CORE-OM has demonstrated its utility having been used in areas as varied as benchmark studies,^{Citation5,Citation6} assessment of outcome of psychological therapies in primary and secondary settings,^Citation7^–^Citation10 studies of treatment processes,^Citation11^–^Citation13 assessment of the psychological well-being of individuals in nonclinical occupational settings,^Citation14 and examination of psychological health among university students who were receiving university counseling.^{Citation15,Citation16} Acceptability and psychometric properties have been demonstrated with diverse samples, for example, older people and patients with eating disorders.^{Citation17,Citation18} Though designed more for practice-based evidence, the CORE-OM has been used in randomized controlled trials.^Citation19^–^Citation21

CORE-OM has been translated, following a clear and thorough protocol,^Citation22 into over 20 languages, with that number growing. Evaluation of the psychometric properties of the translated measure has been completed showing comparable psychometric properties to that found for the English version in the UK for a growing number of languages including Italian,^Citation23 Portuguese,^Citation24 Swedish,^Citation25 Lithuanian,^Citation26 Icelandic,^Citation27 and Croatian,^Citation28 and many others are nearing completion including a Catalan version. All forms are widely used as a routine change measure in a range of health care settings in the UK and increasingly in other languages and countries.^{Citation13,Citation29}^–^Citation32

This study was designed to assess the psychometric properties of the Spanish translation of the CORE-OM, and hence its suitability to be used in routine assessment of mental health interventions in Spain and perhaps other Spanish-speaking countries.

Method

The Spanish version

For the translation and adaptation of the CORE-OM to Spanish, we followed the steps established by international groups and the CST protocol including participation of a member of the group who designed the instrument (CE).^Citation33 This process is congruent to the guidelines of the International Test Commission,^Citation34 and it emphasizes the importance of translating the items according to their con-textualized meaning in culture and the environment in which they will be applied, as well as making them understandable for the most varied range of possible potential users. It does not rely excessively on back-translation in order to avoid too literal translations. To seek for improvement of the resulting version, we requested the collaboration of 12 people from different parts of Spain, selected because of their high level of English proficiency. Ten of them responded to the request by providing a translation. Six of them were professionals in psychology, and four were lay people. With this material, a working session was organized with the participation of two of the professionals of psychology and two of the lay people who collaborated with the translation, along with a member of the CST (CE) who acted as a consultant or supervisor. In this session, each item was discussed taking into account the available translations. For each item, the best option was chosen by consensus. A first draft came out of this process which was reviewed by three experts in psychology with over 20 years of experience in clinical settings who made some modifications that were discussed by email with CE. This revised version was submitted to extensive scrutiny by a group of 64 people (between 16 and 76 years, all of them from different conditions and linguistic backgrounds, and fully proficient in Spanish, 12 of them were professionals of psychology, and 52 lay people) who were asked to read it carefully and to judge whether the items were understandable and clear. They were also encouraged to make all the comments they deemed appropriate with regard to the way items were written.

Afterward, the comments and observations made were discussed by the three experts mentioned, and issues that seemed to need discussion of the original English were shared with CE, until a final version was achieved. This version was delivered to an experienced, bilingual English–Spanish translator, with a degree in psychology and without access to the original version, to back-translate. Looking at this back-translation, neither the experts nor the member of the CST considered necessary to make any modification of the latest version, at which point it became the CST-approved translation into Spanish.

From that version,^Citation33 the shorter versions (designed for routine use in therapy sessions, for screening and ongoing monitoring: CORE-SFA, CORE-SFB, CORE-10, and CORE-5, all in male and female versions^Citation35) were typeset and made available through webpages, initially^Citation4 and now also.^Citation22

Participants and procedures

The study protocol was approved by the Bioethics Committee of the University of Barcelona (ref. IRB0003099) and by the ethical committees of the centers taking part in the study. All the participants were informed of the implications of the study and signed an informed consent document before enrolling. The study included 644 adult participants in two samples (). The clinical sample (n=192) comprised patients from nine mental health centers and from some primary care centers in the Barcelona area. The CORE-OM was included in the routine pretreatment assessment of these centers, and it is this routine clinical data that are reported on in this paper. All patients who were referred for psychological treatment between March 2012 and May 2013 in the centers collaborating in the study were included in the study. Professionals were asked to exclude from these referrals inpatients and outpatients with severe psychological disorders. Another exclusion criterion was insufficient linguistic competence to communicate in Spanish.

Table 1 Demographic data

Download CSV Display Table

The nonclinical sample (n=452) included a student and a community nonstudent sample, between 18 and 70 years of age (inclusion criteria) who were assessed in the same period from March 2012 to May 2013 and had sufficient linguistic competence to communicate in Spanish. The latter (n=127) consisted of volunteers and/or their relatives who were not receiving psychological treatment (exclusion criterion). The student sample (n=325) was drafted from the Faculty of Psychology of the University of Barcelona; 219 were undergraduate students from four different subject areas, and 106 were master-level students.

Forty-six participants of the community sample and 32 of the student sample agreed to take part in the test–retest survey completing the questionnaire twice; this second administration of the questionnaire took place between 15 and 30 days after the first one. For student, test and retest were made in their classrooms with a 2-week test–retest interval; for the community sample, all the participants who completed the first assessment were contacted by phone ~2 weeks later and were invited to participate in the retest survey. For those who accepted, the questionnaire was sent in an envelope, and they completed and returned it. Test–retest stability was not measured in the clinical sample as that would have involved significant interference with normal clinical management of these participants. This was in line with the UK original study where there was no test–retest stability examination in the clinical sample for the same reason.^Citation36

Instruments and measures

CORE-OM is a 34-item self-report questionnaire that assesses the client’s status in the domains of Subjective well-being (four items), Problems/Symptoms (12 items), Functioning (12 items), and Risk (six items).^{Citation36,Citation37} Eight of the items are positively cued (items 3, 4, 7, 12, 19, 21, 31, and 32). The focus is on the last 7 days, and items are scored in a five-point scale ranging from 0 (not at all) to 4 (most or all the time), where higher scores on all domains indicate more problems and high levels of psychological distress even for the Subjective well-being scale. The domains were named to designate their item content but never envisaged to be psychometric factors.^{Citation35,Citation38} The Subjective well-being domain comprises four items capturing this aspect. The Problems/Symptom domain includes four items addressing anxiety, four for depression, and two each for physical problems and trauma. The Functioning domain includes four items covering general/work functioning, four addressing close relationships, and four for social functioning. The Risk domain has four items about risk to self and two about risk to others.

The CORE-OM was designed to be user-friendly for both clients and practitioners.^{Citation35,Citation39} It takes 5–10 minutes to complete, and the total and domain scores are reported as means across items. Prorating, that is, using the item mean even with missing items, is recommended as long as <10% of the items in the score are missing.^Citation36

Psychometric properties were excellent in the original UK testing and in all subsequent explorations showing high internal consistency (Cronbach α between 0.75 and 0.94 for all scores, the lowest for Risk) and test–retest stability of 0.91 (Spearman’s ρ 0.91 for 1-week test–retest in a student sample). Discriminant validity showed large differences between clinical and nonclinical samples (Cohen’s d from 0.71 Risk to 1.77 Problems/Symptoms) and high correlations with measures which are conceptually close, for example, Beck Depression Inventory-II (BDI-II) (ρ=0.85) and Symptom Checklist 90 Revised (SCL-90-R) (ρ=0.88). The CORE-OM is also sensitive to change in therapies.^{Citation15,Citation36} As expected, the domains did not show neat factorial separation, but an oblique structure in which Risk items are clearly separated from other items and two strongly correlated main problem dimensions of the positively and the negatively cued items gave a moderate and just acceptable fit on confirmatory factor analysis.^Citation38

BDI-II is a 21-item self-administered inventory designed to measure the intensity of depressive symptoms in psychiatric and nonpsychiatric populations of both adults and adolescents.^Citation40 Items are rated on a four-point scale (0–3), and total scores are obtained by tallying the ratings for all 21 items. Scores range from 0 to 63, with higher scores reflecting increased depressive severity. The BDI-II requires ~5–10 minutes to complete and may be administered to individuals 13–80 years of age. We used the Spanish-language version of the BDI-II.^Citation41

SCL-90-R is a 90-item self-report symptom inventory designed to screen for a broad range of psychological problems.^Citation42 Each of the 90 items is rated on a five-point Likert scale of distress, ranging from “not at all” (0) to “extremely” (4). Subsequently, the answers are combined in nine primary symptom dimensions: Somatization, Obsessive-Compulsive, Interpersonal Sensitivity, Hostility, Depression, Anxiety, Paranoid Ideation, Phobic Anxiety, and Psychoticism. In addition, three global indices provide measures of overall psychological distress: the Global Severity Index, the Positive Symptom Total, and the Positive Symptom Distress Index. We used the Spanish-language version of the SCL-90-R.^Citation43

Analyses

To facilitate a comparison with the UK data, we followed the original study by assessing acceptability, internal consistency, test–retest reliability (with 15- to 30-day interval), influence of age and sex, correlations between domain scores, and discriminant validity against sample, reflected in the differences between clinical and nonclinical sample, along with the calculation of cutoff scores, and convergent validity in terms of the correlations between CORE-OM’s scores and those on the BDI-II and SCL-90-R.^Citation36 Following the UK study, most analyses were reported for each of the four content domains (Subjective well-being, Problems/Symptoms, Life/social functioning, and Risk) as well as for total scale, and for score of all items except those in the Risk domain. Internal reliability was reported as Cronbach’s α for the subsample with no missing item data,^Citation44 but results for domain scores were reported where a score could be computed by prorating up to 10% of missing items. To test the equality for the different coefficients in the samples and subsamples, a Felt’s procedure was done.^Citation45 Again following the UK validation study, nonparametric correlation coefficients (Spearman’s ρ) and nonparametric tests of differences in central location of distributions (Wilcoxon test) were used as scores did not conform to Gaussian distributions. The BDI-II^{Citation40,Citation41} and the SCL-90-R^{Citation42,Citation43} were used to test convergent validity with other self-report measures. Clinically significant change was calculated according to the c criterion that uses a cutoff point based on the contrast between dysfunctional and general population samples.^Citation46 Analyses were conducted using SPSS, version 20.0. As in the original paper, the methodology was mainly exploratory and descriptive rather than one of null hypothesis testing; wherever possible, 95% confidence intervals (CIs) were reported rather than P-values. This gave a test approximating to testing for P<0.05. Comparisons of parameters within this sample and against those reported in the UK data were generally informed in terms of overlap or not of CIs.^Citation15

This paper did not follow the original UK analysis in including a principal component analysis, as subsequent UK papers have shown that the CORE-OM, as its authors expected, has a complicated factor structure that would need larger clinical and nonclinical samples for the Spanish data than we have to date.^Citation6 More psychometric exploration will be reported later when such significantly larger samples are available.

Results

Acceptability

All of the questionnaires have sufficiently few items missing to allow prorating for a usable overall score (ie, no participant omitted more than three items). One hundred and seventy-nine (93.2%) participants of the clinical and 432 (95.6%) of the nonclinical samples returned completed data. The overall omission rate was 0.17%. The items that were most often incomplete were items 3 (0.7%) and 25 (0.7%) in the nonclinical and items 21 (1%) and 32 (1%) in the clinical sample.

Internal consistency

To evaluate the internal reliability, we calculated Cronbach’s α,^Citation44 for all domains and the entire scale for the clinical and nonclinical groups. Furthermore, to test if the differences between these coefficients were statistically significant, we followed the procedure proposed by Feldt et al.^Citation45 All domains showed an appropriate internal reliability in both samples. The levels were within the acceptable range, although being lower for the Risk domain ( and ).

Figure 1 Forest plot showing comparison between Spanish scores and UK referential data.

Table 2 Coefficient α (95% CI) denoting internal consistency for nonclinical and clinical samples

Download CSV Display Table

In comparison with the UK referential data, the pooled clinical and nonclinical α values for all items and all nonrisk items showed tight 95% CIs covering the UK referential values, and when the clinical and nonclinical samples were pooled, the lower confidence limit (CL) was above that for the UK data. For Subjective well-being, the Spanish α was above the UK one; for Problems/Symptoms, the clinical sample α had a CI covering the UK one, and the nonclinical α was slightly lower than the UK nonclinical value with the upper CL below the UK value; for Functioning, the CIs included the UK referential values. The values for the Risk domain were lower than the UK ones (which were the same for clinical and nonclinical samples at 0.79), though the CI for the combined clinical sample included 0.79.

Test–retest stability

Test–retest correlations were strong within domains in the nonclinical data (). The stabilities for all domains were satisfactory (range: 0.76–0.87), except for the Risk domain (0.45) reflecting the high rate of zero responses in answering these items in the nonclinical group. Changes of mean values between first and second survey were not significant for all scores.

Table 3 Test–retest stability and changes of mean values between first and second survey in a nonclinical sample (n=78)

Download CSV Display Table

Convergent validity

Correlations between domain scores and the BDI-II and the SCL-90-R were calculated (). Across domain scores, correlations were highest against conceptually close measures showing an acceptable convergent validity. The pattern and the correlations were generally very similar to the UK findings,^Citation36 although the Spanish correlations between the Risk scores and the BDI-II and SCL-90R were lower than the UK ones.

Table 4 Correlations with referential measures in clinical samples

Download CSV Display Table

Differences between clinical and nonclinical samples

There were significant differences between clinical and nonclinical samples in all domains () with higher scores for the clinical sample than the nonclinical one. With the exception of the Problem/Symptoms domain, the effect sizes of the differences were similar to the results of the UK study with CIs including the UK referential values.^Citation36 At 1.4 (CI 1.22–1.59), the effect size for the Problem/Symptoms score is lower than the UK referential that was 1.7 but remains respectable as discriminant validity against the clinical/nonclinical distinction. As in the UK data, the effect size of the difference for the Risk score at 0.8 was smaller than for all the other scores, actually higher than that in the UK data (0.7) but with the CI including the UK value.

Table 5 Mean and standard deviations for clinical and nonclinical samples

Download CSV Display Table

The box plot in shows no patients in the clinical sample scoring zero and a very few patients (outliers) in the nonclinical sample scoring very highly. The box for the one sample and the median line bisecting the box for the other sample do not overlap.

Figure 2 Box plot of mean item score for all items for clinical and nonclinical samples.

Abbreviation: CORE-OM, Clinical Outcomes in Routine Evaluation - Outcome Measure.

Sex and age differences

In the nonclinical sample, age was significantly and negatively related with all domain scores except Risk: Subjective well-being (ρ=−0.25, P<0.001), Problems/Symptoms (P=−0.23, P<0.001), and Functioning (ρ=−0.18, P<0.001); nevertheless, those relationships were weak. In the clinical sample only, the Functioning domain showed a significant correlation with age (ρ=−0.19, P=0.006), and again, this relationship was weak. Regarding sex, only the Subjective well-being domain showed a statistical difference between men and women in both samples with a small effect size ().

Table 6 Sex differences in scores for clinical and nonclinical samples

Download CSV Display Table

Correlations between domain scores

shows, as expected, significant and generally strong correlations between all domains. However, correlations between Risk domain scores and the other scores were lower, especially in the nonclinical sample.

Table 7 Correlations between Spearman’s ρ values for clinical and nonclinical samples

Download CSV Display Table

Clinically significant change

Values for clinical significant change were calculated for all domains following the c criterion which takes into account data from both clinical and nonclinical samples.^Citation46 Cutoff scores () separate typical clinical and nonclinical populations and will help to identify the extent to which change after treatment is clinically meaningful.

Table 8 Male and female cutoff scores between clinical and nonclinical populations

Download CSV Display Table

Discussion

To the extent that these psychometric analyses of these data from the Spanish version of the CORE-OM are good or acceptable, the translation is supported for use in Spanish-speaking populations.

Regarding acceptability, considered as the number of missing items and unusable measures, the results were excellent compared to those obtained in the original English-language test.^Citation37 In our study, the percentage of complete item responses was higher for both the clinical and the nonclinical sample than in the initial UK testing, which could be taken as an evidence not only for the proper design of the questionnaire but also for the quality of the translation process carried out to adapt this instrument into Spanish.^Citation33

These results are consistent with other studies of validation such as the Italian, where the percentages of item response (96% for the clinical sample and 81% for nonclinical sample)^Citation23 are comparable or lower than those observed in the current study (93.2% for the clinical and 95.6% for the nonclinical sample). Similarly, the results from Sweden have an omission rate of 0.44% of items,^Citation25 compared with 0.17% in our study. There are no patterns regarding specific items in which omissions occurred, indicating that there appears to be no connection to any specific dimension.

Considering reliability, the results are acceptable and consistent with the analysis made in other studies of adaptation and validation,^Citation25^–^Citation27 as well as with the original UK data. In all of these translations, including the present study, some differences in the internal consistency between clinical and nonclinical samples were identified; however, in all domains, the α value was between 0.7 and 0.9, which means that the reliability of the CORE-OM in Spanish has resulted as satisfactory as in other versions. α was lowest for the Risk domain, at 0.71 for the pooled nonclinical sample, lower than the observed value of 0.79 in the UK validation study (CI 0.77–0.81). It seems likely that this difference arises because the Risk items are tuned to catch mostly only quite significant levels of Risk, giving floor effects that curtail variance in nonclinical samples. It seems possible that both, the larger size of the UK nonclinical sample compared to that reported here and perhaps a higher rate of Risk to self in the UK populations, where, particularly in young adults, self-harm may be more prevalent than in some other countries probably including Spain,^{Citation47,Citation48} may have led to more inter-item covariance appearing in this score due to floor effects rather than necessarily to much lower population covariances.

Test–retest stability in our study was good with the exception of the Risk domain score, which again is likely to be explained by its small length, floor effects, and the intrinsically impulsive (and thus unstable) nature of some of the phenomena addressed by these items. Stability correlations were strong but slightly lower than in the UK study,^Citation36 which is consistent with other results such as the Icelandic data.^Citation27

Regarding convergent validity, correlations between the domain scores of the CORE-OM and the BDI-II and SCL-90-R were strong except again for the Risk scores, which is consistent with the original UK data.^Citation36 In different studies, the CORE-OM has shown satisfactory convergent validity with other conceptually close measures which supports its value as a wider general measure for psychotherapy outcome assessment.^{Citation23,Citation25,Citation27,Citation49}

Comparative analysis showed significant differences between clinical and nonclinical population in all domains, as in other validation studies, demonstrating discriminant validity across different countries and languages. The effect size (Cohen’s d) values were large for all domains.

As in the original UK data, small but statistically significant correlations between scores and age were found in our study, more so in the nonclinical than the clinical samples. These seem likely to be genuine demographic associations, but the small effect size illustrates that age does not strongly and systematically contaminate scores. However, the majority of participants in the nonclinical sample were students (72%) with a very different age mean and range from clinical population. Thus, larger replication studies with more diverse nonclinical samples are needed to ascertain the generalizability of these differences. Furthermore, a community sample of persons who exceed pensionable age, almost absent in these samples, would indicate whether specific norms are needed for older populations.^Citation17

In the analysis of sex differences in mean scores, only Subjective well-being domain showed a statistical difference between men and women in both samples, with a small effect size in the same direction as the results analyzed in the UK version.^Citation15 According to the UK authors, sex should be considered in the interpretation of individual data regardless of clinical or nonclinical condition. In the Swedish and Italian studies, sex differences were very similar to those found in the UK study.^{Citation23,Citation25,Citation36} However, it seems highly plausible that there will be sex effects, which may be culture specific.

The strong and positive correlations between the domain scores are expected because the items of the CORE-OM are designed to evaluate related aspects of psychological distress, and the correlations found in this study are not dissimilar from those in all explorations to date with the only scale showing low correlations with respect to the others being Risk.^{Citation5,Citation23,Citation25}^–^Citation27 This corroborates the special characteristics of the Risk domain,^Citation38 defined as an oblique factorial scale with fairly low positive correlation with the other items and domains illustrating that Risk issues are, generally, rather distinct from other aspects of psychological distress domains. The items were designed as much to provide flags of Risk more than to form a robust scale, while ensuring that the crucial issue of Risk would contribute to the overall score, in contrast to many measures which omit it. The findings in this study, in the UK, and of all other translations studied so far fit that design.

The cutoff scores obtained in our study are a little lower than those reported by the British and Lithuanian adaptations.^{Citation26,Citation36} Our values seem more similar to those found in the Italian version,^Citation23 with the exception of the Functioning domain, which again is lower in our data than the others. It seems entirely plausible that cutoff scores, which reflect service provision (implicit in the separation of clinical and nonclinical populations), will show cultural/national variations. Currently, data about cutoff scores for reliable change are being collected, and we hope that the results will be published soon.

Overall, the results provide very reassuring information about the psychometric properties and the potential of the Spanish version of the CORE-OM. The limitations of the study are the nonrandom sample frames, the relatively limited sample sizes, lack of interview measures, and relatively limited number of convergent validity tests. However, these results clearly support the use of the measure and justify development of subsequent studies with the forms derived from this questionnaire. A Catalan translation has been completed, and psychometric exploration of it is currently in progress. Another translation into Spanish considered more suited for use in Argentina has been completed but not tested. Initial discussions with a few natives suggest that the Spanish version assessed in this article is considered acceptable for use in Chile, Mexico, and Colombia. Further exploration of its acceptability, and then its psychometric properties in other Spanish-speaking countries other than Spain, is encouraged.

In summary, this study presents the Spanish version of the CORE-OM showing that it is a reliable and valid instrument for assessing psychological distress in patients and providing feedback to their therapists about overall change and ongoing progress. An additional advantage of this instrument in all its versions, including Spanish, is that it can be used without payment of license fees, and this should facilitate generation of much more evidence about the efficacy and effectiveness of psychological therapies in Spain and at least some other Spanish-speaking populations.

Disclosure

The contract grant sponsor for this study was the Ministry of Economy and Competitiveness of Spain, contract grant number: PSI2011-23246. The authors report no other conflicts of interest in this work.

References

MargisonFRBarkhamMEvansCMeasurement and psychotherapy evidence-based practice and practice-based evidenceBr J Psychiatry2000177212313011026951
PubMed Web of Science ®Google Scholar
CORE System Trust [homepage on the Internet]United KingdomCORE Information Management Systems [updated 2015 Jun 1; cited 2015 Oct 10]. Available from: http://www.coresystemtrust.org.uk/copyright.pdfAccessed March 18, 2016
Google Scholar
CORE System Trust [homepage on the Internet]United KingdomCORE Information Management Systems [updated 2015 Jun 1; cited 2015 Oct 10]. Available from: http://www.coresystemtrust.org.ukAccessed March 18, 2016
Google Scholar
Spanish CORE-OM [homepage on the Internet]SpainCORE-OM Official Spanish Website [updated 2015 Jun 1; cited 2015 Oct 10]. Available from: http://www.ub.edu/terdep/coreAccessed March 18, 2016
Google Scholar
BarkhamMEvansCMargisonFThe rationale for developing and implementing core batteries in service settings and psychotherapy outcome researchJ Ment Health1998713547
Google Scholar
BarkhamMMargisonFLeachCService profiling and outcomes benchmarking using the CORE-OM: toward practice-based evidence in the psychological therapiesJ Consult Clin Psychol200169218419611393596
PubMed Web of Science ®Google Scholar
BarkhamMGilbertNConnellJMarshallCTwiggESuitability and utility of the CORE-OM and CORE-A for assessing severity of presenting problems in psychological therapy services based in primary and secondary care settingsBr J Psychiatry200518623924615738505
PubMed Web of Science ®Google Scholar
EvansCConnellJBarkhamMMarshallCMellor-ClarkJPractice-based evidence: benchmarking NHS primary care counselling services at national and local levelsClin Psychol Psychother2003106374388
Web of Science ®Google Scholar
GilbertNBarkhamMRichardsACameronIThe effectiveness of a primary care mental health service delivering brief psychological interventions: a benchmarking study using the CORE systemPrim Care Ment Health200534241251
Google Scholar
McHughPGordonMByrneMEvaluating brief cognitive behavioural therapy within primary careMent Health Rev J2014193196206
Google Scholar
HardyGECahillJStilesWBSudden gains in cognitive therapy for depression: a replication and extensionJ Consult Clin Psychol2005731596715709832
PubMed Web of Science ®Google Scholar
StilesWBLeachCBarkhamMEarly sudden gains in psychotherapy under routine clinic conditions: practice-based evidenceJ Consult Clin Psychol2003711142112602421
PubMed Web of Science ®Google Scholar
Di BonaLSaxonDBarkhamMDent-BrownKParryGPredictors of patient non-attendance at improving access to psychological therapy services demonstration sitesJ Affect Disord201416915716425194784
PubMed Web of Science ®Google Scholar
Royal College of NursingWorking Well? Results from the RCN Working Well Survey into the Wellbeing and Working Lives of NursesLondonAuthor2003
Google Scholar
ConnellJBarkhamMStilesWBDistribution of CORE−OM scores in a general population, clinical cut-off points and comparison with the CIS-RBr J Psychiatry20071901697417197659
PubMed Web of Science ®Google Scholar
ConnellJBarkhamMMellor-ClarkJMental health CORE-OM norms of students attending university counselling services bench-marked against an age-matched primary care sampleBr J Guid Counc20073514156
Web of Science ®Google Scholar
BarkhamMCulverwellASpindlerKTwiggEThe CORE-OM in an older adult population: psychometric status, acceptability, and feasibilityAging Ment Health20059323524516019277
PubMed Web of Science ®Google Scholar
JenkinsPETurnerHMAn investigation into the psychometric properties of the CORE-OM in patients with eating disordersCouns Psychother Res2014142102110
Google Scholar
RichardsDALovellKGilbodySCollaborative care for depression in UK primary care: a randomized controlled trialPsychol Med2003380227928717803837
PubMed Web of Science ®Google Scholar
FeixasGBadosAGarcía-GrauEA dilemma-focused intervention for depression: a multicenter, randomized controlled trial with a 3-month follow-upDepress Anxiety Epub2016422
Web of Science ®Google Scholar
NordgrenLBHedmanEEtienneJEffectiveness and cost-effectiveness of individually tailored internet-delivered cognitive behavior therapy for anxiety disorders in a primary care population: a randomized controlled trialBehav Res Ther20145911124933451
PubMed Web of Science ®Google Scholar
CORE System Trust [homepage on the Internet]United KingdomCORE Information Management Systems [updated 2015 Jun 1; cited 2015 Oct 10]. Available from: https://www.coresystemtrust.org.uk/cst-translation-policyAccessed March 18, 2016
Google Scholar
PalmieriGEvansCHansenVValidation of the Italian version of the Clinical Outcomes in Routine Evaluation Outcome Measure (CORE-OM)Clin Psychol Psychother200916544444919701881
PubMed Web of Science ®Google Scholar
Dias SalesMCde Matos MoleiroCMEvansCGomes AlvesPCThe Portuguese version of the CORE-OM: translation, adaptation, and preliminary study of psychometric propertiesRev Psiquiatria Clín20113925459
Google Scholar
ElfströmMLEvansCLundgrenJJohanssonBHakebergMCarlssonSGValidation of the Swedish Version of the Clinical Outcomes in Routine Evaluation Outcome Measure (CORE-OM)Clin Psychol Psychother201320544745510.1002/cpp.178822438100
PubMed Web of Science ®Google Scholar
ViliūnienéREvansCHilbigJTranslating the Clinical Outcomes in Routine Evaluation Outcome Measure (CORE-OM) into LithuanianNord J Psychiatry201267530531123228157
PubMed Web of Science ®Google Scholar
KristjánsdóttirHSigurðssonBHSalkovskisPEvaluation of the psychometric properties of the Icelandic version of the Clinical Outcomes in Routine Evaluation–Outcome Measure, its transdiagnostic utility and cross-cultural validationClin Psychol Psychother2015221647410.1002/epp.187424170273
PubMed Web of Science ®Google Scholar
Jokić-BegićNLauri KorajlijaAJurinTEvansCFaktorska struktura, psihometrijske karakteristike i kritična vrijednost hrvatskoga prijevoda CORE-OM upitnika [Factor structure, psychometric properties and cut-off scores of Croatian version of clinical outcomes in routines evaluation – outcome measure (CORE-OM)]Psihologijske Teme2014232265288 Slovenian
Google Scholar
StilesWBBarkhamMWheelerSEffect of duration of psychological therapy on recovery and improvement rates: evidence from UK routine practiceBr J Psychiatry201520711512225953889
PubMed Web of Science ®Google Scholar
BiescadMTimulakLMeasuring psychotherapy outcomes in routine practice: examining Slovak versions of three commonly used outcome instrumentsEur J Psychother Couns2014162140162
Google Scholar
MechlerJHolmqvistRDeteriorated and unchanged patients in psychological treatment in Swedish primary care and psychiatryNord J Psychiatry2016701162325994483
PubMed Web of Science ®Google Scholar
SturludottirKGestsdottirSRafnssonRHJohannssonEÁhrif hrey-fiíhlutunar á einkenni geðklofa, andlega líðan og líkamssamsetningu hjá ungu fólki [The effects of physical activity intervention on symptoms in schizophrenia, mental well-being and body composition in young adults]Laeknabladid201510111519 Icelandic26584222
PubMed Web of Science ®Google Scholar
FeixasGEvansCTrujilloALa versión española del CORE-OM: Clinical Outcomes in Routine Evaluation – Outcome Measure [Spanish version of the CORE-OM: Clinical Outcomes in Routine Evaluation – Outcome Measure]Rev de Psicot201289109135 Spanish
Google Scholar
International Test CommissionInternational Test Commission guidelines for translating and adapting tests2010 Available from: http://www.intestcom.org/files/statement_using_tests_for_research.pdfAccesed March 18, 2016
Google Scholar
EvansCThe CORE-OM (Clinical Outcomes in Routine Evaluation) and its derivativesIntegra Sci Pract2012221214
Google Scholar
EvansCConnellJBarkhamMTowards a standardised brief outcome measure: psychometric properties and utility of the CORE-OMBr J Psychiatry2002180516011772852
PubMed Web of Science ®Google Scholar
EvansCMellor-ClarkJMargisonFCORE: clinical outcome in routine evaluationJ Ment Health20009247255
Google Scholar
LyneKJBarrettPEvansCBarkhamMDimensions of variation on the CORE-OMBr J Clin Psychol200645218520316783905
PubMed Web of Science ®Google Scholar
BarkhamMMellor-ClarkJConnellJCahillJA core approach to practice-based evidence: a brief history of the origins and applications of the CORE-OM and CORE SystemCouns Psychother Res200661315
Google Scholar
BeckATSteerRABrownGKManual for the Beck Depression Inventory – Second Edition (BDI-II)San Antonio, TXPsychological Corporation1996
Google Scholar
SanzJPerdigónALVázquezCAdaptación Española del inventario para la depresión de Beck-II (BDI-II): Propiedades psicométricas en población general [Spanish adaptation for depression inventory Beck-II (BDI-II): psychometric properties in general population]Clín Salud200314249280 Spanish
Google Scholar
DerogatisLRSCL-90-R: Administration, Scoring and Procedures: Manual IITowson, MDClinical Psychometric Research1983
Google Scholar
González de RiveraJLDe las CuevasCRodríguezMRodríguezFCuestionario de 90 síntomas SCL-90-R de Derogatis, L. Adaptación española [90 Symptoms Questionnaire SCL-90-R of Derogatis, L. Spanish Adaptation]Madrid, SpainPearson2002 Spanish
Google Scholar
CronbachLJCoefficient alpha and the internal structure of testPsychometrika1951163297334
Web of Science ®Google Scholar
FeldtLSWoodruffDJSalihFAStatistical inference for coefficient alphaAppl Psychol Meas198711193103
Web of Science ®Google Scholar
JacobsonNSTruaxPClinical significance: a statistical approach to defining meaningful change in psychotherapy researchJ Consult Clin Psychol199159112192002127
PubMed Web of Science ®Google Scholar
BernalMHaroJMBernertSRisk factors for suicidality in Europe: results from the ESEMED studyJ Affect Disord20071011–3273417074395
PubMed Web of Science ®Google Scholar
CaseyPRDunnGKellyBDFactors associated with suicidal ideation in the general population five-centre analysis from the ODIN studyBr J Psychiatry2006189541041517077430
PubMed Web of Science ®Google Scholar
CahillJBarkhamMStilesWBTwiggWReesAHardyGEConvergent validity of the CORE measures with measures of depression for clients in cognitive therapy for depressionJ Couns Psychol2006532253259
Web of Science ®Google Scholar

Download PDF

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Psychometric properties of the Spanish version of the Clinical Outcomes in Routine Evaluation – Outcome Measure

Abstract

Objective

Method

Results

Conclusion

Introduction

Method

The Spanish version

Participants and procedures

Table 1 Demographic data

Instruments and measures

Analyses

Results

Acceptability

Internal consistency

Table 2 Coefficient α (95% CI) denoting internal consistency for nonclinical and clinical samples

Test–retest stability

Table 3 Test–retest stability and changes of mean values between first and second survey in a nonclinical sample (n=78)

Convergent validity

Table 4 Correlations with referential measures in clinical samples

Differences between clinical and nonclinical samples

Table 5 Mean and standard deviations for clinical and nonclinical samples

Sex and age differences

Table 6 Sex differences in scores for clinical and nonclinical samples

Correlations between domain scores

Table 7 Correlations between Spearman’s ρ values for clinical and nonclinical samples

Clinically significant change

Table 8 Male and female cutoff scores between clinical and nonclinical populations

Discussion

Disclosure

References

Information for

Open access

Opportunities

Help and information

Psychometric properties of the Spanish version of the Clinical Outcomes in Routine Evaluation – Outcome Measure

Abstract

Objective

Method

Results

Conclusion

Introduction

Method

The Spanish version

Participants and procedures

Table 1 Demographic data

Instruments and measures

Analyses

Results

Acceptability

Internal consistency

Table 2 Coefficient α (95% CI) denoting internal consistency for nonclinical and clinical samples

Test–retest stability

Table 3 Test–retest stability and changes of mean values between first and second survey in a nonclinical sample (n=78)

Convergent validity

Table 4 Correlations with referential measures in clinical samples

Differences between clinical and nonclinical samples

Table 5 Mean and standard deviations for clinical and nonclinical samples

Sex and age differences

Table 6 Sex differences in scores for clinical and nonclinical samples

Correlations between domain scores

Table 7 Correlations between Spearman’s ρ values for clinical and nonclinical samples

Clinically significant change

Table 8 Male and female cutoff scores between clinical and nonclinical populations

Discussion

Disclosure

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date