1,643
Views
35
CrossRef citations to date
0
Altmetric
Research Article

Psychometric evaluation of the Dundee Ready Educational Environment Measure: Swedish version

, &
Pages e267-e274 | Published online: 25 Apr 2011

Abstract

Background: The Dundee Ready Educational Environment Measure (DREEM) has been used in various studies to evaluate the educational environment. However, psychometric evaluations of the instrument seem sparse, for all known versions of the instrument.

Aim: The aim was to psychometrically evaluate the Swedish version of the DREEM instrument.

Method: A total of 503 students (undergraduate medicine), aged 19–46 years, in semesters 2, 6 and 10 were included in the study. Validity was evaluated through analysis of construct validity and reliability.

Results: The instrument had in general both acceptable validity and reliability. Due to a rather poor model fit in the confirmatory factor analysis, an explorative factor analysis was also employed which suggested a new five-factor solution for the instrument.

Conclusions: The Swedish version of the DREEM instrument is shown to be valid and reliable, except for the factor structure. The new five-factor solution found in this study is not proven to be a superior measurement model compared with the original, but could be seen as an alternative model to the original, where the strong and weak areas are somewhat more easily identified.

Introduction

The Dundee Ready Educational Environment Measure (DREEM) was developed and validated for international use within health professions’ curricula about 15 years ago (Roff et al. Citation1997). It has since been extensively used to measure students’ perceptions of educational environment with different purposes. Schools and sites within schools (Roff et al. Citation2001; Al-Hazimi et al. Citation2004; Varma et al. Citation2005; McKendree Citation2009) and academic achievers and under-achievers (Mayya & Roff Citation2004) have been compared. Information about students’ perceptions of ideal environment (Till Citation2005) and expectations of the climate (Miles & Leinster Citation2007) has been collected. DREEM has also been used as an instrument for improvement (Whittle et al. Citation2007) and to follow what happens during curriculum reform (Jiffry et al. Citation2005; Bouhaimed et al. Citation2009).

The instrument has been translated into several languages (Roff Citation2005). Despite this extensive research, very few psychometric studies have been published since the original development of the instrument (Roff et al. Citation1997). Among those are a Portuguese version for use in training of residents (de Oliveira Filho et al. Citation2005), a Chinese version for nursing (Wang et al. Citation2009) and a Greek translation (Dimoliatis et al. Citation2010) The DREEM instrument contains 50 items and a Likert scale is used giving a global maximum score of 200. Factor analysis has resulted in five subscales (Roff et al. Citation1997). The original subscale structure was not maintained in the three translated versions (de Oliveira Filho et al. Citation2005; Wang et al. Citation2009; Dimoliatis et al. Citation2010). We have translated the instrument into Swedish and used it to measure the educational environment in an undergraduate medical programme (Edgren et al. Citation2010). A professional translation of the instrument does not guarantee that the original reliability and validity of the instrument remain when the instrument is used in a novel environment. Therefore, a psychometric test is warranted.

The aim of this study was thus to psychometrically evaluate the Swedish version of the DREEM.

Methods

Context

The medical programme at Lund University is 5½ years with undergraduate entry. At the time of the study, 96 students were enrolled twice a year (two semesters) and 50–60% of the students were females.

Questionnaire

The questionnaire (DREEM) contains 50 items with a five-point Likert response scale ranging from 0 = strongly disagree to 4 = strongly agree (Roff et al. Citation1997). Factor analysis has revealed five subscales: (1) perceptions of learning (maximum score 48), (2) perceptions of teachers (maximum score 44), (3) academic self-perceptions (maximum score 32), (4) perceptions of atmosphere (maximum score 48) and (5) social self-perceptions (maximum score 28). The total maximum score is 200. Some items are negative and the scores are reversed before calculations.

The DREEM questionnaire was translated into Swedish by a teacher proficient in English and then translated back into English by a professional translator. Differences were discussed between the translator and two of the authors. The instrument was piloted with a group of 20 students in the fourth year of study.

Sample

Data were collected through the DREEM questionnaire, which was distributed to all students (at lectures) in semesters 2, 6 and 10, respectively, in 2003 and 2005. Students who were present at the lecture got the form immediately and those who did not attend the lecture received it from course administrative staff. The cohort of semester 2 in 2003 and semester 6 in 2005 was thus the same, as were the cohorts of semester 6 in 2003 and semester 10 in 2005. The cohorts of semester 2 in 2005 differed slightly from the 2003 cohort, since they started their medical programme with an introductory course (8 weeks, focussing on an overview of the programme, PBL training and study techniques) which the cohort of semester 2 in 2003 did not. Else there were no changes in the curriculum during the study time.

Statistical analyses/data analysis

Data were analysed and compared across cohorts as well as for the total sample. The analyses were performed using the Student's t-test and the chi-square test. The chi-square test was used for comparisons of nominal data, while the Student's t-test was employed for interval/ratio data.

The instrument was tested for construct validity and reliability (Nunnally & Bernstein Citation1994). Validity was evaluated through analysis of item response rate, floor- and ceiling effects, corrected item-total correlations and factor structure.Item response rate was calculated as the number of (percentage of) respondents that had completed all items of the total instrument as well as in the subscales. A figure of 90% and above is considered as satisfactory. The proportion of floor and ceiling effects (i.e. people obtaining minimum and maximum scores, respectively) was studied among the subscales. The corrected item-total correlations (i.e. the correlation between each item and the total score of the remaining items in the hypothesised scale) should be 0.30 or greater to be considered as acceptable and interpreted as evidence supporting construct validity.

To evaluate the DREEM instrument's factor structure, confirmatory factor analyses (LISREL) were performed, based on Spearman's rank order correlation matrix. The factors were defined in concordance with the subscales in the instrument, while correlation between the latent variables, factor loadings on the latent variables and the residuals were set as free parameters. The chi-square goodness-of-fit test, the goodness-of-fit index (GFI), the adjusted goodness-of-fit index (AGFI), the normed fit index (NFI), the non-normed fit index (NNFI) and the root mean squared error of approximation (RMSEA) were used to assess goodness-of-fit. If the factor model fits, the chi-square test should be non-significant (i.e. p > 0.05). The GFI, AGFI and NFI are scaled to range between 0 and 1, with values above 0.90 suggesting a good model fit while values ≥0.85 can be considered to indicate acceptable model fit (Schermelleh-Engel & Moosbrugger Citation2003). An exception to this is the NNFI, which is not scaled to range between 0 and 1, although a higher value still indicates better fit (Schermelleh-Engel & Moosbrugger Citation2003), whereas RMSEA values ≤0.08 (preferably ≤0.05) are considered to indicate acceptable fit. The reason why the goodness-of-fit of the factor model was assessed by various descriptive fit indices was because this statistic is sensitive to sample size (e.g. the chi-square test). Furthermore, when the number of free parameters increases in the model, the better it will fit, e.g. in GFI (Nunnally & Bernstein Citation1994). However, the AGFI and the NNFI adjust (downwards) the GFI and NFI, respectively, based on the number of parameters/degrees of freedom.

In order to further investigate the factor structure of the DREEM instrument, an explorative factor analysis (i.e. principal component analysis) with varimax rotation was also performed. In the explorative factor analysis, both the Kaiser's eigenvalue criterion (Eigenvalue ≥1), and a scree plot was used to determine the number of factors to extract (Ferguson & Cox Citation1993). Missing values were excluded listwise. The quality of the factor analysis models was assessed using Bartlett's test for sphericity and the Kaiser–Meyer–Olkin (KMO) test, together with evaluation of communalities, factor loadings and eigenvalues. Bartlett's test is a measure of the probability that the initial correlation matrix is an identity matrix and should be <0.05. The KMO test measures the degree of multicollinearity (based on partial correlations) and varies between 0 and 1 (should be greater than 0.50–0.60).

The reliability of the instrument was analysed by means of tests for internal consistency, employing Cronbach's alpha (Cronbach Citation1951). The alpha values preferably range between 0.7 and 0.9 even if figures as low as 0.6 may be acceptable (Nunnally & Bernstein Citation1994). Alpha values lower than 0.7–0.6 indicate too high heterogeneity and values higher than 0.9 indicate that the items may be too similar.

The items (nos 4, 8, 9, 17, 25, 35, 39, 48 and 50) were revised so that all items ranged from positive to negative replies before the factor analyses, correlations and Cronbach's alpha were performed. Data were analysed using SPSS 15.0 for Windows and LISREL 8.51.

Results

The study comprised 395 individuals with a mean age of 25.5 (95% CI 25.0–26.0), range 19–46 years (). The overall response rate was 82% (201/246) in 2003 and 75% (194/257) in 2005. Of the students that responded, 58.9% were females and 41.1% males. Mean results and SD are presented in .

Table 1.  Description of the sample and comparison between the two cohorts.

The rate of incomplete questionnaires seemed at first rather high. Of the 103 persons (26%) who failed to complete all 50 items, the number of missing responses in each item ranged between 1 (0.3%) and 73 (18.5%). However, it was actually item number 6 (‘The teachers are patient with the patients’) that had the highest internal missing (n = 73, 18.5%), and the missing responses were all from the students in semester 2 (i.e. students in the beginning of the programme containing core pre-clinical studies and have only had a few encounters with patients). When not considering this item, the missing responses ranged between 1 (0.3%) and 6 (1.5%). Missing responses in the subscales (when including all items) were subscale I: 3.8%, subscale II: 20.5%, subscale III: 4.6%, subscale IV: 17.0% and subscale V: 2.3%. Analyses of floor- and ceiling effects showed no major problems (). No floor effects were identified and only minor ceiling effects were detected; subscale II had two respondents (0.5%) that reached the maximum value, subscale IV had four respondents (1.0%) reaching the maximum, and in subscale V three respondents (0.8%) did so. The corrected item-total correlations indicated that the instrument had acceptable construct validity (). All correlations were r ≥ 0.3, except item number 48 (‘The teaching is too teacher-centred’) in subscale I, item number 17 (‘Cheating is a problem in this school’) in subscale IV, and item number 46 (‘My accommodation is pleasant’) in subscale V.

Figure 1. The distribution of the responses for each factor.

Figure 1. The distribution of the responses for each factor.

Table 2.  Corrected item-total correlations for each item (divided in the pre-defined subscales).

Factor loadings from the confirmatory factor analysis ranged from 0.028 to 0.762 () and GFIs did not fully support the hypothesised items-to-summary measures structure (). The chi-square test (p < 0.001), GFI (=0.75), AGFI (=0.73), produced by the LISREL program all indicated a rather low goodness-of-fit, while NFI (=0.89), NNFI (=0.93) and the RMSEA (=0.068) indicated an acceptable fit. All items had factor loadings of ≥0.3 except item number 17 (‘Cheating is a problem in this school’), 23 (‘The atmosphere is relaxed during lectures’), 39 (‘The teachers get angry in class’), 46 (‘My accommodation is pleasant’). Item 39 had an especially low factor loading.

Table 3.  Result from the confirmatory factor analysis (the items are divided in the predefined subscales).

The instrument was also found to have acceptable Cronbach's alpha for all scales even if the alpha value for subscale V was a little bit low (but was approximately α = 0.7). Cronbach's alpha for the five subscales (I–V) were I: 0.807; II: 0.785; III: 0.720; IV: 0.786 and V: 0.689. The alpha value for the whole DREEM instrument (all items) was 0.930.

Due to the rather poor model fit in the confirmatory factor analysis, an exploratory factor analysis was also performed. Various factor solutions (ranging between a five- to a nine-factor solution) were suggested. However, only the five-factor structure could give a logical classification of the items (). This new five-factor solution explained 42% of the variance, and the communalities varied between 0.119 and 0.627. Two items (no. 26 and 46) had communality values <0.2 (). Each new factor was reviewed and given new headings based on its contents (Appendix). In the review two items were found, based on its content, to be more suitable in another factor, and these two changed place. Item 26 was moved from F2 to F4 and item 11 from F4 to F2. The decision to make these changes was due to, on the one hand, that the item better matched the content in the new factor, and on the other hand, the factor loadings were approximately the same for these items in both factors. A description of the items for each factor is presented in the appendix, and descriptive statistics for the new factors is presented in . Cronbach's alpha was calculated for the new five factors and ranged between 0.705 and 0.892 (). Missing responses in the subscales (for the new five-factor solution) were subscale I: 6.6%, subscale II: 22.3%, subscale III: 1.8%, subscale IV: 2.8% and subscale V: 2.0%. No floor or ceiling effects were identified.

Table 4.  Principal component analysis with varimax rotation for the DREEM instrument.

Students could write comments on other things that they thought were important for the environment, and a few of them commented that the buildings (lecture halls, group rooms and self-study rooms) are important.

Discussion

The Swedish version of the DREEM instrument was found to have both acceptable validity and reliability. The validity was tested in terms of construct validity and reliability in terms of internal consistency. Also the feasibility was assessed through analysis of item response rate.

The overall response rate of 82% and 75%, respectively, can be considered acceptable. The proportion of female and male respondents fitted well with the figures in the whole group.

Item 17 (‘Cheating is a problem in this school’) and item 46 (‘My accommodation is pleasant’) appear to have both low correlation and low factor loading. Do the students misinterpret the item about cheating? Does it mean only at exams or otherwise too? The item about accommodation is probably not relevant in a Swedish context. It has the highest score in the instrument (mean 3.46, SD 0.79). Swedish standard of living is generally high, and most importantly it is not the responsibility of the universities to provide accommodation for students.

The other items that had low factor loadings were number 23 (‘The atmosphere is relaxed during lectures’) and 39 (‘The teachers get angry in class’) which had an especially low factor loading. Maybe these items do not appear relevant to the Swedish students where the relation between students and teachers are very egalitarian, and they therefore make different interpretations than students in other contexts.

Due to a rather poor model fit in the confirmatory factor analysis, an explorative factor analysis was employed which suggested a new five-factor solution for the instrument. The new five-factor solution had slightly better Cronbach's alpha values than the original factor model. Neither the confirmatory factor analysis nor the exploratory analysis provided clear support for the internal construct validity (factor structure) of the proposed measurement model (Roff et al. Citation1997). This has also been seen in other psychometric analyses of the instrument that have been published (de Oliveira Filho et al. Citation2005; Wang et al. Citation2009; Dimoliatis et al. Citation2010). Dimoliatis et al. (Citation2010) presented both confirmative and exploratory factor analyses, and neither completely confirmed the original model. According to these authors (Dimoliatis et al. Citation2010), the original model was developed by a qualitative method, and this could explain the differences. Wang et al. (Citation2009) proposed yet another five-factor solution in their study from an undergraduate nursing school, and cultural differences are suggested as an explanation. The Portuguese version was used for residents, which could explain some differences, since the instrument was developed for undergraduate students (de Oliveira Filho et al. Citation2005). A reason for the poor model fit in some of the tests, for the confirmatory factor analysis in this study, might be due to the fact that there were few observations in relation to the number of parameters estimated. The recommended ratio is 20 observations for each estimated parameter (i.e. ≈1/20) (Hair et al. Citation2008). In this study, this ratio was 50/395 (≈1/4), while the number of respondents should be around 1000 according to this rule of thumb. Another reason to the poor model fit may be the influence of some items (i.e. item numbers 17, 23, 39, 46 and 48) that had low correlations with the summary score of the subscale to which they were intended to belong. However, despite these limitations, half of the goodness-of-fit tests indicated an acceptable model fit and the majority (92%) of factor loadings were ≥0.3, which could be seen as an indication for acceptable construct validity of the DREEM instrument anyway. Furthermore, the analysis of item-total correlation () supported this assumption.

Both the confirmatory factor analysis and the item-total correlation identified some items to have low correlation with the factor. These items were item numbers 17, 39 and 46 ( and ). One explanation mentioned above might be that these questions are not as relevant for the Swedish context. However, this problem was not found in the factor model derived from the explorative factor analysis (except for item number 46 which had a communality of 0.156). Thus, the low factor loadings may still be a result of a poor model fit and if the items are to be included in the Swedish version of the DREEM instrument, then the factor structure suggested by the explorative factor analysis is probably a better option.

The suggested five-factor solution was not in accordance with the hypothesised (original) measurement model of the DREEM instrument and is not proven to be a superior measurement model compared with the original, but could be seen as an alternative model which can be further explored/tested. The new solution however added some value to the interpretation of the results. There was a somewhat bigger difference in the average scores on these subscales (0.68–0.82% of the maximum) than on the original ones (0.70–0.77% of the maximum) (). The strong and weak areas, respectively, are somewhat more easily identified, and can be subjected to development efforts. In our case, it became obvious that areas where students have a stronger influence (e g ‘psychosocial situation’, 82% of maximum) were more highly rated than areas where the teachers have a more obvious responsibility (‘communication’ and ‘teaching organisation and progression’, both 68% of maximum) (cf. ).

It was interesting that some students commented that buildings and rooms are also a part of the environment and contribute to the perceptions of the students. It seems reasonable to believe that rooms for teaching and learning also influence the climate, and an item about those could be added to the instrument.

The results of this study and others (de Oliveira Filho et al. Citation2005; Wang et al. Citation2009; Dimoliatis et al. Citation2010) indicate that the DREEM may not be as culturally independent as was originally stated (Roff et al. Citation1997). For future use in Sweden, some items should probably be removed, at least item 46. Item numbers 17 and 39 should perhaps also be taken out or subjected to an analysis of how the students interpret these items. When we used the instrument for measurement of the climate in Lund university medical programme (Edgren et al. Citation2010), we made use of the total score, scores on individual items and comparisons between years and semesters. Our conclusions in that study should not be affected by the potential weaknesses discovered in this study.

Study limitations

The relatively small number of students is a limitation of this study and also that it has been undertaken in a single context.

Acknowledgement

The authors are most grateful to Ann-Christin Haffling for assistance with the data collection.

Declaration of interest: The authors report no conflicts of interest. The authors alone are responsible for the content and writing of this article.

References

  • Al-Hazimi A, Zaini R, Al-Hyiani A, Hassan N, Gunaid A, Ponnamperuma G, Karunathilake I, Roff S, McAleer S, Davis M. Educational environment in traditional and innovative medical schools: A study in four undergraduate medical schools. Educ Health 2004; 17(2)192–203
  • Bouhaimed M, Thalib L, Doi SAR. Perception of the educational environment by medical students undergoing a curriculum transition in Kuwait. Med Princ Pract 2009; 18(3)204–208
  • Cronbach LJ. Coefficient alpha and the internal structures of tests. Psychometrika 1951; 3: 297–334
  • de Oliveira Filho GR, Edson Vieira J, Schonhorst L. Psychometric properties of the Dundee Ready Educational Environment Measure (DREEM) applied to medical residents. Med Teach 2005; 27(4)343–347
  • Dimoliatis IDK, Vasilaki E, Anastassopoulos JPA, Roff S. Validation of a Greek translation of the Dundee Ready Education Environment measure (DREEM). Educ Health 2010; 23(1), [Epub ahead of print]
  • Edgren G, Haffling AC, Jakobsson U, McAleer S, Danielsen N. Comparing the educational environment (as measured by DREEM) at two different stages of curriculum reform. Med Teach 2010; 32(6)e233–238
  • Ferguson E, Cox T. Exploratory factor analysis: A users’ guide. Int J Sel Assess 1993; 1: 84–94
  • Hair JF, Black B, Babin BJ, Anderson RE, Tatham RL. Multivariate data analysis, 6th. Prentice-Hall, New Jersey 2008
  • Jiffry MTM, McAleer S, Fernando S, Marasinghe RB. Using the DREEM questionnaire to gather baseline information on an evolving medical school in Sri Lanka. Med Teach 2005; 27(4)348–352
  • Mayya SS, Roff S. Students’ perception of educational environment: A comparison of academic achievers and under-achievers at Kasturba Medical College, India. Educ Health 2004; 17(3)280–291
  • McKendree J. Can we create an equivalent educational experience on a two campus medical school?. Med Teach 2009; 31: e202–e205
  • Miles S, Leinster SJ. Medical students’ perceptions of their educational environment: Expected versus actual perceptions. Med Educ 2007; 41(3)265–272
  • Nunnally JC, Bernstein IH. Psychometric theory, 3rd. McGraw-Hill Inc, New York 1994
  • Roff S. The Dundee Ready Educational Environment Measure (DREEM) – A generic instrument for measuring students’ perceptions of undergraduate health professions curricula. Med Teach 2005; 27(4)322–325
  • Roff S, McAleer S, Harden RM, Al-Qahtani M, Ahmed AU, Deza H, Groenen G, Primparyon P. Development and validation of the Dundee Ready Education Environment Measure (DREEM). Med Teach 1997; 19(4)295–9
  • Roff S, McAleer S, Ifere OS, Bhattacharya S. A global diagnostic tool for measuring educational environment: Comparing Nigeria and Nepal. Med Teach 2001; 23(4)378–382
  • Schermelleh-Engel K, Moosbrugger H. Evaluating the fit of structural equation models: Tests of significance and descriptive goodness-of-fit measures. Methods Psychol Res Online 2003; 8: 23–74
  • Till H. Climate studies: Can students’ perceptions of the ideal educational environment be of use for institutional planning and resource utilization?. Med Teach 2005; 27(4)332–337
  • Varma R, Tiyagi E, Gupta JK. Determining the quality of educational climate across multiple undergraduate teaching sites using the DREEM inventory. BMC Med Educ 2005; 5(1)8
  • Wang J, Zang S, Shan T. Dundee Ready Educational Environment Measure psychometric testing with Chinese nursing students. J Adv Nurs 2009; 65(12)2701–2709
  • Whittle S, Whelan B, Murdoch-Eaton DG. DREEM and beyond: Studies of the educational environment as a means for its enhancement. Educ Health 2007; 20(1)7

Appendix

Roman numerals indicate the original subscale of the items.

F1: Learning and motivation

  •  1 I am encouraged to participate in class I

  •  2 The teachers are knowledgeable II

  •  3 There is a good support system for students who get stressed V

  •  7 The teaching is often stimulating I

  • 12 This school is well time-tabled IV

  • 13 The teaching is student centred I

  • 14 I am rarely bored on this course V

  • 16 The teaching is sufficiently concerned to develop my competence I

  • 20 The teaching is well focused I

  • 21 I feel I am being well prepared for my profession III

  • 22 The teaching is sufficiently concerned to develop my confidence I

  • 24 The teaching time is put to good use I

  • 35 I find the experience disappointing IV

  • 40 The teachers are well prepared for their classes II

  • 41 My problem solving skills are being well developed here III

  • 43 The atmosphere motivates me as a learner IV

  • 44 The teaching encourages me to be an active learner I

F2: Communication

  •  6 The teachers are patient with patients II

  • 11 The atmosphere is relaxed during the ward teaching IV

  • 18 The teachers have good communication skills with patients II

  • 29 The teachers are good at providing feedback to students II

  • 30 There are opportunities for me to develop interpersonal skills IV

  • 31 I have learned a lot about empathy in my profession III

  • 32 The teachers provide constructive criticism here II

  • 37 The teachers give clear examples II

F3: Psychosocial situation

  •  4 I am too tired to enjoy this course V

  • 15 I have good friends in this school V

  • 19 My social life is good V

  • 23 The atmosphere is relaxed during lectures IV

  • 28 I seldom feel lonely V

  • 33 I feel comfortable in class socially IV

  • 34 The atmosphere is relaxed during seminars/tutorials IV

  • 42 The enjoyment outweighs the stress of studying medicine IV

  • 45 Much of what I have to learn seems relevant to a career in medicine III

  • 46 My accommodation is pleasant V

F4: Teaching organisation and progression

  •  5 Learning strategies which worked for me before continue to work for me now III

  • 10 I am confident about my passing this year III

  • 26 Last year's work has been a good preparation for this year's work III

  • 27 I am able to memorise all I need III

  • 36 I am able to concentrate well IV

  • 38 I am clear about the learning objectives of the course I

  • 47 Long term learning is emphasised over short term I

F5: Bad teaching

  •  8 The teachers ridicule the students II

  •  9 The teachers are authoritarian II

  • 17 Cheating is a problem in this school IV

  • 25 The teaching over-emphasises factual learning I

  • 39 The teachers get angry in class II

  • 48 The teaching is too teacher-centred I

  • 49 I feel able to ask the questions I want IV

  • 50 The students irritate the teachers II

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.