2,103
Views
28
CrossRef citations to date
0
Altmetric
Research Article

Assessing the learning environment of a faculty: Psychometric validation of the German version of the Dundee Ready Education Environment Measure with students and teachers

, , , , &
Pages e624-e636 | Published online: 24 Oct 2011

Abstract

Aims: The teachers’ perspectives of the educational environment have as yet only been sparsely considered. This study aimed at validating the first German version of the Dundee Ready Education Environment Measure (DREEM) from the points of view of both students and teachers.

Methods: Data from 1119 students and 258 teachers were available for analysis. Psychometric validation included the analysis of homogeneity and discrimination at item level as well as reliability (Cronbach's α), criterion and construct validity at test level. Effect sizes were calculated and the independent samples t-test was used for statistical inference testing of mean differences between two groups.

Results: Item characteristics were satisfactory in both samples. Reliability was high with α = 0.92 (students) and 0.94 (teachers), respectively. Factor analyses revealed five dimensions which slightly diverged from the five subscales postulated by the DREEM authors though. The environment was evaluated significantly (p < 0.001) more positively by teachers (M = 117.63) than by students (M = 109.75). Further significant differences were observed with regard to gender, mother language, stage of studies and previous professional training among others.

Conclusions: With convincing psychometric properties at item and test levels, the suitability of DREEM not only for students but also for teachers to assess the educational environment has been demonstrated.

Introduction

Knowledge about the teaching and learning environment is an important prerequisite for the successful conception and implementation of a new curriculum (Bouhaimed et al. Citation2009) and should become a part of educational institutions’ good practices (Soemantri et al. Citation2010). Although the educational environment is rather a fuzzy construct which makes a binding definition difficult (Genn & Harden Citation1986), it can be deduced from the numerous attempts at definition that here not only external parts in the sense of an objective environment but also internal parts in the sense of a personality trait are important. The individual perception of one and the same objective fact is known to vary widely (Dornic & Ekehammar Citation1990). Furthermore, the individual perception of the learning environment by the student has a significant impact on the educational results realised in this learning climate: Desirable learning outcomes and gains are positively associated with favourable aspects of socio-psychological learning environments, i.e. the better the environment, the better the results (Haertel et al. Citation1981). Besides the performance behaviour, the student's gender also correlates with his/her perception of the environment (Chaput de Santonge & Dunn Citation2001): Some authors found female students to assess their learning environment more negatively than male students (Mayya & Roff Citation2004), others, however, found the opposite results, i.e. female students evaluating the climate more positively than their male counterparts (Bassaw et al. Citation2003). Apparently, female and male students often experience their learning environments differently (Roff et al. Citation2001) which is, in light of an increasing proportion of female students not only in Germany but also worldwide, of interest for the traditionally male-dominated medical teaching facilities (Genn Citation2001b).

The teachers’ perspectives of the educational environment have as yet only been sparsely considered in spite of the many curriculum reforms that have been initiated worldwide (Fraser Citation1986; Genn Citation2001a, Citation2001b; Miles & Leinster Citation2009). This is even more remarkable since curriculum changes represent a challenge for the teachers and, just for this group of people, an evaluation appears to be particularly important (Till Citation2005).

In German (medical) educational research, the educational environment has received only little attention and there is a lack of suitable measurement tools in the German language. An international, widely used and validated measurement tool that has been translated into several other languages is the Dundee Ready Education Environment Measure (DREEM; Roff et al. Citation1997; cf. Soemantri et al. Citation2010). The test construction included a Delphi panel of almost 100 international health-care educators and the combination of quantitative with qualitative test construction techniques. The questionnaire was thus conceived as a non-culturally specific instrument for assessing the teaching and learning environment in the perception of students of the health-care professions (Roff Citation2005). Up to now, it has been used only once to compare staff and student perceptions of the learning environment at a given medical school. To explore potential perceptional discrepancies between these two groups, Miles and Leinster (Citation2009) administered DREEM to students and – in a reworded version – to teachers who were thereby able to complete the questionnaire with their opinion about the student experience. Although the overall results were closely aligned, there were significant differences at subscale level, i.e. teachers perceived themselves and learning more positively than students did, whereas students’ social perception of themselves was more positive than in the teachers’ view. In addition, the results demonstrated staff's unfamiliarity with student aspects of the learning environment. In its present form, DREEM consists of 50 items that must be answered using a scale of 0 (“strongly disagree”) to 4 (“strongly agree”); thus, a maximum of 200 points can be achieved. A high number of points is indicative of a good environment. The items are divided into the five subscales: perception of teaching, perception of teachers, academic self-perception, perception of the atmosphere and social self-perception.

The aim of this study is to psychometrically validate DREEM for the first time in the German-speaking area as a standardised questionnaire for assessing the learning environment from the points of view of both students and teachers.

Methods

A German version of DREEM was completed online by 1119 medical students and 258 teachers of the Medical Faculty of the Heinrich-Heine-University (HHU) Duesseldorf. In addition, the students and teachers were asked extensive questions about their demographic background.

First, a German translation of DREEM was prepared using the back-translation method of Brislin (Citation1970) as modified by Jones et al. (Citation2001). This was then translated back by two independent, bilingual experts, discrepancies were discussed and a final version agreed upon. For the preparation of the teachers’ version of the questionnaire, items formulated in the original in the “I” form (e.g. “I am encouraged to participate actively in the classes”) were adapted for teachers according to the procedure of Miles and Leinster (Citation2009; e.g. “The students are encouraged to participate in class”; see Appendix 1).

Medical education in Duesseldorf currently follows a traditional curriculum that is divided into a 2-year, preclinical period (science and basic subjects) and a 4-year clinical period. The first state examination is held at the end of the preclinical period while the second and final state examination is held at the end of the clinical period.

Data collection was carried out as an online procedure at the end of the summer semester 2010 for all students and teachers. Participation was voluntary and the data were stored in anonymous form. Details of demographic variables were collected prior to the questionnaire. Hereby the students were, among others, asked to provide information about their school leaving certificates and their grades in the first state examination.

Altogether, questionnaires from 1119 students (=55.0%) and 258 teachers were analysed. As it was not known who in the faculty was actually really involved in teaching, the questionnaires were first sent to all scientific personnel in the faculty (N = 1294). Staff members who could be safely ignored as not being involved in teaching, such as clerks, technicians or nurses, were omitted from the very beginning. The accompanying letter was addressed specifically to teachers and demographic questions on the topic “teaching activity” were posed. Even if staff members, such as scientific personnel solely involved in research, had begun to complete the questionnaire, they would not have been able to provide pertinent answers to most questions because of their lacking teaching experience and would have therefore been rather likely to cancel the survey. This is why we only evaluated completed surveys and additionally run post hoc plausibility data checks to identify respondents – both students and staff – who had not answered the questions in a meaningful way. Thanks to these precautions, it can thus be safely assumed that the questionnaires we analysed were only completed by personnel actually involved in teaching. Because of these circumstances, however, no statements about the proportion of all teaching staff who participated are possible. contains demographic information on both samples.

Table 1.  Demographic data of the student and the teacher sample

Within the framework of item and test analysis, besides the mean values and the discrimination indices of the items, also item homogeneity as well as the reliability and validity of the entire test were to be examined.

A control of the administration and scoring objectivity was not necessary because the whole questionnaire contained standardised instructions and answering possibilities. Thanks to the interpretation guidelines for this questionnaire by Lai et al. (Citation2009) as well as McAleer and Roff (Citation2001), the interpretation objectivity of DREEM was also high and thus did not need to be examined. Content validity did not require any examination either because DREEM was developed by experts in a Delphi process.

  • Homogeneity: Explorative factor analyses were performed to analyse the homogeneity of the items and the factorial validity of DREEM.

  • Discrimination: Item discrimination was assessed by means of part-whole-corrected discrimination indices.Footnote1

  • Reliability: Examination of the reliability was limited to an analysis of the internal consistency by means of Cronbach's α.

  • Criterion validity: With regard to criterion validity, we investigated both predictively (on the basis of the school leaving grades) and concurrently (on the basis of the grades in the first state examination) whether good students evaluated the environment better than did their weaker counterparts (Pimparyon et al. Citation2000; Sun Citation2003; Mayya & Roff Citation2004; Carmody et al. Citation2009) and whether the perception of the environment became more negative with increasing number of semesters, even when age was held constant (Hutchins Citation1964; Till Citation2004; Zaini Citation2005; Bouhaimed et al. Citation2009; Riquelme et al. Citation2009). Furthermore, possible relationships with mother language and place of residence were studied. Also, an answer to the question of the discrepancy between the evaluation of the environment by the students in comparison to the evaluation by the teachers was sought in an undirected manner, since the few available results on this subject are inconsistent (Sheehan Citation1970; Fraser Citation1986; Miles & Leinster Citation2009) and allow the assumption of a series of difficultly predictable factors. Finally, an evaluation of possible gender differences was undertaken.

  • Construct validity: With regard to construct validity, we wanted to examine whether DREEM, also in its German version, exhibits the five-factorial structure (“perception of teaching”, “perception of teachers”, “academic self-perception”, “perception of atmosphere”, and “social self-perception”) as postulated by the authors (Roff et al. Citation1997) but as yet hardly ever tested (de Oliveira Filho et al. Citation2005; Wang et al. Citation2009).

All data were analysed with the help of the programme Statistical Package for the Social Sciences 17.0 for Windows (2008). Effect sizes were calculated with the freely available programme G*Power 3 (Faul et al. Citation2007). For inference testing of mean differences between two groups (e.g. students vs. teachers), the independent samples t-test was used. Pearson's correlations were calculated to examine relationships. Cohen's (Citation1988) guidelines were used for the interpretation of effect sizes. For the effect size measure d in the t-test, it holds that d ≥ 0.20 = small effect, d ≥ 0.50 = medium effect, d ≥ 0.80 = large effect. The product–moment correlation coefficient r, on the other hand, is already a measure of effect size, it holds that r ≥ 0.10 = small effect, r ≥ 0.30 = medium effect, r ≥ 0.50 = large effect.

Explorative factor analyses were conducted to analyse the homogeneity of the items and the factorial validity of the employed questionnaire. Besides theoretical expectations, the Kaiser–Guttman criterion (Gorsuch Citation1983) and the scree test (Cattell Citation1966) were used as criteria for the extraction of factors.

Results

Item analysis

DREEM item means dispersed sufficiently in both the student and teacher samples. In the student group, the total item mean amounted to M = 2.19 (SD = 0.50), only just above the limit of 2, which in DREEM indicates a region worthy of improvement at item level (Vieira et al. Citation2003; Whittle et al. Citation2007). Altogether, the means for 20 items were less than 2. For the teachers, the mean values were somewhat less dispersed and on the whole somewhat higher with a total mean of M = 2.35 (SD = 0.35). In this group, only six items had a mean value of less than the limit of 2. In both samples, there were no ceiling or floor effects at item level.

With regard to homogeneity, for students all, and for teachers most, of the items loaded highly (>0.30) on the dimensions identified by us by means of factor analyses (Appendix 2). Exceptions in the teachers’ questionnaire were items 24 (“The students are able to memorise all that is needed”), 38 (“The students are able to concentrate well”) and 42 (“Cheating in exams is a problem at this university”), which did not exhibit satisfactory loadings (i.e. >0.30) on any of the identified dimensions.Footnote2 The part-whole-corrected discrimination indices for the students were all positive and amounted on average to M = 0.42 (SD = 0.13); in the teacher sample, the discrimination indices were on average M = 0.46 (SD = 0.16; see ).

Table 2.  Mean values, standard deviations and part-whole-corrected discrimination indices of DREEM items in the student and the teacher sample.

Table 3.  DREEM total and subscale scores in the student and the teacher sample.

Table 4.  DREEM total and subscale scores in the student sample depending on students’ marks in the school leaving examination and the first state examination.

Table 5.  Association between DREEM total and subscale scores and stage of course in the student sample.

Table 6.  DREEM total and subscale scores in the student sample depending on students’ mother tongue.

Table 7.  Association between DREEM total and subscale scores and distance between respondents’ residence and university (in km) in the student and teacher sample.

Table 8.  DREEM total and subscale scores depending on students’ previous occupational training.

Table 9.  Gender differences in the assessment of teaching and learning environment in the student sample.

Reliability

At the level of the entire test, the results of the reliability analyses using Cronbach's α were similar in both groups, and DREEM proved highly reliable with α = 0.92 (students) and α = 0.94 (teachers), respectively. At subscale level, the coefficients ranged between 0.53 and 0.86 (). In accordance with the interpretation guidelines recommended by Lai et al. (Citation2009) as well as McAleer and Roff (Citation2001), the educational environments as measured by means of DREEM exhibited both for the students (M = 109.75, SD = 21.71) and for the teachers (M = 117.63, SD = 20.80) – albeit only marginally – more positive than negative aspects (100 − 149 points). However, this is far removed from an excellent climate (more than 150 points) and lies in the typical region for traditional curricula (<120; Roff Citation2005).

Criterion validity

For the total DREEM score, the analyses showed significant mean differences between students with a school leaving grade below the median of 1.60 (M = 111.26, SD = 21.06) and those with a school leaving grade above the median of 1.60 (M = 108.74, SD = 22.01), t(1073) = 1.92, p < 0.05 and d = 0.16. With regard to the grade in the first state examination, there were no longer any significant differences between the good and less good students ().

Students in later semesters (clinical part of course) evaluated the environment as being significantly poorer than did their colleagues in earlier semesters (preclinical part of course) on the DREEM subscale “perception of teaching” (preclinical: M = 23.91, SD = 6.52 vs. clinical: M = 22.08, SD = 7.14; t(1109) = 4.42; p < 0.001; d = 0.27). In contrast, the atmosphere in the clinical stage was considered to be significantly better (preclinical: M = 26.56, SD = 5.89 vs. clinical: M = 28.12, SD = 6.07; t(1109) = −4.32; p < 0.001; d = 0.26]. The significant negative correlations between number of semesters and evaluation of the environment (e.g. “perception of teaching”: r = −0.18, p < 0.001) remained significant even after partialling out the factor age which, as expected, correlated positively with the number of semesters (r = 0.47, p < 0.001), but, however, did become weaker (“perception of teaching”: r* = −0.13, p < 0.001; see ).

In the student sample, native speakers tended to evaluate the environment (e.g. DREEM total score: M = 109.04, SD = 21.68) more negatively than non-native speakers (M = 114.95, SD = 21.24), t(1117) = −2.97, p < 0.01 and d = 0.28. The non-native speakers also considered themselves to be significantly more positive in academic self-perception in DREEM (M = 19.19, SD = 3.83) than the native speakers (M = 17.15, SD = 4.49), t(1117) = 5.01, p < 0.001, d = 0.49, and this even though – in the subgroup of students who had already completed the first state examination – the non-native speakers (M = 2.92, SD = 0.71) achieved significantly poorer results than the German native speakers (M = 2.66, SD = 0.74), t(581) = −2.77, p < 0.01, d = 0.36 ().

Students who did not live in Duesseldorf (M = 25.05, SD = 4.94) evaluated the teachers in DREEM significantly better than those students who lived in Duesseldorf (M = 24.11, SD = 5.11), t(1112) = 2.68, p < 0.01, d = 0.18. The further the teachers lived away from the university, the lower was their total score in DREEM (r = −0.13, p < 0.05) and the poorer did they evaluate the atmosphere in DREEM (r = −0.14, p < 0.05) as well as the social self-perception of the students (r = −0.19, p < 0.01; see ).

Students who had completed a professional training programme, e.g. in nursing or geriatric care (DREEM total score: M = 104.67, SD = 22.48), evaluated the environment in all dimensions of DREEM as being significantly poorer than did their fellow students without a completed profession training (M = 111.14, SD = 21.04), t(1098) = −4.11, p < 0.001, d = 0.30 ().

Comparison of student perspectives versus teacher perspectives

On comparison between teachers and students, the overall environment was evaluated significantly more positively by our teachers (DREEM total score: M = 117.63, SD = 20.80) than by our students (M = 109.75, SD = 21.71), t(1375) = −5.30, p < 0.001, d = 0.37. In both groups, however, the DREEM total score was significantly poorer than the average score (weighted according to sample size) of the available international studies with medical students (DREEM total score: M = 121.04) in which DREEM has been used so far (see Ostapczuk et al. (Citation2011) for an overview of all studies); students: t(1118) = –17.40, p < 0.001, d = 0.52, and, respectively, teachers: t(257) = −2.63, p < 0.01, d = 0.16.

Examining the different DREEM subscales in this study, teachers’ assessments were more positive than students’ assessments with regard to the subscales “perception of teaching”, “perception of teachers” and “perception of atmosphere”, i.e. our teachers were more satisfied with themselves as teachers, their teaching and the atmosphere in comparison with their students. Students evaluated only their “social self-perception” more positively than teachers did. On the DREEM subscale “academic self-perception”, there was no significant difference between student and teacher perceptions ().

Gender differences

In the student sample, female students evaluated the educational environment as being significantly more positive in practically all dimensions than did their male counterparts; however, the effects were small (). In contrast, there were absolutely no over-coincidental gender differences among the teachers. In order to clarify whether or not the gender difference among the students was due to a better academic performance of the female students, male and female students were analysed with regard to their school leaving grades and grades in the first state examination. It was found that female students (M = 1.81, SD = 0.55) on average achieved a significantly better school leaving grade than the male students (M = 1.92, SD = 0.59), t(1073) = −2.97, p < 0.01, d = 0.20, but then performed on average significantly more poorly in the first state examination (M = 2.76, SD = 0.50) than their male counterparts (M = 2.54, SD = 0.80), t(581) = 3.44, p < 0.001, d = 0.33.

Construct validity

In order to examine the factorial validity of DREEM, we investigated in both groups whether the five DREEM dimensions postulated by Roff et al. (Citation1997) could be identified by means of an explorative principal components analysis. Since the Kaiser–Meyer–Olkin (KMO) value (Kaiser 1970, 1974) was 0.93 and Bartlett's (Citation1954) test of sphericity significant, χ2(1225) = 18,561.63, p < 0.001, in the student sample, the factor analysis was carried out.Footnote3 After successful factorisation of the 50 DREEM items, there were 10 factors with an eigenvalue > 1; after the analysis of the scree plot and according to expectations, five factors were extracted and subjected to an orthogonal rotation. In total, the five factors explained 41.3% of the variance and exhibited a satisfactory simple structure. However, the five dimensions did not correspond exactly to the five DREEM subscales (Appendix 2): we interpreted them as “teaching and learning”, “self-perception”, “interpersonal relations”, “prevailing social conditions”, and a “method factor”, on which mainly items that were keyed negatively had high loadings. A factor analysis could also be carried out in the teacher sample, KMO = 0.91, Bartlett's test: χ2(1225) = 5066.23, p < 0.001. In analogy to the situation with students, of the original 13 factors with an eigenvalue >1, five components were extracted (variance explanation: 41.4%) and rotated orthogonally. As with the students, the five factors did not fully correspond with the expected subscales: we interpreted them as “teaching and academic perception”, “teachers and atmosphere”, “social perception” and “communication with patients” and, as already the case with the students, a “method factor”. Altogether, some of the extracted factors (e.g. “self-perception” for the students) represented mixed factors of the original scales, whereas others (e.g. “communication with patients” for the teachers) could not be assigned to any of the original DREEM subscales.2

Discussion

In this present investigation, a German version of DREEM (Roff et al. Citation1997) was subjected to a psychometric validation at item and test levels in a sample of students and teachers of human medicine, respectively. We were able to demonstrate convincing psychometric properties at both item and test levels, so that DREEM is now available in German and can be used to assess the teaching and learning environments for both students and teachers. With regard to other published studies on DREEM, ours involves the as-yet largest investigated samples.

The item means of DREEM were in a similar range as those in international studies in which DREEM had previously been employed (1.50–3.40; Whittle et al. Citation2007), whereby the dispersion among the teachers was somewhat lower than among the students. Items with a mean <2 indicate regions in need of improvement (Vieira et al. Citation2003; Whittle et al. Citation2007). In our samples, 20 items for the students and 6 items for the teachers were in this region.

The low discrimination of the DREEM items 24 (“I am able to memorise all I need”) and 48 (“My accommodation is pleasant”) for the teachers is most probably related to the fact that the teachers did not have the necessary information to answer these questions, so that their responses to these items were less predictive in regard to the DREEM total score. Item 42 (“Cheating is a problem at this university”), on the other hand, had already demonstrated a low discrimination in another study employing DREEM (Wang et al. Citation2009). Analyses of item discrimination have only been reported in two studies to date. In the above-mentioned work, the item discrimination indices of 49 items were in an acceptable range 0.27–0.77, solely the mentioned item 42 exhibited a discrimination index of <0.20 (Wang et al. Citation2009). Unfortunately, the authors do not clearly state whether or not they had performed the necessary part-whole corrections of the discrimination index; de Oliveira Filho et al. (Citation2005), on the other hand, reported that the discrimination had been examined, but did not provide any results of these analyses. The homogeneity of our items was, above all, in the student sample very good.

The reliability of DREEM was as yet most frequently tested in the sense of internal consistency by means of Cronbach's α. In the previously published studies, it varied from α = 0.90–0.95 for the entire test and from α = 0.51–0.90 for the five subscales (Mayya & Roff Citation2004; de Oliveira Filho et al. Citation2005; O’Brien et al. Citation2008; Lai et al. Citation2009; Riquelme et al. Citation2009; Wang et al. Citation2009). The retest reliability amounted to rtt = 0.43 for the entire test at a retest interval of 6 months for the original 50-item version. In our study, the reliability indices (internal consistency) in both samples were comparable and very good.

Criterion validation revealed weak relationships between the evaluation of the teaching and learning climates and the school leaving grades: students with higher grades assessed the climate to be somewhat better than did students with lower grades. In contrast, there were practically no significant associations between perception of the climate and performance in the first state examination. These results are in contradiction to those of other studies in which students with good examination results assessed the climate as being better than did students with poorer examination results (Pimparyon et al. Citation2000; Sun Citation2003; Mayya & Roff Citation2004). One reason for our results could be that the learning environment was on the whole rather negatively evaluated in the present sample and that the expected relationship between a good learning environment and good performance would rather develop in a positive atmosphere. Another reason could be the assessment of performance by means of objective and, above all, written theoretical measures (first state examination), for which Carmody et al. (Citation2009) had previously found no relationship with DREEM in contrast to clinical practical performance. However, this was not confirmed by other authors (Pimparyon et al. Citation2000; Sun Citation2003; Mayya & Roff Citation2004). Further clarification of this situation may be provided by planned investigation with a further German-speaking sample in which the environment will be assessed on the whole as better and/or the grades achieved in the final state examination will be available, as in this examination, also clinical practical performance is being tested.

In our study as well, the perceived learning environment is found to be poorer on average with increasing duration of studies. Students in the clinical phase of their education, however, assessed the atmosphere as being better than did their counterparts in the preclinical stage. Above all, teaching in the clinic received a poorer assessment. The differences remained weakened but still significant when age was partialled out of the correlation between number of semesters and perception of the learning environment; this has been investigated for the first time in our analysis of DREEM. Thus, it can be deduced that the perception of an increasing deterioration of the educational environment is not solely due to the circumstances of education but also to personal factors such as, possibly, “becoming older and more critical”. The initial enthusiasm in medical education seems for many students to decrease in the course of studies, also independent of any concrete bad experiences (Miles & Leinster Citation2007). These results agree with those of other investigations in which DREEM was employed. These have shown both in cross-section and longitudinally that the perception of the educational environment becomes poorer in the course of (medical) education (Till Citation2004, Citation2005; Zaini Citation2005; Bouhaimed et al. Citation2009; Riquelme et al. Citation2009) or at best remains the same (Jiffry et al. Citation2005; McKendree Citation2009), and improves only in isolated cases (Pimparyon et al. Citation2000).

In agreement with Miles and Leinster (Citation2007), another interesting result is that students who had already completed training in another area had a more negative perception of climate than their counterparts without such experience. The possibility of drawing (negative or positive) comparisons with external experiences may possibly affect the impression of the educational climate in medical studies, leading to higher (or lower) expectations. These findings might be taken into account in choosing applicants for medical studies or in curriculum development, for example by designing areas of the curriculum suited to the needs of particular groups.

Similar to the experience of Miles and Leinster (Citation2009), our students perceived themselves to be socially more positive than they were perceived by their teachers, whereas in contrast, the teachers evaluated themselves and their teaching more positively than did the students which could be due to a mutual self-serving bias (Lewicki Citation1983). In consequence of the transformation of the questionnaire in a teachers’ version, the question regarding content validity arises, as some of the items may not be really relevant to teachers or the teachers might not be in a position to give pertinent responses (e.g., “The students are encouraged to participate in class” or “The students seldom feel lonely”). In fact, some of our teachers reported difficulties in answering some of these items which might be indicative of lacking content validity. On the other hand, items relating to students’ self-perception can provide us with valuable feedback about how well teachers know the reality of students’ lives, feelings and sorrows (cf. Miles & Leinster Citation2009). The fact that – in contrast to the study of Miles and Leinster (Citation2009) – our teachers assessed both the atmosphere and the entire environment significantly more positively than students did can be interpreted along these lines. Divergent assessments are indicative of the intensity of exchange and communication between these two groups. Thus, the exploration and communication of discrepancies allow both groups to reflect on group immanent self- and other-perception and thereby create chances for the improvement of the learning environment, in particular the student–teacher relationship. The promotion of mentoring programmes (Kalén et al. Citation2010) and the implementation of effective feedback (Archer Citation2010), for example, could contribute to the formation of a true community of teachers and students.

In agreement with other studies (Roff et al. Citation1997; Bassaw et al. Citation2003; de Oliveira Filho et al. Citation2005; Bouhaimed et al. Citation2009), our data confirm that the educational environment is assessed more positively by females in comparison to males, even though the effect is small. Roff (Citation2005) reported in summary that gender differences in the perception of the teaching and learning environment depend on numerous cultural factors and are on the whole smaller when the total satisfaction is high. In contrast, we have detected significant but less relevant differences in spite of a rather lower total satisfaction. Other authors have reported exactly opposing results (Mayya & Roff Citation2004), i.e. female students were significantly less satisfied with the educational environment than male students in Argentina (Roff et al. Citation1997, Citation2001), or no or only minor gender differences were found (Till Citation2005; Miles & Leinster Citation2007; Carmody et al. Citation2009). Cultural differences could be playing a role here. Besides, socio-cultural factors seem to play a role since students whose mother tongue was not German perceived the educational climate more positively according to DREEM. In view of increasing internationalisation of courses, such findings might also be useful in developing a curriculum suited to the individual needs of particular groups.

In view of the large number of international studies in which DREEM was employed, its factorial validity as one facet of construct validity has previously only been rarely investigated. The five subscales described above are based on an explorative factor analysis by Roff et al. (Citation1997) of an Argentinean precursor of DREEM with 58 items. Replications of the five-factorial structure were only partially successful: not only de Oliveira Filho et al. (Citation2005) but also Wang et al. (Citation2009) reported that, although they also found five factors by means of explorative analysis of the 50 DREEM items, these were not absolutely identical with the original five factors. Finally, O’Brien et al. (Citation2008) also found five DREEM factors, although their results cannot be directly compared with the above-mentioned results since the shortened 32-item version of DREEM was used.

In this study, we also could not unambiguously identify the five subscales postulated by the authors. In our investigation, both in the student and the teacher samples, there was one factor that is to be considered as a methodological artefact on which mainly negatively keyed items loaded highly. Such effects occur frequently when a questionnaire contains both positively and negatively keyed items (Paulhus Citation1984; Paulhus & Reid Citation1991) and often can only be avoided by one of the following means: (1) exclusive use of positively keyed items and (2) balanced use of equal numbers of positively and negatively keyed items per scale/factor. The first solution option can be criticised in that it makes a questionnaire susceptible to acquiescence effects, i.e. when for all items, a “yes” or “I agree” answer is associated with a high manifestation of the characteristic in question or general agreement, after a certain time, the participants answer more and more frequently and automatically with “yes” or “I agree” (Hinz et al. Citation2007). On the other hand, the second solution can only be realised when one constructs the questionnaire oneself and not − as is the case in this investigation with DREEM − when one translates the original into another language.

Because a direct comparison with the two other studies (de Oliveira Filho et al. Citation2005; Wang et al. Citation2009) in which the five factors could also not be replicated is not possible due to insufficient details, it is not at present reasonable or it would be premature to discard the five original scales. Instead, future research efforts should be directed at verifying the factor structure in various languages and cultural environments.

With convincing psychometric properties at item and test levels, the suitability of DREEM not only for students but also for teachers to assess the educational environment has been demonstrated and it is now available in German for both groups. Both questionnaires provide the foundation for defining future improvements for faculty development. Repeated measurements will enable the assessment of progress in improving the learning environment. In future German and international investigations, a further-reaching psychometric validation of the questionnaire such as, e.g. an investigation of the factorial structure of DREEM, the relationship between the perception of climate by means of DREEM and academic performance and its retest reliability with use of an adequate retest interval would be of interest.

Declaration of interest: The authors have no conflicts of interests with respect to their authorship or the potential publication of this article. We did not receive financial support for the research and/or the authorship of this article.

Notes

1. Item discrimination is determined by correlating the sum of the respective item score (i) with the total score consisting of the sum of all item scores (t). The resulting correlation coefficient (rit), however, is spuriously high, because the item whose discrimination is to be determined is also part of the total sum. The fewer items a test contains, the larger the resulting bias when calculating item discrimination. To obtain an unbiased item discrimination coefficient, one therefore must correlate the respective item score (e.g. item 30) with the total score minus the contribution of the item in question (i.e. total score based on the sum of items 1–29, without item 30). The resulting coefficient is called a part-whole-corrected item discrimination coefficient (cf. Davis Citation1958).

2. The loading matrix for the factor analysis of the DREEM items in the teacher sample is available on request from the authors.

3. A KMO value >0.60 or even better >0.75 (Kaiser 1970, 1974) as well as a significant Bartlett's (Citation1954) test show that the variables correlate highly enough with each other to be able to perform a meaningful factor analysis.

References

  • Archer JC. State of the science in health professional education: Effective feedback. Med Educ 2010; 44: 101–108
  • Bartlett MS. A note on the multiplying factors for various chi2 approximations. J Royal Stat Soc B 1954; 16: 296–298
  • Bassaw B, Roff S, McAleer S, Roopnarinesingh S, de Lisle J, Teelucksingh S, Gopaul S. Students’ perspectives on the educational environment, Faculty of Medical Sciences, Trinidad. Med Teach 2003; 25: 522–526
  • Bouhaimed M, Thalib L, Doi SAR. Perception of the educational environment by medical students undergoing a curricular transition in Kuwait. Med Princ Pract 2009; 18: 204–208
  • Brislin RW. Back-translation for cross-cultural research. J Cross Cult Psychol 1970; 1: 185–216
  • Carmody DF, Jacques A, Denz-Penhey H, Puddey I, Newnham JP. Perceptions by medical students of their educational environment for obstetrics and gynaecology in metropolitan and rural teaching sites. Med Teach 2009; 31: 596–602
  • Cattell RB. The scree test for number of factors. Multivar Behav Res 1966; 1: 245–276
  • Chaput de Santonge DMC, Dunn DM. Gender and achievement in clinical medical students: A path analysis. Med Educ 2001; 35: 1024–1033
  • Cohen J. Statistical power analysis for the behavioral sciences, 2nd. Lawrence Erlbaum, Hove 1988
  • Davis FB. A note on part-whole correlation. J Educ Psychol 1958; 49: 77–79
  • de Oliveira Filho GR, Vieira JE, Schonhorst L. Psychometric properties of the Dundee Ready Educational Environment Measure (DREEM) applied to medical residents. Med Teach 2005; 27: 343–347
  • Dornic S, Ekehammar B. Extraversion, neuroticism and noise sensitivity. Pers Ind Diff 1990; 11: 989–992
  • Faul F, Erdfelder E, Lang AG, Buchner A. G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav Res Meth 2007; 39: 175–191
  • Fraser BJ. Differences between students’ and instructors’ perceptions of actual and preferred classroom environment in higher education. High Educ Res Dev 1986; 5: 191–199
  • Genn J. AMEE Medical Education Guide No. 23 (Part 1): Curriculum, environment, climate, quality and change in medical education – a unifying perspective. Med Teach 2001a; 23: 337–344
  • Genn J. AMEE Medical Education Guide No. 23 (Part 2): Curriculum, environment, climate, quality and change in medical education – a unifying perspective. Med Teach 2001b; 23: 445–454
  • Genn J, Harden RM. What is medical education here really like? Suggestions for action research studies of climates of medical education environments. Med Teach 1986; 8: 111–124
  • Gorsuch RL. Factor analysis, 2nd. Erlbaum, Hillsdale 1983
  • Haertel GD, Walberg HJ, Haertel EH. Socio-psychological environments and learning: A quantitative synthesis. Br Educ Res J 1981; 7: 27–36
  • Hinz A, Michalski D, Schwarz R, Herzberg PY. The acquiescence effect in responding to a questionnaire. Psychosoc Med 2007; 4: 1–9
  • Hutchins EB. The AAMC Longitudinal Study: Implications for medical education. Acad Med 1964; 39: 265–277
  • Jiffry MTM, McAleer S, Fernando S, Marasinghe RB. Using the DREEM questionnaire to gather baseline information on an evolving medical school in Sri Lanka. Med Teach 2005; 27: 348–352
  • Jones PS, Lee JW, Phillips LR, Zhang XE, Jaceldo KB. An adaptation of Brislin's translation model for cross-cultural research. Nurs Res 2001; 50: 300–304
  • Kaiser HF. A second generation little jiffy. Psychometrika 1970; 35: 401–415
  • Kaiser HF. An index of factorial simplicity. Psychometrika 1974; 39: 31–36
  • Kalén S, Stenfors-Hayes T, Hylin U, Forsberg Larm M, Hindbeck H, Ponzer S. Mentoring medical students during clinical courses: A way to enhance professional development. Med Teach 2010; 32: e315–e321
  • Lai N, Nalliah S, Jutti RC, Hla Y, Lim VKE. The educational environment and self-perceived clinical competence of senior medical students in a Malaysian medical school. Educ Health 2009; 22: 148
  • Lewicki P. Self-image bias in person perception. J Pers Soc Psychol 1983; 45: 384–393
  • Mayya SS, Roff S. Students’ perceptions of educational environment: A comparison of academic achievers and under-achievers at Kasturba Medical College, India. Educ Health 2004; 17: 280–291
  • McAleer S, Roff S. A practical guide to using the Dundee Ready Education Environment Measure (DREEM). Curriculum, environment, climate, quality and change in medical education: A unifying perspective. AMEE Education Guide No. 23, JM Genn. Association for Medical Education in Europe, Dundee 2001; 29–33
  • McKendree J. Can we create an equivalent educational experience on a two campus medical school?. Med Teach 2009; 31: 202–205
  • Miles S, Leinster SJ. Medical students’ perceptions of their educational environment: Expected versus actual perceptions. Med Educ 2007; 41: 265–272
  • Miles S, Leinster SJ. Comparing staff and student perceptions of the student experience at a new medical school. Med Teach 2009; 31: 539–546
  • O’Brien AP, Chan TMF, Cho MAA. Investigating nursing students’ perceptions of the changes in a nursing curriculum by means of the Dundee Ready Education Environment Measure (DREEM) Inventory: Results of a cluster analysis. Int J Nurs Educ Scholarship 2008; 5: 25
  • Ostapczuk MS, Hugger A, de Bruin J, Rotthoff T. 2011. DREEM on, dentists! Students' perceptions of the educational environment in a German dental school as measured by the Dundee Ready Education Environment Measure. Eur J Dent Educ (in press)
  • Paulhus DL. Two-component models of socially desirable responding. J Pers Soc Psychol 1984; 46: 598–609
  • Paulhus DL, Reid DB. Enhancement and denial in socially desirable responding. J Pers Soc Psychol 1991; 60: 307–317
  • Pimparyon P, Roff S, McAleer S, Poonachai B, Pemba S. Educational environment, student approaches to learning and academic achievement in a Thai nursing school. Med Teach 2000; 22: 359–364
  • Riquelme A, Oporto M, Oporto J, Mendez JI, Viviani P, Salech F, Chianale J, Moreno R, Sanchez I. Measuring students’ perceptions of the educational climate of the new curriculum at the Pontificia Universidad Catolica de Chile: Performance of the Spanish translation of the Dundee Ready Education Environment Measure (DREEM). Educ Health 2009; 22: 112
  • Roff S. The Dundee Ready Educational Environment Measure (DREEM) – a generic instrument for measuring students’ perceptions of undergraduate health professions curricula. Med Teach 2005; 27: 322–325
  • Roff S, McAleer S, Harden RM, Al-Qahtani M, Ahmed AU, Deza H, Groenen G, Primparyon P. Development and validation of the Dundee Ready Education Environment Measure (DREEM). Med Teach 1997; 19: 295–299
  • Roff S, McAleer S, Ifere OS, Bhattacharya S. A global diagnostic tool for measuring educational environment: Comparing Nigeria and Nepal. Med Teach 2001; 23: 378–382
  • Sheehan TJ. Medical school climate: Comparative faculty and student views. J Med Educ 1970; 45: 880–887
  • Soemantri D, Herrera C, Riquelme A. Measuring the educational environment in health professions studies: A systematic review. Med Teach 2010; 32: 947–952
  • SPSS 17.0 for Windows. 2008. Users guide. New York: McGraw-Hill.
  • Sun B. Reforming medical curricula in China Medical University. University of Dundee, Masters dissertation, DundeeUK 2003
  • Till H. Identifying the perceived weaknesses of a new curriculum by means of the Dundee Ready Education Environment Measure (DREEM) Inventory. Med Teach 2004; 26: 39–45
  • Till H. Climate studies: Can students’ perceptions of the ideal educational environment be of use for institutional planning and resource utilization?. Med Teach 2005; 27: 332–337
  • Vieira JE, Nunes MDPT, Martins MDA. Directing student response to early patient contact by questionnaire. Med Educ 2003; 37: 119–125
  • Wang J, Zang S, Shan TM. Dundee ready education environment measure: Psychometric testing with Chinese nursing students. J Adv Nurs 2009; 65: 2701–2709
  • Whittle S, Whelan B, Murdoch-Eaton DG. DREEM and beyond; studies of the educational environment as a means for its enhancement. Educ Health 2007; 20: 9
  • Zaini R. The use of DREEM as curriculum need analysis tool. Med Teach 2005; 27: 385

Appendices

Appendix 1. Teachers’ version of the questionnaire.

Appendix 2. Factor analytic loadings of DREEM items in the student sample.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.