13,013
Views
176
CrossRef citations to date
0
Altmetric
Web Paper BEME Guide

The effectiveness of team-based learning on learning outcomes in health professions education: BEME Guide No. 30

, , , &
Pages e1608-e1624 | Published online: 18 Nov 2013

Abstract

Background: Team-Based Learning (TBL) is a student-centred active learning method, requiring less faculty time than other active learning methods. While TBL may have pedagogical value, individual studies present inconsistent findings. The aim of this systematic review was to assess the effectiveness of TBL on improving learning outcomes in health professions education.

Methods: A peer-reviewed systematic review protocol was registered with the Best Evidence in Medical Education (BEME) organization. After comprehensive literature searching, title and full-text review were completed by two independent reviewers. Included studies assessed TBL and a valid comparator in health professions. Included studies were assessed for methodological quality by two independent reviewers. Studies were categorised by outcomes using the Kirkpatrick framework.

Results: Of 330 screened titles, 14 were included. Seven studies reported significant increase in knowledge scores for the TBL group, four reported no difference and three showed improvement but did not comment on statistical significance. Only one study reported significant improvement in learner reaction for the TBL group while another study reported a significant difference favouring the comparator.

Conclusions: Despite improvement in knowledge scores, there was mixed learner reaction. This may reflect the increased demands on learners in this student-centred teaching strategy, although further study is needed.

Introduction

With increasing enrolment in health professions programmes, there is a growing interest in active learning strategies due to the belief that active learning results in enhanced knowledge retention and skills application (Forsetlund et al. Citation2009). While traditional lecture-based learning can be successful in teaching students to recall information, health professions students must also be able to think critically and apply their knowledge in novel situations (Parmelee & Michaelsen Citation2010).

Several active learning methods have been investigated in the past decades. Problem-based learning (PBL) was one of the first to be implemented in health professions education, starting with McMaster University in 1969 and surviving many curricular revisions since then (Koles et al. Citation2010). Other active learning strategies currently being used include case-based group discussion, workshops and audience response systems. While these methods have proven to be effective in increasing student engagement, they can also be extremely resource-intensive. With medical education facing a crisis in faculty time for teaching, alternative methods that do not require high faculty to student ratios are being sought.

Team-based learning (TBL) attempts to balance the issues of active learning and faculty teaching time (Searle et al. Citation2003). By breaking up a large lecture hall of 100 or more students into small groups, TBL gets students actively learning while only requiring one faculty to facilitate (Koles et al. Citation2005). TBL was developed by Dr. Larry Michaelsen in a business curriculum in the 1970s. Michaelsen assigned the students into teams of 5–7 and informed them that he would not lecture but instead oversee their discussions, ensure that they covered the course content and create challenging problems to which they would apply their knowledge (Michaelsen et al. 2008). Michaelsen refined the process of TBL over the years to characterise it with three main phases: (1) advanced preparation by the students, (2) individual and group readiness assurance and (3) application, including discussion and analysis with the entire class (Koles et al. Citation2010). The permanence of teams, immediate feedback and a meaningful peer evaluation process are additional cornerstones of this learner-centred educational strategy (Michaelsen & Parmelee, personal communication, 2011). After consulting with Dr. Larry Michaelsen, founder of TBL, and Dr. Dean Parmelee, a second key expert in the field, we have used this model of TBL in our inclusion criteria because we feel this definition captures the essence of TBL most holistically. We acknowledge that other TBL models exist (e.g. hybrid models), but excluded studies of these models on the basis that their outcomes could not be a true evaluation of TBL if only certain elements of TBL were used. As this systematic review protocol was registered in 2011, subsequent published definitions were not included in this review.

The first reported implementation of TBL in health professions education was at the Baylor College of Medicine in 2001 (Haidet et al. Citation2002). Within one year, 10 medical institutions in the United States piloted TBL (Searle et al. Citation2003). Currently, TBL is being used at schools of medicine, nursing, dentistry, pharmacy, residency programs and continuing medical education in various jurisdictions, including Japan, Korea, Singapore and the Middle East (Parmelee & Michaelsen Citation2010).

While the primary literature on TBL describes some advantages in particular settings, no systematic review has been done assessing the effectiveness of TBL on learning outcomes in health professions education. Some non-systematic reviews explore how to best implement TBL, personal experiences with TBL and TBL combined with e-learning (Michaelsen et al. 2008; Davidson Citation2009; Parmelee & Michaelsen Citation2010). However, these reviews address one aspect of a TBL experience rather than providing a holistic view of whether TBL serves to improve learning outcomes.

A more thorough review has been done on the effectiveness of several active learning strategies; however, this review was specific to engineering education (Prince Citation2004). The review included studies on collaborative learning, cooperative learning and TBL. The author concluded that support can be found for all forms of active learning and provided some evidence to promote team-based, collaborative learning environments (Prince Citation2004). Health professions curriculum planners do not have a resource that evaluates whether or not health professionals’ learning outcomes improve with TBL. There is a demand for such a resource due to the increasing use of TBL in health professions education. Some of the specific features of clinical teaching are unique to health professions education. For example, health professions students require preparation for intense workplace-based learning which is often in a high stakes setting that requires attention to multiple domains of competency such as team work and collaboration (Amin & Eng Citation2009). In addition, elements like clinical reasoning, patient-based teaching and tripartite interaction between students, teachers and patients set health professions education apart from other educational disciplines (Amin & Eng Citation2009).

Despite the increasingly wide use of TBL in health professions education, no systematic review has been done to rigorously assess the effectiveness of this teaching strategy. We conducted a systematic review to examine the totality of evidence on the effectiveness of TBL in the distinct setting of health professions education. A synthesis of the evidence can provide direction for those in positions of curriculum design and resource allocation.

Research question

The research question for this systematic review was: “Is TBL effective in improving learning outcomes in health professions education?” For the purpose of this review, TBL was defined as above to limit heterogeneity between TBL models included in the review.

Methods

Search strategy

A comprehensive search strategy was developed by a health science librarian (SC) in consultation with the co-authors. We identified relevant studies from the electronic databases listed in . Two search strategies were used depending on whether the database in question was health related or not. This was done to ensure the identification of all relevant studies. The specific terms and search strategies can be found for health-related databases in and general databases in . In addition, the reference lists of all included studies were hand searched, as were those of relevant reviews that were identified during the title screening procedure described below. We also hand-searched conference proceedings for the Association of American Medical Colleges, the Association of Medical Education in Europe, the Canadian Conference of Medical Education and the Team-based Learning Collaborative from 2009 to 2011. A separate cited reference search was conducted using Web of Science and SCOPUS for each included study to identify papers where it had been cited. The primary authors of all included studies were contacted through email to determine if they knew of any unpublished, recently published or ongoing studies relevant to the review.

Table 1.  Included online databases

Table 2.  Search terms and strategy for health-related databases

Table 3.  Search terms and strategy for general databases

Screening and selection of studies

The titles and abstracts generated from the electronic database searches were collated in a RefWorks reference management database. They were then screened by two reviewers (AO and MF) to exclude those that obviously did not meet the inclusion criteria or address the question under study (). For any abstracts that did not provide enough information on the study to determine if they meet our inclusion criteria, the primary authors of the study were contacted twice for further information, after which the study was excluded if we did not receive clarification. The full texts of the remaining studies were retrieved and a pre-approved inclusion form was applied independently by two reviewers (AO and MF) to each to identify relevant studies. Disagreements were resolved through discussion, or with the aid of a third reviewer (LH) as required.

Table 4.  Inclusion and exclusion criteria applied to potentially relevant studies to determine suitability for systematic review purposes

Assessment of methodological quality

The methodological quality of included studies was evaluated independently by two reviewers (TH and MF) using the Cochrane Risk of Bias tool for controlled trials (Higgins & Green Citation2009) and the Newcastle–Ottawa Scale (NOS) for cohort studies (Wells et al. Citation2000). These tools are commonly used in systematic reviews (Wells et al. Citation2000; Hartling et al. Citation2012) to assess the methodological quality of primary studies and are recommended by The Cochrane Collaboration (Higgins & Green Citation2009). The Cochrane Risk of Bias tool assesses six general domains that have empirical evidence demonstrating their association with biased estimates of effect: sequence generation, allocation concealment, blinding (participants/personnel, outcome assessment), incomplete outcome data, selective outcome reporting and “other sources of bias” (e.g. baseline imbalances between groups, design specific items for cross-over or cluster trials). Domains are rated as low, unclear or high risk of bias. Overall risk of bias can be assessed for each study: studies are considered low risk of bias if all individual domains are assessed as low, high risk of bias if one domain is assessed as high and unclear otherwise. The Newcastle–Ottawa Scale for cohort studies assesses the selection of participants, comparability of study groups and ascertainment of the outcome of interest. A rating system is used to indicate the overall quality of a study with a maximum assessment of nine points (Wells et al. Citation2000; Hartling et al. Citation2012). Discrepancies were resolved through consensus or with the aid of a third reviewer (LH) as required.

Data extraction

Data were extracted and entered into an electronic data extraction form. The form was developed and piloted in a systematic review performed by the authors (Hartling et al. Citation2010), and further revised and tailored to the current review. One reviewer extracted data (MF). To ensure accuracy and consistency a 20% sample of the articles was randomly selected for extraction by a second reviewer (TH). The data extracted by the two reviewers was compared and no significant discrepancies or errors were detected.

Analysis

Our inclusion criteria allowed for a mixture of qualitative and quantitative studies. The data were synthesized with guidance from methods described by Ogawa and Malen (Citation1991) who have developed a method for synthesis based on the exploratory case study method. Thus, evidence was iteratively synthesised by grouping studies by various constructs including by the population involved and nature of the comparison group, and summarised according to the reported outcomes which were grouped by Kirkpatrick level. The results of this synthesis were then described, categorised and summarised in . Evidence tables detailing study characteristics (including population, intervention, comparison, outcomes, design and any modifications to classic TBL) (), results and authors’ conclusions are provided (). Statistical meta-analysis was not performed because of substantial heterogeneity across the intervention type, comparator, study designs and insufficient reporting of data at the study level. Conclusions about the effectiveness of TBL were drawn based on review of results from studies reporting similar outcomes.

Table 5.  Summary of findings

Table 6.  Study characteristics

Table 7.  Main findings of the review

Table 8.  Methodological quality of randomised controlled trials (RCTs) using the Cochrane Collaboration's tool for assessing risk of bias

Results

presents a flow diagram of the study selection process. Three hundred and thirty studies were identified, and 14 were included (). Among the included studies were one RCT (Koles et al. Citation2005), two NRCTs (Thomas & Bowen Citation2011; Willett et al. Citation2011), two prospective cohort studies (Torralba et al. Citation2009; Wiener et al. Citation2009), one retrospective cohort study (Koles et al. Citation2010), one concurrent cohort study (Zingone et al. Citation2010) and seven non-concurrent cohorts (Nieder et al. Citation2005; Levine et al. Citation2004; Letassy et al. Citation2008; Pileggi & O'Neill Citation2008; Mennenga Citation2010; Simaan et al. 2010; Zgheib et al. Citation2010). Of the 14 included studies, 12 took place in the United States, 1 in Lebanon (Zgheib et al. Citation2010) and 1 in Austria (Wiener et al. Citation2009). Thirteen of the studies concerned undergraduate education, including eight in medicine, three in pharmacy (Letassy et al. Citation2008; Conway et al. Citation2010; Zgheib et al. Citation2010), one in dentistry (Pileggi & O'Neill Citation2008) and one in nursing (Mennenga Citation2010). The only study that did not include undergraduate learners assessed internal medicine residents in their first, second and third postgraduate years (Torralba et al. Citation2009). All 14 studies assessed knowledge as a learning outcome, and 7 studies also assessed learner reaction (Koles et al. Citation2005; Levine et al. Citation2004; Letassy et al. Citation2008; Conway et al. Citation2010; Zingone et al. Citation2010; Thomas & Bowen Citation2011; Willett et al. Citation2011). None of the studies evaluated Kirkpatrick outcomes such as skills or improvements for patients. In total, the studies included over 3535 participants (exact numbers are not known as three studies did not report the number of control group participants) of which 1869 students received TBL sessions.

Figure 1. Flow diagram of included studies.

Figure 1. Flow diagram of included studies.

Due to the nature of TBL, all studies assessed knowledge scores for the TBL groups during the teaching session in the form of IRATs and/or GRATS. However, because most alternative teaching methods do not require teachers to assess students’ knowledge during the actual teaching session and therefore do not report this short-term knowledge score, there were no comparative data between TBL and non-TBL groups for this short-term knowledge variable. All included studies provided knowledge scores comparing groups for the course or the semester in which TBL was implemented and so these are the variables analysed in this review.

Methodological quality and risk of bias of included studies

Quality assessment reveals various methodological shortcomings and these are reported by study in and . The three randomised and non-randomised controlled trials were assessed for quality using the Cochrane Risk of Bias tool. Allocation appeared to be appropriately randomised in the RCT (Koles et al. Citation2005), while the two NRCTs allocated students to groups based on last name (Thomas & Bowen Citation2011; Willett et al. Citation2011). None of the trials attempted to blind the students to their allocation or to the study hypothesis. None of the trials seemed to be at risk of selective outcome reporting or other sources of bias.

Table 9.  Methodological quality of prospective cohort studies (PCS) and retrospective cohort studies (RCS) using Newcastle–Ottawa Quality Assessment Scale

For 8 of the 11 cohort studies, the learners were truly representative of the average health-care professions student, and both exposed and non-exposed cohorts were drawn from the same community (Nieder et al. Citation2005; Levine et al. Citation2004; Letassy et al. Citation2008; Pileggi & O'Neill Citation2008; Conway et al. Citation2010; Mennenga Citation2010; Zgheib et al. Citation2010; Zingone et al. Citation2010). One study lost five failing students, leading the reviewers to deem the remaining cohort somewhat, but not truly, representative of the average health professions class. Two cohort studies assessed TBL within a group of self-selected volunteers (Torralba et al. Citation2009; Wiener et al. Citation2009). The majority of studies did not attempt to control for potential confounders between the exposed and non-exposed cohorts with regard to learning aptitudes, histories, etc.; however, one study did analyse scores after taking pre-intervention GPAs into account while another looked at age, health-care experience, previous GPAs and other measures (Mennenga Citation2010; Zingone et al. Citation2010). The results observed among the studies that provided adjusted estimates were inconsistent; therefore, it is uncertain whether unadjusted results from the remaining studies may have over- or underestimated intervention effects.

All studies had a clear definition of the outcome being assessed and collected the outcome data via record linkage for knowledge scores and via student self-reporting for learner reaction. Four studies did not report on completeness of follow-up (Letassy et al. Citation2008; Pileggi & O'Neill Citation2008; Torralba et al. Citation2009; Conway et al. Citation2010). One study appeared to have a loss to follow-up rate of greater than 10% with an unclear explanation of learners lost; however, this was a very large-scale study that still reported a large number of participants despite the loss to follow-up (Wiener et al. Citation2009).

Characteristics of included studies

provides a summary of the interventions, comparators, outcomes measured and main findings of all included studies. Best efforts to limit heterogeneity between the models of TBL were employed. However, for full transparency a column of any minor discrepancies between our definition of TBL and the model implemented in the study are described in . and describe the results of all included studies. The following narrative provides a summary of the findings grouped by outcome.

Knowledge

All 14 studies evaluated knowledge-based learning outcomes in a TBL group compared with a non-TBL group in a health education curriculum, assessing a total of at least 3535 participants; 3 studies were unclear about the number of participants in their historical controls. All studies reported knowledge outcomes and these were measured within the semester or course in which TBL was implemented. Seven studies reported a statistically significant increase (p < 0.05) in knowledge scores for the TBL group compared with a non-TBL group. However, one NRCT (n = 167) reported that after adjusting for pre-intervention knowledge scores between groups, there was no significant difference after TBL implementation (Willett et al. Citation2011). Among the other studies reporting a significant difference were one NRCT (n = 112) (Thomas & Bowen Citation2011), one retrospective cohort (n = 186) (Koles et al. Citation2010), one concurrent cohort (n = 64) (Zingone et al. Citation2010) and three non-concurrent cohorts (n = 306, n = 371, n = unclear) (Nieder et al. Citation2005; Levine et al. Citation2004; Zgheib et al. Citation2010). Two studies (one RCT and one prospective cohort) did not report a significant difference between TBL and comparators overall, but found a significant difference in subgroup analyses (Koles et al. Citation2005; Torralba et al. Citation2009). Koles et al. (n = 83) found significantly improved knowledge retention in students in the lowest academic quartile after experiencing a TBL session compared with CBGD (p = 0.035) (Koles et al. Citation2005). Torralba et al. (n = 121) found a significant improvement in resident in-training exam scores in favour of the TBL groups over the lecture groups within Year 1 residents (p = 0.03), but not Year 2 or 3 residents (Torralba et al. Citation2009).

Four studies reported no statistically significant difference between knowledge scores of the TBL and non-TBL group (p > 0.05). Of this group are the two studies that did find differences in subgroup analyses mentioned above (Koles et al. Citation2005; Torralba et al. Citation2009), and two non-concurrent cohort studies that reported no difference at all (n = unclear, n = 143) (Conway et al. Citation2010; Mennenga Citation2010). These studies included undergraduate studies in pharmacy (Conway et al. Citation2010) and nursing (Mennenga Citation2010).

Three studies did not report on significance testing and did not provide sufficient data for calculations; however, in all three studies the authors concluded that TBL was more effective based on some differences observed between groups (see ) (Letassy et al. Citation2008; Pileggi & O'Neill Citation2008; Wiener et al. Citation2009). Two of these studies were non-concurrent cohort studies (n = 280, n = unclear) (Letassy et al. Citation2008; Pileggi & O'Neill Citation2008) and one was a prospective cohort study (n = 1417) (Wiener et al. Citation2009). The subjects of these studies were undergraduate studies in pharmacy (Letassy et al. Citation2008), dentistry (Pileggi & O'Neill Citation2008) and medicine (Wiener et al. Citation2009).

Comparison group

The choice of comparator did not appear to influence whether knowledge scores favoured TBL or any one alternate teaching method. Eight studies compared TBL to traditional didactic lectures, three of which reported significant improvements in knowledge scores for the TBL group (Nieder et al. Citation2005; Levine et al. Citation2004; Koles et al. Citation2010). Two studies compared TBL to CBGD; one study found statistically significant improvements, while the other found significant improvements only in students in the lowest academic quartile (Zgheib et al. Citation2010; Koles et al. Citation2005). Two trials compared TBL to their own form of SGL and both reported statistically significant results favouring TBL (Thomas & Bowen Citation2011; Willett et al. Citation2011). The remaining two studies used mixed active learning methods and independent study as comparators, which are less standardised learning strategies. While both reported increases in knowledge scores for the TBL group, the latter study did not report results of significance testing (Wiener et al. Citation2009; Zingone et al. Citation2010).

Learner reaction

Seven studies, involving at least 1152 participants (726 of whom received TBL), reported controlled learner reaction scores between a TBL and non-TBL group (one study did not report the number of control group participants). Only one cohort study reported a significant difference (p < 0.05) favouring the TBL group (Levine et al. Citation2004). In one NRCT, students significantly preferred the alternative (SGL) to TBL (p < 0.001) (Willett et al. Citation2011). These two studies demonstrate that students had a positive reaction to TBL despite the more abrupt increase in workload when compared to a traditional lecture; however, when students were comparing to a less structured active learning strategy, such as SGL, they did not react as positively to TBL. Three studies reported non-significant differences, of which one was an RCT (n = 83) (Koles et al. Citation2005), one was an NRCT (n = 112) (Thomas & Bowen Citation2011) and one was a concurrent cohort reporting a trend in learner preference for the alternate teaching strategy (n = 64) (Zingone et al. Citation2010). Two studies did not comment on statistical significance (n = unclear, n = 280): one study favoured TBL and the other favoured the lecture comparator (Conway et al. Citation2010; Letassy et al. Citation2008).

Studies in which a recurrent TBL curriculum was implemented were not found in the literature search, and therefore the authors were not able to determine if learner reaction scores might improve over time as students become familiar with this new learning strategy.

Discussion

This is the first systematic review that we are aware of that examines the effects of TBL in health professions education. Previous reviews of TBL either did not study health education (Prince Citation2004) or were not full systematic reviews (Parmelee & Michaelsen Citation2010). The health professions educational setting represents a truly distinct population with specific needs and resources. This review was rigorous in its inclusion eligibility, especially with regard to what constitutes TBL; we ensured that the studies presented here complied with a definition of TBL that was verified with founders and leaders of the TBL community. By including only controlled studies, this review provides evidence on how TBL compares to traditional teaching strategies.

The purpose of this systematic review was to evaluate the effectiveness of TBL in improving learning outcomes in health professions education. The results show both positive and neutral effects on knowledge scores, while learner reaction towards TBL was mixed. These findings are beneficial to those who seek evidence suggesting the effectiveness of TBL in achieving the same, if not better, knowledge objectives as more traditional methods. However, curriculum planners who do implement TBL are advised to take precautions to mitigate potentially negative learner reactions to this teaching strategy.

Fourteen studies were included in the analysis and half (7) of these studies reported a statistically significant increase (p < 0.05) in knowledge scores favouring TBL (Koles et al. Citation2005; Nieder et al. Citation2005; Levine et al. Citation2004; Zgheib et al. Citation2010; Zingone et al. Citation2010; Thomas & Bowen Citation2011; Willett et al. Citation2011). These seven studies reporting a significant difference (p < 0.05) varied in their study designs. Three studies did not comment on comparisons of statistical significance between groups, despite a trend in knowledge outcomes favouring TBL (Letassy et al. Citation2008; Pileggi & O'Neill Citation2008; Wiener et al. Citation2009); it is possible that a greater proportion of included studies would have favoured TBL had these three studies been reported more thoroughly. The four studies reporting no statistically significant difference between the study groups (p > 0.05) were robust and scored high on quality assessment (see and ).

The learner reaction findings after the implementation of TBL require more careful consideration. Of seven studies reporting a controlled comparison of learner reaction, only one reported a statistically significant difference in learner reaction favouring TBL (Levine et al. 2004), while one study favoured the comparator (Willett et al. Citation2011). However, the Willett et al. results must be interpreted with caution as in this study the traditional lecture continued to be the primary instruction modality and the TBL-associated scores, including the GRAT and peer evaluation, did not contribute to the students’ grade; thus, there may have been less student engagement in the TBL process. Of the four studies that reported both a significant increase in knowledge scores for TBL and also reported controlled learner satisfaction data, only one study reported a significant increase in learner reaction (Levine et al. 2004). We hypothesise that these conflicting knowledge and learner satisfaction results may be due to the strain of increased workload associated with TBL (e.g. required advanced readings and preparation) and a change in the professional culture towards peer assessment and accountability that TBL introduces. These hypotheses are supported by some of the authors’ (AO and TH) recent experiences of TBL implementation in a preclinical medical student course on musculoskeletal medicine at our home university. However, further studies are required to confirm and shed further light for the reasons of poor learner satisfaction.

The results for knowledge outcomes appeared to be consistent regardless of the type of comparator used in the included studies. The majority of comparison groups were comprised of traditional lectures, while CBGD and SGL were also common alternative teaching methods to TBL. There was a suggestion in the data that learner reactions were more positive in favour of TBL when traditional lectures were used as the comparator rather than less structured small group learning strategies, but further study is needed.

The limitations of this review were minimised with regard to the review design by prospectively establishing thorough parameters for database searches, by having two reviewers screen, reconcile and assess the quality of potentially relevant studies, and by contacting authors for any recently published or unpublished data. The major limitations of this review are due to the methodological quality of the included studies. Most of the included studies were cohort designs, and the majority of them were non-concurrent. Further, the authors of these convenience studies often did not report full statistical information or sufficient data to allow for statistical comparisons. Many studies did not control for differences between groups to prevent confounding. In an attempt to explore and better understand the effects of poor study quality, a thorough quality assessment of included studies was performed and reconciled by two reviewers. Although the more robust trials reported data more thoroughly, it is reassuring that their findings were as mixed as those of the cohort studies, indicating that study design was unlikely to be a major source of bias in this review.

A second potential limitation relates to the constraints of the inclusion criteria for this review. One constraint was restricting inclusion to studies that fully complied with the validated definition of TBL that was accepted in the registered protocol for this review. Overall, this resulted in the exclusion of certain robust studies implementing forms of TBL that did not meet the criteria in the protocol, for example, those that omitted a major component such as readiness assurance testing. However, we felt that this strategy prevented dilution of the results by allowing extreme heterogeneity of interventions.

As with most studies in the field of education, there are inherent limitations with regard to asserting one strategy's effectiveness over another. Prince et al. outline the problems on interpreting the literature of active learning (Prince Citation2004). Although we have attempted to address Prince et al.'s noted difficulty in defining exactly what is being studied, there are still challenges in measuring what works (Prince Citation2004). The latter difficulty is particularly relevant in this review, as the outcomes measured in the included studies are limited to learner reaction and knowledge scores. This is somewhat disappointing as this learning strategy is meant to emphasise the application of knowledge, and these higher level application outcomes were not reported in the included studies. We recommend that future studies on TBL look at academic outcomes beyond knowledge retention, such as critical thinking abilities, and non-academic outcomes alike as studies of these outcomes were not found through our searches.

This review included studies investigating undergraduate medical, dental and pharmacy student populations as well as one resident population. Although the heterogeneity of interventions, comparators, designs and outcomes must be taken into account, this review represents the most comprehensive overview of the effects of TBL on health professions education. Most of the included studies took place at the undergraduate level and thus findings of this review are most relevant to these populations. However, the one study of residents by Torralba et al. (Citation2009) is worthy of further examination. This study found a significant improvement in resident in-training exam scores in favour of the TBL groups over the lecture groups for Year 1 residents, but not for Year 2 or 3 residents. We hypothesise that this may be because residents who are further along in their postgraduate training are already exposed to a wide variety of practical applications of their in-class education through their clinical training; whereas Year 1 residents may benefit more from the practical aspects of TBL during their transition to an intensely practical training environment.

In addition to the growing body of evidence suggesting that TBL enables students to achieve knowledge scores as high as, or higher, than traditional teaching strategies, there also exist other potential benefits to TBL not captured in this review. These include a renewed appreciation for group work amongst students, a demand that students take control over their learning and significant reduction in faculty time as compared to many small group learning methods (Searle et al. Citation2003; Parmelee & Michaelsen Citation2010).

Conclusions

This review provides the first comprehensive evaluation of the existing evidence on the efficacy of TBL in health professions education. We anticipate it will be of use to educators considering the implementation of TBL in that it demonstrates the potential for TBL to significantly increase knowledge scores. Further, no studies reported a decrease in scores in the TBL-group and we feel this is reassuring to curriculum planners who are looking for active learning strategies that emphasise the application of previously acquired knowledge and to those who have limited numbers of faculty for small group learning activities.

Learner reaction to TBL generally was not higher than the comparator group, even when students’ knowledge scores increased; that is, despite improved performance, students did not prefer TBL to the alternate teaching method. This may be due to the increased student workload and accountability associated with active learning and peer assessment in TBL. More research is required to better understand this discrepancy.

While several areas in need of further research have been outlined, based on the results of this systematic review we would support more widespread implementation of this learning strategy while cautioning curriculum planners to carefully and prospectively consider how they will mitigate potential difficulties in the learners’ reactions.

Implications for Research

  1. We recommend that future research on TBL specifically focus on academic outcomes beyond knowledge retention, such as critical thinking abilities and application of knowledge, as studies with these outcomes were not found through our searches and are core to the principles underlying this teaching strategy.

  2. We recommend future research to examine in more detail why learner reaction is mixed and generally not positive compared to evident increases in knowledge retention, as these two outcomes often have significant correlation in other learning strategies.

  3. We find that there is a definite need for more robust primary research to be conducted in TBL with thorough and descriptive reporting. We hope this will allow for statistical meta-analyses or full qualitative meta-synthesis to be performed in this field.

  4. Having established some preliminary evidence for the effectiveness of TBL on improving academic learning outcomes, we recommend that future research work towards drawing specific associations on how and why particular elements of TBL are effective in the way they are.

Declaration of interest: The authors report no conflicts of interest. The authors alone are responsible for the content and writing of the article.

References

  • Amin Z, Eng KH. Basics in Medical Education. World Scientific, Hackensack, NJ 2009
  • Conway SE, Johnson JL, Ripley TL. Integration of team-based learning strategies into a cardiovascular module. Am J Pharm Educ 2010; 74(2)35
  • Davidson L. Educational innovation in an undergraduate medical course: Implementation of a blended e-learning, team-based learning model. Queen's University (Canada), Kingston, ON 2009, Available from: http://qspace.library.queensu.ca/bitstream/1974/1667/3/Davidson_L_K_200901_MEd.pdf
  • Forsetlund L, Bjørndal A, Rashidian A, Jamtvedt G, O'brien Mary A, Wolf F, Davis D, Odgaard-Jensen J, Oxman Andrew D. Continuing education meetings and workshops: Effects on professional practice and health care outcomes. John Wiley & Sons, Chichester, UK 2009
  • Haidet P, O'malley KJ, Richards B. An initial experience with “team learning” in medical education. Acad Med 2002; 77: 40–44
  • Hartling L, Hamm M, Milne A, Vandermeer B, Santaguida PL, Ansari M, Tsertsvadze A, Hempel S, Shekelle P, Dryden DM. Validity and inter-rater reliability testing of quality assessment instruments. Agency for Healthcare Research and Quality, Rockville, MD 2012
  • Hartling L, Spooner C, Tjosvold L, Oswald A. Problem-based learning in pre-clinical medical education: 22 years of outcome research. Med Teach 2010; 32(1)28–35
  • Higgins PT, Green S, (Eds.). 2009. Cochrane handbook for systematic reviews of interventions. Available from http://cochrane-handbook.org
  • Kirkpatrick DL, Kirkpatrick JD, 2006. Evaluating training programs: The four levels (3rd ed.). San Francisco, CA: Berrett-Koehler
  • Koles P, Nelson S, Stolfi A, Parmelee D, DeStephen D. Active learning in a Year 2 pathology curriculum. Med Educ 2005; 39: 1045–1055
  • Koles PG, Stolfi A, Borges NJ, Nelson S, Parmelee DX. The impact of team-based learning on medical students' academic performance. Acad Med 2010; 85(11)1739–1745
  • Letassy NA, Fugate SE, Medina MS, Stroup JS, Britton ML. Using team-based learning in an endocrine module taught across two campuses. Am J Pharm Educ 2008; 72(5)103
  • Levine RE, O'Boyle M, Haidet P, Lynn DJ, Stone MM, Wolf DV, Paniagua FA. Transforming a clinical clerkship with team learning. Teach Learn Med 2004; 16: 270–275
  • Mennenga H. Team-based learning: Engagement and accountability with psychometric analysis of a new instrument. University of Nevada, Las Vegas 2010
  • Michaelsen L, Parmelee D, McMahon K, Levine R. Team-based learning for health professions education: A guide to using small groups for improving learning. Stylus Publishing, Sterling, VA 2008
  • Morrison A, Moulton K, Clark M, Polisena J, Fiander M, Mierzwinski-Urban M, et al. 2009. English-language restriction when conducting systematic review-based meta-analyses: Systematic review of publish studies. Canadian Agency for Drugs and Technologies in Health
  • Nieder GL, Parmelee DX, Stolfi A, Hudes PD. Team-based learning in a medical gross anatomy and embryology course. Clin Anat 2005; 18(1)56–63
  • Ogawa RT, Malen B. Towards rigor in reviews of multivocal literatures: Applying the exploratory case study method. Rev Educ Res 1991; 61: 265–286
  • Parmelee DX, Michaelsen LK. Twelve tips for doing effective Team-Based Learning (TBL). Med Teach 2010; 32(2)118–122
  • Pileggi R, O'Neill PN. Team-based learning using an audience response system: An innovative method of teaching diagnosis to undergraduate dental students. J Dental Educ 2008; 72(10)1182–1188
  • Prince M. Does active learning work? A review of the research. J Eng Educ Washington 2004; 93: 223–232
  • Searle NS, Haidet P, Kelly PA, Schneider VF, Seidel CL, Richards BF. Team learning in medical education: Initial experiences at ten institutions. Acad Med 2003; 78(10 Suppl)S55–S58
  • Thomas PA, Bowen CW. A controlled trial of team-based learning in an ambulatory medicine clerkship for medical students. Teach Learn Med 2011; 23(1)31–36
  • Torralba KD, Ben-Ari R, Quismorio FP, Boateng BA. Arthritis and rheumatism conference: American College of Rheumatology/Association of Rheumatology Health Professionals Annual Scientific Meeting, ACR/ARHP 09 Atlanta, GA United States. 2010/11/06 Conference End: 2010/11/11., Conference Start 2009, Conferenc(TRUNCATED), 60, pp. 1356
  • Wells G, Shea B, O'Connell J, Robertson J, Peterson J, Welch V, Losos M, Tugwell P. The NewcastleOttawa Scale (NOS) for assessing the quality of nonrandomised studies in meta-analysis [Abstract]. Beyond the basics, July 3–5; Oxford, 3rd Symposium on systematic reviews 2000
  • Wiener H, Plass H, Marz R. Team-based learning in intensive course format for first-year medical students. Croatian Med J 2009; 50(1)69–76
  • Willett LR, Rosevear GC, Kim S. A trial of team-based versus small-group learning for second-year medical students: Does the size of the small group make a difference?. Teach Learn Med 2011; 23(1)28–30
  • Zgheib NK, Simaan JA, Sabra R. Using team-based learning to teach pharmacology to second year medical students improves student performance. Med Teach 2010; 32(2)130–135
  • Zingone MM, Franks AS, Guirguis AB, George CM, Howard-Thompson A, Heidel RE. Comparing team-based and mixed active-learning methods in an ambulatory care elective course. Am J Pharm Educ 2010; 74(9)160

Glossary

CBGD – Case-based group discussion takes place in small groups and centres around application problems. In health professions education, a team of about 5–15 learners works on a patient case simulating a real-life clinical case.

GRAT – A group readiness assurance test is a multiple-choice quiz taken after an individual readiness assurance test and contains identical questions. It requires discussion and consensus amongst the entire group. Groups are then given immediate feedback after finalizing their answers.

IRAT – An individual readiness assurance test is a multiple-choice quiz administered to ensure that learners have completed and understood the assigned reading. The same test is subsequently administered as a GRAT.

PBL – Problem-based learning is an active learning strategy developed and introduced into health professions education as an alternative to traditional lectures. Learners discuss an evolving case in small groups with the instructor acting as a facilitator. There is an expectation that the group will identify gaps in knowledge related to the case, go away and research around these knowledge gaps and return to apply this information to the case.

SGL – Small group learning is a general form of student-centred learning that takes place in groups of about 5–15 learners.

TBL – Team-based learning is a well-defined active learning strategy developed in the 1970s by Dr. Larry Michaelsen and was implemented in health professions education at the start of the twenty-first century. For the purpose of this review, TBL is defined as a learning strategy that includes the three classic phases: advanced individual preparation, readiness assurance and group application (Parmelee & Michaelsen, Citation2010). To minimize the heterogeneity of studies reporting use of TBL, we used the above definition in this review. We validated the definition with the literature (Michaelsen et al. 2008), with the founder (Larry Michaelsen) and with a second key expert in this field (Dean Parmelee). The validated definition of TBL includes the three phases above while focusing on learner-centred education, individual and group accountability, permanence of teams, immediate feedback and a meaningful peer evaluation process (Michaelsen & Parmelee, personal communication, 2011). The nature of the group assignments must be described in detail and must aim to include full cooperation of the team.

Traditional Lecture – Defined as an instructor lecturing to a group of learners (varying in size) in some didactic presentation format. Learner interaction and group discussion is usually minimal in a traditional lecture.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.