56,377
Views
672
CrossRef citations to date
0
Altmetric
Articles

Internet-based vs. face-to-face cognitive behavior therapy for psychiatric and somatic disorders: an updated systematic review and meta-analysis

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Pages 1-18 | Received 23 Aug 2017, Accepted 01 Nov 2017, Published online: 07 Dec 2017

Abstract

During the last two decades, Internet-delivered cognitive behavior therapy (ICBT) has been tested in hundreds of randomized controlled trials, often with promising results. However, the control groups were often waitlisted, care-as-usual or attention control. Hence, little is known about the relative efficacy of ICBT as compared to face-to-face cognitive behavior therapy (CBT). In the present systematic review and meta-analysis, which included 1418 participants, guided ICBT for psychiatric and somatic conditions were directly compared to face-to-face CBT within the same trial. Out of the 2078 articles screened, a total of 20 studies met all inclusion criteria. Results showed a pooled effect size at post-treatment of Hedges g = .05 (95% CI, −.09 to .20), indicating that ICBT and face-to-face treatment produced equivalent overall effects. Study quality did not affect outcomes. While the overall results indicate equivalence, there have been few studies of the individual psychiatric and somatic conditions so far, and for the majority, guided ICBT has not been compared against face-to-face treatment. Thus, more research, preferably with larger sample sizes, is needed to establish the general equivalence of the two treatment formats.

Introduction

The first treatment studies on Internet-delivered cognitive behavior therapy (ICBT) were carried out in the late 1990s (Andersson, Carlbring, & Lindefors, Citation2016). They were designed to mirror face-to-face treatments in terms of content and length. Since then, more than 200 randomized controlled trials have been published, often with promising results indicating that ICBT is clinically effective when compared to controls (Andersson, Carlbring, & Hadjistavropoulos, Citation2017). Diagnoses targeted have typically included anxiety disorders and depression (Arnberg, Linton, Hultcrantz, Heintz, & Jonsson, Citation2014), but there are ample examples of subclinical problems, such as interventions for procrastination (Rozental, Forsell, Svensson, Andersson, & Carlbring, Citation2015) and perfectionism (Shafran et al., Citation2017). ICBT can be successfully adapted to an internet-delivered format for psychiatric and somatic conditions even in children and adolescents (Vigerland et al., Citation2016).

Treatment typically consists of a person first being screened or diagnosed in order to match a specific treatment program to the individual’s unique set of problems. For example, a patient presenting with panic disorder symptoms will receive a dedicated program specifically designed to target that particular problem. However, there are a few transdiagnostic programs in which this matching is less central (Păsărelu, Andersson, Bergman Nordgren, & Dobrean, Citation2017). Following the assessment phase, the patient is often allocated a therapist who, using encrypted asynchronous text messages, introduces the patient to the program and platform (cf. Vlaescu, Alasjö, Miloff, Carlbring, & Andersson, Citation2016). This is usually done by writing that the treatment will last for a predetermined number of weeks and that a new treatment module will be assigned each week. The modules mimic the treatment given in face-to-face therapy and consist of information and exercises (e.g. psychoeducation, thought records, and behavior experiments). Each module ends with essay questions that must be completed and sent to the therapist, who in turn will give personalized feedback on the patient’s progress. While there are variations in how much human interaction is included in the guidance (usually 1–15 min per week), research shows that the guidance of a human therapist is typically beneficial for a patient’s outcome (Baumeister, Reichler, Munzinger, & Lin, Citation2014). However, human guidance can possibly be, to some extent, replaced by smart computer-generated responses and automated personalized feedback (Titov et al., Citation2013).

In addition to ICBT’s short-term effect sizes indicating equivalence to therapist-administered therapy (Andersson & Cuijpers, Citation2009; Andrews, Cuijpers, Craske, McEvoy, & Titov, Citation2010; Cuijpers, van Straten, & Andersson, Citation2008), a few long-term follow-up studies have shown that the ICBT effects are maintained for as long as five years post treatment (Hedman, Furmark, et al., Citation2011).

In spite of the promising results in controlled trials, in which ICBT is often compared with waitlist control groups, an outstanding question has been how well guided ICBT compares against standard manualized face-to-face treatment. This was investigated in a meta-analysis by Andersson, Cuijpers, Carlbring, Riper, and Hedman (Citation2014) that included a total of 13 studies (N = 1053) published up until June 2013. The results showed a pooled effect size at post-treatment of Hedges g = −.01 (95% CI, −.13 to .12), suggesting that ICBT and face-to-face treatment produce equivalent overall effects. However, since the field is moving forward rapidly and a considerable number of new studies have been published, an updated systematic review and meta-analysis is needed. The aim of this study was to reinvestigate the efficacy of ICBT compared to face-to-face cognitive behavior therapy (CBT) for psychiatric and somatic disorders, giving consideration to studies published in the past four years.

Methods

Design and selection of studies

This was an updated systematic review and meta-analysis of original articles investigating the effect of ICBT compared to face-to-face treatment. We used the same methods as in the previously published systematic review (Andersson et al., Citation2014), which meant that, in order to be included, the studies had to (a) compare therapist-guided ICBT to face-to-face treatment using a randomized controlled design; (b) use interventions aimed at the treatment of psychiatric or somatic disorders (and not, for example, prevention or mere psychoeducation); (c) compare treatments that were similar in content in both treatment conditions; (d) investigate a form of ICBT wherein the internet treatment was the main component and not a secondary complement to other therapies; (e) investigate a form of full-length face-to-face treatment; (f) report outcome data from an adult patient sample; (g) report outcomes in terms of assessment of symptoms of the target problem; and (h) be written in English. We included only studies in which there was some therapist contact during the trial.

We calculated effect sizes based on the primary outcome measure at post-treatment in each study. If no primary outcome measure was specified in the original study, a validated measure assessing target symptoms of the clinical problem was used, following the procedures generated by Thomson and Page (Citation2007).

Search methods

To identify studies, systematic searches in PubMed (the MEDLINE database) were conducted using various search terms related to psychiatric and somatic disorders, including “depression,” “panic disorder,” “social phobia,” “social anxiety disorder,” “generalized anxiety disorder,” “obsessive-compulsive disorder,” “post-traumatic stress disorder,” “specific phobia,” “hypochondriasis,” “bulimia,” “tinnitus,” “erectile dysfunction,” “chronic pain,” and “fatigue.” Each of these search terms was combined with each of the following: “Internet,” “computer,” and “computerized.” The search filter “randomized controlled trial” was used. Reference lists in the included studies were checked for potential additional studies. We did not search for unpublished English studies. In addition, studies needed to target an adult population, excluding a study on clinically anxious adolescents aged 12–18 years and their parent (Spence et al., Citation2011) and on treating youth depression and anxiety (Sethi, Citation2013). Finally, since the Internet-based therapy should not be merely a pure self-help book albeit delivered via email, a study on perfectionism was excluded (Egan et al., Citation2014). In the original search (Andersson et al., Citation2014), there were no restrictions regarding publication date but data collection ended in June 2013. This updated search identified additional studies published between July 2013 and February 2017. Hence, this study includes results from both the previous and the new search.

Assessment of quality of the included studies

Each included study was assessed using the quality criteria proposed by the Cochrane Collaboration (Higgins & Green, Citation2011. Available from www.cochrane-handbook.org.). Five dimensions were evaluated: (I) risk of selection bias due to the method for generating the randomization sequence, (II) risk of selection bias in terms of allocation concealment (i.e. due to foreknowledge of the forthcoming allocations), (III) detection bias in terms of blinding of outcome assessors, (IV) attrition bias due to incomplete outcome data, and (V) reporting bias due to selective reporting of results. The criterion for performance bias relating to the masking of participants was not used because that form of masking is not possible in the types of treatments investigated in this review. For each dimension, the statuses of the studies were rated using one of the following response options: “low risk,” “high risk,” or “unclear.” The alternative “unclear” was used when there were no data to assess the quality criterion in the original study. In studies that used self-reporting, criterion III was judged to be not applicable.

Statistical analysis

Data were analyzed using Cochrane Review Manager (RevMan) version 5.1.0 (Higgins & Green, Citation2011. Available from www.cochrane-handbook.org.). In the main meta-analyses, we assessed the effect of ICBT compared to face-to-face treatment using the standardized mean difference at post-treatment (Hedges’ g) as the outcome, meaning that the difference between treatments was divided by the pooled standard deviation. If both intention-to-treat and per-protocol data were presented, the former estimate was used in the meta-analysis. Estimates of treatment effects were conducted using all included studies as well as separately for each clinical disorder (e.g. depression). Potential differences in dropout rates between ICBT and face-to-face treatment were analyzed using meta-analytic logistic regression. All pooled analyses were carried out within a random effects model framework, assuming variation in true effects in the included studies and accounting for the hypothesized distribution of effects (Borenstein, Hedges, Higgins, & Rothstein, Citation2009). Studies were assessed for heterogeneity using χ2 and I2 tests. In addition, forest plots were inspected to assess variation in effects across studies. Sensitivity analyses were conducted to assess whether study quality was related to outcome by comparing studies judged as having a low risk of bias on all five quality criteria dimensions with the studies that did not (i.e. those assessed as “unclear” or “low risk” on at least one quality criterion). Publication bias was investigated using funnel plots. Power calculations were conducted as suggested by Borenstein et al. (Citation2009) and showed that, in order to have a power of 80% to detect a small effect size (d = .3), given an alpha-level of .05, 14 studies, with an average of 25 participants in each treatment arm, were needed.

Results

Studies included in the review

Of the 2078 screened studies, 20 (total N = 1418) met all review criteria and were included in the study (13 previous and 7 new studies). Figure displays the study inclusion process. The 20 studies all investigated ICBT against some form of CBT (individual format, n = 10 or group format, n = 10). In terms of conditions studied, three of the studies targeted social anxiety disorder (Andrews, Davies, & Titov, Citation2011; Botella et al., Citation2010; Hedman, Andersson, et al., Citation2011), three studied panic disorder (Bergström et al., Citation2010; Carlbring et al., Citation2005; Kiropoulos et al., Citation2008), four studied depressive symptoms (Andersson, Hesser, et al., Citation2013; Lappalainen et al., Citation2014; Spek et al., Citation2007; Wagner, Horn, & Maercker, Citation2014), two studied body dissatisfaction (Gollings & Paxton, Citation2006; Paxton, McLean, Gollings, Faulkner, & Wertheim, Citation2007), two studied insomnia (Blom et al., Citation2015; Lancee, van Straten, Morina, Kaldo, & Kamphuis, Citation2016), two studied tinnitus (Jasper et al., Citation2014; Kaldo et al., Citation2008), one studied male sexual dysfunction (Schover et al., Citation2012), one studied spider phobia (Andersson et al., Citation2009), one studied snake phobia (Andersson, Waara, et al., Citation2013), and one studied fibromyalgia (Vallejo, Ortega, Rivera, Comeche, & Vallejo-Slocker, Citation2015). The total number of participants was 731 in ICBT and 687 in face-to-face conditions. The studies were carried out in Australia, Finland, the Netherlands, Spain, Sweden, Switzerland, and the USA. The smallest study had 26 participants, and the largest had 201. Twelve studies recruited participants solely through self-referral while the remainder included participants from clinical samples or used a mix of self-referral and clinical recruitment. All studies were published between the years 2005 and 2016. The characteristics of each study are presented in Table .

Figure 1. Study inclusion process throughout the review.

Figure 1. Study inclusion process throughout the review.

Table 1. Characteristics of the included studies.

Assessment of quality

The results of the quality assessment in terms of random sequence generation, allocation concealment, blinding of outcome assessment, data completion, and reporting bias are presented in Table . When blinding of outcome assessment was included, only three studies were judged as having low risk of bias in all five quality dimensions (Bergström et al., Citation2010; Hedman, Andersson, et al., Citation2011; Kiropoulos et al., Citation2008). If the criterion of blinding of outcome assessors was disregarded in the studies that assessed outcome only through self-reporting, 10 of the 20 studies were judged as having low risk of bias in all quality dimensions. Figure displays the average risk of bias in the included studies. In terms of dropouts, meta-analytic logistic regression showed no significant difference between the two treatment formats (OR = .85; 95% CI, .63–1.15), indicating that dropouts did not systematically favor one treatment over the other.

Figure 2. Forest plot displaying effect sizes of studies comparing internet-based treatment with face-to-face treatment.

Figure 2. Forest plot displaying effect sizes of studies comparing internet-based treatment with face-to-face treatment.

Main findings: ICBT vs. face-to face treatment

All studies

A forest plot presenting the effect sizes (g) of each study as well as the pooled between-group effect size of all studies is presented in Figure . Throughout the results, an effect size estimate (g) below 0 favors ICBT while an effect size above 0 represents larger effects for face-to-face CBT. The pooled between-group effect size at post-treatment across all 20 studies was g = .05 (95% CI, −.09 to .20), showing that ICBT and face-to-face treatment produced equivalent overall effects. Below, results are presented for each clinical disorder separately.

Figure 3. Estimated risk of bias across all included studies.

Note: Studies using self-report only were omitted from assessment of blinding of outcome assessment. Hence, the white space in the bar for detection bias.
Figure 3. Estimated risk of bias across all included studies.

Social anxiety disorder

The pooled between-group effect size in the three studies targeting social anxiety disorder (Andrews et al., Citation2011; Botella et al., Citation2010; Hedman, Andersson, et al., Citation2011) was g = −.16 (95% CI, −.47 to .16) in non-significant favor of ICBT but indicating equivalent effects.

Panic disorder

The pooled between-group effect size in the three studies targeting panic disorder (Bergström et al., Citation2010; Carlbring et al., Citation2005; Kiropoulos et al., Citation2008) was g = .05 (95% CI, −.20 to .30), in line with the notion of equivalent effects.

Depressive symptoms

The pooled between-group effect size in the four studies targeting depressive symptoms (Andersson, Hesser, et al., Citation2013; Lappalainen et al., Citation2014; Spek et al., Citation2007; Wagner, Horn, & Maercker, Citation2014) was g = −.02 (95% CI, −.22 to .19), showing equivalent effects for this condition as well.

Body dissatisfaction

The pooled between-group effect size in the two studies targeting body dissatisfaction (Gollings & Paxton, Citation2006; Paxton et al., Citation2007) was g = .07 (95% CI, −.49 to .62), again showing largely equivalent effects.

Insomnia

The pooled between-group effect size in the two studies targeting body insomnia (Blom et al., Citation2015; Lancee et al., Citation2016) was g = .71 (95% CI, −.18 to 1.60), suggesting a non-significant but larger effect of face-to-face CBT compared to ICBT.

Tinnitus

The pooled effect size in the two studies targeting tinnitus (Jasper et al., Citation2014; Kaldo et al., Citation2008) was g = −.09 (95% CI, −.43 to .25), suggesting no difference between the formats for this condition.

Male sexual dysfunction

The pooled between-group effect size for the study targeting male sexual dysfunction (Schover et al., Citation2012), which used a clinical sample of patients that had been treated for prostate cancer, was g = −.14 (95% CI, −.58 to .29). This represents a small, non-significant effect in favor of ICBT.

Spider phobia

The effect size in the study targeting spider phobia (Andersson et al., Citation2009) was g = .43 (95% CI, −.30 to 1.15) in favor of face-to-face treatment, but given the small size of the study, the finding was not significant.

Snake phobia

The effect size in the study targeting snake phobia (Andersson, Waara, et al., Citation2013) was g = .67 (95% CI, −.17 to 1.42) in favor of face-to-face treatment, but given the small size of the study, the finding was not significant.

Fibromyalgia

The effect size in the study targeting fibromyalgia (Vallejo et al., Citation2015) was g = −006 (95% CI, −068 to 056), indicating equivalent effects.

Sensitivity analysis

In order to estimate whether there was an association between study quality and treatment effects, subgroup analyses were conducted. In the three studies judged to have low risk of bias on all five quality criteria, the pooled effect size (ICBT vs. face-to-face treatment) was g = −.11 (95% CI, −.42 to .21) while the corresponding effect size for the 10 other studies was g = .10 (95% CI, −.06 to .26), suggesting that study quality did not affect the outcomes significantly.

Publication bias

Figure presents a funnel plot relating effect sizes on the primary outcomes of the studies to the standard errors of the estimates. As shown in Figure , effect sizes were evenly distributed around the averaged effect. Of specific interest, the lower right section of the funnel plot includes studies suggesting that there is no major bias of the pooled effect estimate. This is due to small, unpublished studies with results favoring face-to-face treatment.

Figure 4. Funnel plot to assess for publication bias by relating effect sizes of the studies to standard errors.

Abbreviation: SMD, standardized mean difference.
Figure 4. Funnel plot to assess for publication bias by relating effect sizes of the studies to standard errors.

Test of heterogeneity

Tests showed significant heterogeneity ( = 32.91; I2 = 42%; p = .02). This heterogeneity was largely driven by the study of ICBT vs. face-to-face CBT for insomnia by Lancee et al. (Citation2016). If this study was removed from the analysis, I2 dropped from 42 to 0%, and there was no longer significant heterogeneity (= 16.19; p = .58). The pooled effect size across all studies changed marginally from g = .05 (95% CI, −.09 to .20) to g = −.01 (95% CI, −.12 to .10) if this study was removed from the analysis.

Discussion

This updated meta-analysis strengthens the previous finding by Andersson et al. (Citation2014), which indicated equivalence between the two treatment formats. High dropout and non-compliance is sometimes put forward as a major challenge for internet interventions (Christensen, Griffiths, & Farrer, Citation2009). However, this meta-analysis found that dropouts did not systematically favor one treatment format over the other. In fact, the average dropout rate across the studies was 15.7%, which is perfectly in line with a recent meta-analysis on dropouts from individual psychotherapy for major depression (Cooper & Conklin, Citation2015).

While there were no general or specific effects in favor of either ICBT or face-to-face administrated treatment, there were a number of interesting, albeit non-significant, findings in opposite directions, depending on diagnosis. Most evident was the non-significant advantage of face-to-face treatment over ICBT in the two animal phobia trials. A possible reason for this is that the gold-standard face-to-face treatment is highly effective (Ollendick & Davis III, Citation2013) and also solves the problem of acquiring the phobic stimuli, which is needed for successful treatment. In face-to-face sessions, the therapist is the supplier of the snake or spider while an internet intervention only uses text and pictures as stimuli. Perhaps this will change with the advent of virtual reality (Lindner et al., Citation2017). The other notable difference was that, in treatment of social anxiety disorder, the results were in slight favor of ICBT. Perhaps this non-significant result can be understood in light of the possibility that the therapist herself may be a phobic object. Hence, in face-to-face therapy, the patient’s self-focus will be heightened and, thus, his ability to fully concentrate on the therapy might be hampered.

The quality of the studies varied both in terms of rigor and sample size. However, study quality did not affect outcomes significantly, although the number of studies was too small to conclude that with certainty. In addition, there was no major bias of the pooled effect estimate. The present meta-analysis had some strengths, including a consistent outcome across studies regarding the efficacy of ICBT compared to face-to-face CBT and the relatively high quality of the trials included. However, the study also had limitations. First, there was a potential problem with heterogeneity. Since that was driven by a single outlier study on insomnia (Lancee et al., Citation2016) that clearly favored face-to-face therapy (d = 1 .16), we decided to present the results both with and without that study included in the analysis. However, the relative impact of that study on the pooled effect size across all studies was negligible, with g = .05 changing to g = −.01 if it was excluded.

Second, potential negative effects, such as increased or novel symptoms, of the treatments were not assessed at all. It has been suggested that internet interventions have great potential for alleviating emotional distress, promoting mental health, and enhancing well-being, but there could also be negative effects associated with treatment (Rozental et al., Citation2014). Indeed, a meta-analysis found that the conditions of 5.8% of participants involved in ICBT deteriorated (Rozental, Magnusson, Boettcher, Andersson, & Carlbring, Citation2017). That number corresponds well with the 5–10% found in face-to-face treatments (Hansen, Lambert, & Forman, Citation2002) and is much lower than the 17.4% of control group participants in internet-delivered trials (Rozental et al., Citation2017).

Third, the treatment reviewed is based only on CBT, which makes generalizations to other treatment approaches difficult. While the great majority of current treatments are based on cognitive behavioral principles, there are other forms of internet-delivered psychotherapy, such as psychodynamic psychotherapy (Johansson et al., Citation2012), physical exercise (Nyström et al., Citation2017), and radically different forms of ICBT, including attention bias modification (Carlbring et al., Citation2012), problem solving therapy (Warmerdam, van Straten, Twisk, Riper, & Cuijpers, Citation2008), and acceptance and commitment therapy (Ivanova et al., Citation2016). These internet-based intervention programs were not included in the analysis since they typically do not a make direct comparisons with face-to-face psychotherapy (Johansson et al., Citationin press).

Fourth, we have compared ICBT to face-to-face therapy regardless of whether it was delivered in an individual or group setting. It could be argued that group CBT is a suboptimal comparison (Morrison, Citation2001), at least when it comes to patient preferences. To untangle the relative efficacy of individual and group settings, head-to-head comparisons need to be carried out (cf. Sandell et al., Citation2015).

Fifth, we analyzed only the primary outcome measures in the trials; we did not include secondary outcomes. Indeed, the heterogeneity of the clinical conditions included can be viewed as a problem on its own. We cannot, at this stage and with the few studies available for each condition, conclude that ICBT and face-to-face therapy are equally effective on all conditions. This does not necessarily imply that face-to-face therapy is better. For example, there are very few studies on knowledge acquisition following CBT and even fewer on ICBT (Andersson, Carlbring, Furmark, & on behalf of the SOFIE Research Group, Citation2012), and the therapy formats may differ in this regard. In addition, patient characteristics, such as cognitive flexibility (Lindner et al., Citation2016), have not been taken into account. This is potentially important since there are studies suggesting that different predictors of outcome are relevant when comparing face-to-face treatment vs. Internet treatment (Ebert et al., Citation2016).

Sixth, a majority of the studies recruited participants solely through self-referral or using a mixture of self-referral and clinical recruitment. It has been suggested that recruiting through sources that imply more active treatment-seeking behaviors (e.g. Google searches, viewing postings on mental health websites) results in participants with more severe depression and anxiety than those recruited through more passive sources of information (Lindner, Nyström, Hassmén, Andersson, & Carlbring, Citation2015).

Finally, we did not analyze the long-term effects of the treatments due to large differences in follow-up times and since drop-out were handled in dissimilar ways. This is a possible area for future research; in the type of trials included, randomization can be maintained for long time periods (Andersson, Rozental, Shafran, & Carlbring, Citationin press).

The results of our meta-analysis are thought-provoking both from theoretical and practical points of view. In terms of theories about change in psychotherapeutic interventions, the results suggest that the role of a face-to-face therapist may not be as crucial as suggested in the literature (Wampold, Citation2001) for generating large treatment effects. Even if factors such as therapeutic alliance are established in guided ICBT (Andersson, Paxling, et al., Citation2012), they are rarely important for its outcome. Indeed, understanding what makes ICBT work is a challenge for future research as only a few studies to date have investigated mediators of outcome (e.g. Hedman et al., Citation2013; Hesser, Westin, & Andersson, Citation2014; Karyotaki et al., Citation2015).

In conclusion, the aim of this systematic review and meta-analysis was to collect and analyze studies in which ICBT had been directly compared with face-to-face CBT. The findings are clear in that the overall effect for the main outcomes was close to zero, indicating that the two treatment formats are equally effective in treating social anxiety disorder, panic disorder, depressive symptoms, body dissatisfaction, insomnia, tinnitus, male sexual dysfunction, spider phobia, snake phobia, and fibromyalgia.

Disclosure statement

No potential conflict of interest was reported by the authors.

Acknowledgements

We would like to thank the following people for assisting in the assessment of abstracts: Martin Oscarsson, Axel D'Angelo, Ina Wood, Lars Taberman, Mikael Ersson, Sandra Frööjd, and Josefin Garberg.

References