301
Views
1
CrossRef citations to date
0
Altmetric
Methodological Studies

Student Log-Data from a Randomized Evaluation of Educational Technology: A Causal Case Study

ORCID Icon & ORCID Icon
Pages 241-269 | Received 03 Aug 2018, Accepted 01 Sep 2020, Published online: 21 Dec 2020
 

Abstract

Randomized evaluations of educational technology produce log data as a bi-product: highly granular data on student and teacher usage. These datasets could shed light on causal mechanisms, effect heterogeneity, or optimal use. However, there are methodological challenges: implementation is not randomized and is only defined for the treatment group, and log datasets have a complex structure. This article discusses three approaches to help surmount these issues. One approach uses data from the treatment group to estimate the effect of usage on outcomes in an observational study. Another, causal mediation analysis, estimates the role of usage in driving the overall effect. Finally, principal stratification estimates overall effects for groups of students with the same “potential” usage. We analyze hint data from an evaluation of the Cognitive Tutor Algebra I curriculum using these three approaches, with possibly conflicting results: the observational study and mediation analysis suggest that hints reduce posttest scores, while principal stratification finds that treatment effects may be correlated with higher rates of hint requests. We discuss these mixed conclusions and give broader methodological recommendations.

Notes

1 This subsection draws heavily on comments from an anonymous reviewer.

2 Technically, if h is the event that a student requests a hint on a problem and e is the event that the student makes an error, then Pr(h|h or e)=1/{1+Pr(e and not h)/Pr(h)}.

3 The principal stratification model was re-run without dropping sections, and after dropping sections worked by fewer than 500 students, with similar results.

4 Including these students in a principal stratification model is straightforward (Sales & Pane, Citation2019a). Including subjects with missing log data in a mediation or observational study design can be more problematic (see, e.g. Li & Zhou, Citation2017).

5 This is equivalent to the “controlled direct effect,” CDE(0) e.g. VanderWeele (Citation2015, p. 57).

Additional information

Funding

This project was supported by National Science Foundation [1420374].

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 302.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.