Student Log-Data from a Randomized Evaluation of Educational Technology: A Causal Case Study

Adam C. Salesa University of Texas, Austin, Texas, USACorrespondence[email protected]

https://orcid.org/0000-0003-0416-0610 View further author information

John F. Paneb Education and Labor Division at RAND Corporation, Pittsburgh, Pennsylvania, USA

https://orcid.org/0000-0001-5155-2436 View further author information

Abstract

Randomized evaluations of educational technology produce log data as a bi-product: highly granular data on student and teacher usage. These datasets could shed light on causal mechanisms, effect heterogeneity, or optimal use. However, there are methodological challenges: implementation is not randomized and is only defined for the treatment group, and log datasets have a complex structure. This article discusses three approaches to help surmount these issues. One approach uses data from the treatment group to estimate the effect of usage on outcomes in an observational study. Another, causal mediation analysis, estimates the role of usage in driving the overall effect. Finally, principal stratification estimates overall effects for groups of students with the same “potential” usage. We analyze hint data from an evaluation of the Cognitive Tutor Algebra I curriculum using these three approaches, with possibly conflicting results: the observational study and mediation analysis suggest that hints reduce posttest scores, while principal stratification finds that treatment effects may be correlated with higher rates of hint requests. We discuss these mixed conclusions and give broader methodological recommendations.

Keywords:

Notes

1 This subsection draws heavily on comments from an anonymous reviewer.

2 Technically, if h is the event that a student requests a hint on a problem and e is the event that the student makes an error, then $P r (h | h or e) = 1 / {1 + P r (e and not h) / P r (h)} .$

3 The principal stratification model was re-run without dropping sections, and after dropping sections worked by fewer than 500 students, with similar results.

4 Including these students in a principal stratification model is straightforward (Sales & Pane, Citation2019a). Including subjects with missing log data in a mediation or observational study design can be more problematic (see, e.g. Li & Zhou, Citation2017).

5 This is equivalent to the “controlled direct effect,” CDE(0) e.g. VanderWeele (Citation2015, p. 57).

Additional information

Funding

This project was supported by National Science Foundation [1420374].

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Student Log-Data from a Randomized Evaluation of Educational Technology: A Causal Case Study

Information for

Open access

Opportunities

Help and information

Student Log-Data from a Randomized Evaluation of Educational Technology: A Causal Case Study

Abstract

Notes

Additional information

Funding

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature