20
Views
0
CrossRef citations to date
0
Altmetric
Short Technical Note

Performance Is Not Enough: The Story Told by a Rashomon Quartet

ORCID Icon, ORCID Icon, & ORCID Icon
Received 17 Mar 2023, Accepted 12 Apr 2024, Published online: 07 Jun 2024
 

Abstract

The usual goal of supervised learning is to find the best model, the one that optimizes a particular performance measure. However, what if the explanation provided by this model is completely different from another model and different again from another model despite all having similarly good fit statistics? Is it possible that the equally effective models put the spotlight on different relationships in the data? Inspired by Anscombe’s quartet, this article introduces a Rashomon Quartet, that is a set of four models built on a synthetic dataset which have practically identical predictive performance. However, the visual exploration reveals distinct explanations of the relations in the data. This illustrative example aims to encourage the use of methods for model visualization to compare predictive models beyond their performance.

Acknowledgments

We appreciate the careful reviews provided by two anonymous reviewers, who discussed in great detail the weaknesses of the presented models and the potential for better ways to tell the story of the Rashomon Quartet. The results and plots in the article were produced using these R packages: DALEX (Biecek Citation2018), partykit (Hothorn and Zeileis Citation2015), randomForest (Liaw and Wiener Citation2002), neuralnet (Fritsch, Guenther, and Wright Citation2019), ggplot2 (Wickham Citation2016), GGally (Schloerke et al. Citation2023).

Disclosure Statement

We have no conflicts of interest to disclose.

Notes

1 Linear models are a very versatile family; here we consider only its simplest (default) subset of models with a linear relationship between xi and y.

2 For illustrative purposes, we have limited ourselves to trees with a maximum depth of 3.

3 Consistently obtaining results in neural network fits is still hard, so this fit was ensured by controlling the random number seed and parameterization.

Additional information

Funding

The first three authors contributions were financially supported by SONATA BIS grant 2019/34/E/ST6/00052 from the Polish National Science Centre (NCN).

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 180.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.