724
Views
25
CrossRef citations to date
0
Altmetric
Articles

Unbiased variable importance for random forests

ORCID Icon
Pages 1413-1425 | Received 06 Nov 2019, Accepted 28 Apr 2020, Published online: 14 May 2020
 

Abstract

The default variable-importance measure in random forests, Gini importance, has been shown to suffer from the bias of the underlying Gini-gain splitting criterion. While the alternative permutation importance is generally accepted as a reliable measure of variable importance, it is also computationally demanding and suffers from other shortcomings. We propose a simple solution to the misleading/untrustworthy Gini importance which can be viewed as an over-fitting problem: we compute the loss reduction on the out-of-bag instead of the in-bag training samples.

Acknowledgments

I gratefully acknowledge the Berlin School of Economics and Law for granting me a research sabbatical without which this work would have been difficult to complete.

Notes

1 p̂in short for p̂inbag.

2 (0): α=1,λ=0, (1): α=1,λ=1, (2): α=0.5,λ=1, (3): α=0.5,λ=0.5.

3 It turns out that PGoob(2) is equivalent to the measure defined in Zhou and Hooker (2019), while a slightly modified version of PGoob(1) leads to the same OOB based MDI score defined in Li et al. (Citation2019) as shown in the Appendix A2.

4 For easier notation we have (i) left the multiplier 2 and (ii) omitted an index for the class membership.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 1,069.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.