Abstract
As gambling operators become increasingly sophisticated in their analysis of individual gambling behaviour, this study evaluates the potential for using machine learning techniques to identify individuals who used self-exclusion tools out of a sample of 845 online gamblers, based on analysing trends in their gambling behaviour. Being able to identify other gamblers whose behaviour is similar to those who decided to use self-exclusion tools could, for instance, be used to share responsible gaming messages or other information that aids self-aware gambling and reduces the risk of adverse outcomes. However, operators need to understand how accurate models can be and which techniques work well. The purpose of the article is to identify the most accurate technique out of four highly diverse techniques and to discuss how to deal analytically and practically with a rare event like self-exclusion, which was used by fewer than 1% of gamblers in our data-set. We conclude that balanced training data-sets are necessary for creating effective models and that, on our data-set, the most effective method is the random forest technique which achieves an accuracy improvement of 35 percentage points versus baseline estimates.
Acknowledgements
We are grateful to InnovateUK, the ESRC and the EPSRC for part-funding this research. We would also like to thank IGT for supplying the de-identified player data that was primarily used for this study, as well as the Transparency Project (www.thetransparencyproject.org), Division on Addiction, the Cambridge Health Alliance, a teaching affiliate of Harvard Medical School, which provided the second data-set reviewed. We would also like to thank Dr Tillman Weyde and Dr Gregory Slabaugh, both from City University London, for their support in completing this article.
Notes
1. http://www.gamblingcommission.gov.uk/FAQs/Problem-gambling/What-is-self-exclusion.aspx (accessed 4 June 2015; page last reviewed date given as July 2013).
2. The self-excluders used in this study similarly gambled more than 10 sessions in ~95% of months in which they gambled at all, discarding partial months in which they started or stopped. The comparison to the control group is nonetheless inexact and represents a limitation on our analysis: ~5% of self-excluders did gamble fewer than 10 sessions for over a quarter of their total months gambling.