433
Views
4
CrossRef citations to date
0
Altmetric
Articles

Concept Drift Monitoring and Diagnostics of Supervised Learning Models via Score Vectors

ORCID Icon, ORCID Icon &
Pages 137-149 | Received 18 Jul 2021, Accepted 28 Jul 2022, Published online: 31 Oct 2022
 

Abstract

Supervised learning models are one of the most fundamental classes of models. Viewing supervised learning from a probabilistic perspective, the set of training data to which the model is fitted is usually assumed to follow a stationary distribution. However, this stationarity assumption is often violated in a phenomenon called concept drift, which refers to changes over time in the predictive relationship between covariates X and a response variable Y and can render trained models suboptimal or obsolete. We develop a comprehensive and computationally efficient framework for detecting, monitoring, and diagnosing concept drift. Specifically, we monitor the Fisher score vector, defined as the gradient of the log-likelihood for the fitted model, using a form of multivariate exponentially weighted moving average, which monitors for general changes in the mean of a random vector. In spite of the substantial performance advantages that we demonstrate over popular error-based methods, a score-based approach has not been previously considered for concept drift monitoring. Advantages of the proposed score-based framework include applicability to broad classes of parametric models, more powerful detection of changes as shown in theory and experiments, and inherent diagnostic capabilities for helping to identify the nature of the changes.

Supplementary Materials

In the online supplementary materials of this article, we provide a file containing derivations of Fisher Decoupling EquationEquations (10) and Equation(11), examples demonstrating effectiveness of our methods, simulation study results, and a discussion on potential extensions of our methods. Additionally, the zip file contains python code and data necessary to reproduce some results presented in this article.

Additional information

Funding

This work was funded in part by the Air Force Office of Scientific Research Grant (#FA9550-18-1-0381), which we gratefully acknowledge. This work was also supported by a startup grant (#DMR200021) from the Extreme Science and Engineering Discovery Environment (XSEDE) (Towns et al. Citation2014), which is supported by the National Science Foundation (#ACI-1548562). We also acknowledge the many helpful comments from the Editor (Dr. Roshan Joseph) and the anonymous Associate Editor and Referees.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 97.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.