Full article: A review and comparison of control charts for ordinal samples

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

Qualitative, more specifically, ordinal data generating processes are common in real-world process control implementations. In this study, a survey of control charts for the sample-based monitoring of independent and identically distributed ordinal data is provided together with critical comparisons of the control statistics, for memory-less Shewhart-type and for memory-utilizing exponentially weighted moving average (EWMA) and cumulative-sum types of control charts. New results and proposals are also provided for process monitoring. Using some real-world quality scenarios from the literature, a simulation study for performance comparisons is conducted, covering sixteen different types of control chart. It is shown that demerit-type charts used in combination with EWMA smoothing generally perform better than the other charts, which may rely on quite sophisticated derivations. A real-world data example for monitoring flashes in electric toothbrush manufacturing is discussed to illustrate the application and interpretation of the control charts in the study.

Keywords:

1. Introduction

Control charts are an important tool of statistical process control (SPC), which are used for monitoring quality-related processes. Most of the SPC literature focuses on the case where the relevant quality characteristics are measured on a quantitative scale, such as real-valued measurements or integer-valued counts (Montgomery Citation2009). Recently, there has been increasing interest in monitoring qualitative data (Weiß Citation2021a). In particular, ordinal data can be found in many SPC applications, i.e., data generated by some random variable X, the range of which consists of finitely many categories exhibiting a natural orderFootnote¹. We denote the range of X as $S = {s_{0}, s_{1}, \dots, s_{d}}$ with some $d \in N = {1, 2, \dots},$ where $s_{0} < s_{1} < \dots < s_{d} .$ The monitoring of ordinal data is not only relevant in manufacturing (e.g., severity of a defect) and the service industry (e.g., perceived quality of service), but also in non-industrial areas such as health surveillance (e.g., surgical performance) as well as climatological (e.g., wind force) and environmental applications (e.g., air quality). In the following, we focus on control charts applied to independent and identically distributed (i.i.d.) samples of univariate ordinal data. Note that there are also a few contributions concerning multivariate ordinal data, ordinal time series, or continuous monitoring of individual ordinal observations (see Weiß (Citation2021a) for references). However, as the majority of publications consider sample-based monitoring of i.i.d. ordinal data and to keep the scope of the present study manageable, we concentrate on univariate ordinal samples, while recommending more studies on the aforementioned topics for future research in Section 6.

In Section 2, we introduce the required concepts and notations for this research. In Section 3, we survey (and complement by some new proposals) important types of control charts for i.i.d. ordinal samples, which are classified in the following groups: Section 3.1 presents control charts, where the monitored statistic is a quadratic form in the sample frequencies, with weights being derived from ordinal principles. In Section 3.2, the monitored statistic is a weighted class count, where the weights are either determined based on a demerit scheme, or they follow from some probabilistic reasoning. In Section 3.3, statistics from ordinal data analysis are monitored, such as ordinal dispersion, and Section 3.4 concludes the survey with some references on further approaches that have been proposed in the literature so far. Section 4 provides an extensive performance analysis, where the main charts from Section 3 are compared to each other regarding their average run length (ARL) properties. There, we consider both medium- and high-quality scenarios, where the considered in-control models are inspired by real applications that have previously been reported in the literature. The application and interpretation of the control charts in practice are illustrated by a real-world data example in Section 5, where a manufacturing process of electric toothbrushes is monitored. Finally, Section 6 concludes the article and outlines directions for future research.

2. Basic concepts and notations

In this research, we assume the tth sample of size n > 1 (one may also consider time-varying sample sizes n_t) to consist of the i.i.d. ordinal random variables $X_{t, 1}, \dots, X_{t, n}$ with range $S,$ and the different samples are assumed to be mutually independent as well. The stochastic properties of $X_{t, i}$ are fully specified by its probability mass function (PMF), $p = (p_{0}, \dots, p_{d}) \in {[0; 1]}^{d + 1}$ with $p_{i} = P (X = s_{i}) .$ In order to account for the natural order among the categories, it is actually more common to consider the cumulative distribution function (CDF) instead (Agresti Citation2010), given by $f = (f_{0}, \dots, f_{d - 1}) \in {[0; 1]}^{d}$ with $f_{i} = P (X \leq s_{i}) .$ Note that f_d is omitted in f as f_d = 1 always holds.

2.1. Basics of process monitoring

We declare the monitored process to be in control as long as it follows the specified in-control distribution, given by $p_{0}$ and $f_{0},$ respectively. If the sample at time $t = τ$ is the first one with distribution being different to $p_{0}, f_{0},$ then τ is said to be a change point, and the process is called out of control for $t \geq τ .$ In SPC, a control chart is applied to the samples ${X_{t, i}}$ by sequentially computing a certain statistic for each sampling time $t = 1, 2, \dots,$ and these statistics are used to decide about the state of the process: if a statistic exceeds the given control limits (CLs), an alarm is triggered and entails an inspection of the process for a possible out-of-control situation. Obviously, we do not wish to intervene in the process if this is in control (false alarm, i.e., if $t < τ$ ), while a true change ( $t \geq τ$ ) should be detected as soon as possible. A common way of evaluating a control chart’s performance with respect to these two opposing poles is to compute its ARL for a given scenario, i.e., the mean time until the first alarm is triggered. The ARL should be large if the process is in control, whereas it should be small if the process is out of control. For these and further basics on SPC and control charts, see the textbook by Montgomery (Citation2009).

The fundamental idea behind an ordinal control chart is to compare (sequentially) the information provided by the tth sample ${X_{t, i}}$ to the in-control model as given by $p_{0}, f_{0} .$ The full information on the sample ${X_{t, i}},$ in turn, is comprised by the corresponding vectors of frequencies and cumulative frequencies, $N_{t} = (N_{t, 0}, \dots, N_{t, d})$ and $C_{t} = (C_{t, 0}, \dots, C_{t, d - 1}),$ respectively, where $N_{t, j}$ expresses the number of $X_{t, i} = s_{j}$ in the sample, and $C_{t, j}$ the number of $X_{t, i} \leq s_{j} .$ If the distribution of ${X_{t, i}}$ is given by $p, f,$ we have $E [N_{t}] = n \cdot p$ and $E [C_{t}] = n \cdot f .$ Thus, the corresponding vectors of relative (cumulative) frequencies, ${\hat{p}}_{t} = \frac{1}{n} N_{t}$ and ${\hat{f}}_{t} = \frac{1}{n} C_{t},$ are unbiased estimators of p and f, respectively. For the control charts considered in this article, the tth statistic is either a function of solely $N_{t},$ say $g (N_{t}),$ then it constitutes a (memory-less) Shewhart chart, or it is a function of the $N_{1}, \dots, N_{t}$ being available up to time t, say $h (N_{1}, \dots, N_{t}),$ then we are concerned with a control chart having an inherent memory. Note that by appropriately modifying g, h, we could equivalently use $C_{t}, {\hat{p}}_{t},$ or ${\hat{f}}_{t}$ instead of $N_{t}$ to express these statistics. Popular examples for constructing memory-type control charts are the cumulative sum (CUSUM) concept dating back to Page (Citation1954), and the exponentially weighted moving-average (EWMA) concept dating back to Roberts (Citation1959). In the ordinal case, a widely-used approach for generating a memory-type chart is to apply EWMA smoothing to the process of frequency vectors, e.g., to ${(N_{t})}_{t \in N} :$ for given smoothing parameter $λ \in (0; 1),$ one computes [2.1] $N_{t}^{(λ)} = λ N_{t} + (1 - λ) N_{t - 1}^{(λ)} for t = 1, 2, \dots, N_{0}^{(λ)} = n p_{0} .$ [2.1]

Then, the monitored statistic is computed as $g (N_{t}^{(λ)})$ for $t = 1, 2, \dots,$ whereas the corresponding Shewhart version monitors $g (N_{t})$ instead, see Li, Tsung, and Zou (Citation2014); Wang, Su, and Xie (Citation2018); Bai and Li (Citation2021) for such examples. Possible adoptions of the CUSUM approach to the case of ordinal data are described later in Section 3.2.

Remark 2.1.1.

The term “ARL” is not uniquely defined in the literature, but there are competing approaches, see Knoth (Citation2006). The default concept is the zero-state ARL, where it is assumed that the possible process change happens with the start of process monitoring, i.e., τ = 1. Other concepts such as the steady-state ARL assume a delayed change point. In this research, we are concerned with independent samples $N_{1}, N_{2}, \dots$ Consequently, if applying a Shewhart-type chart to $N_{1}, N_{2}, \dots,$ then all these ARL concepts coincide. For memory-type control charts, the obtained ARL values usually differ. But, it is widely known that the well-established CUSUM and EWMA charts lead to only modest differences with no practical implications in the independence scenario considered here. Therefore, in what follows, we focus on zero-state ARLs and omit a discussion of further ARL concepts.

2.2. Parametric models for ordinal data

For performance analyses of control charts, it is useful to have parametric models for an ordinal random variable X, which allows for modifications of the relevant distributional properties, such as the location or dispersion of X. There are only few “direct” proposals on parametric distributions for the ordinal X; instead, it is more common to derive an ordinal distribution from either an underlying discrete count distribution, or from a continuous real-valued distribution. A tailor-made proposal for the ordinal X is the “λ-μ-distribution” of (Kvålseth Citation2011a, Section 5), which is defined as [2.2] $p = ((1 - \frac{1}{2} μ) (1 - λ) + \frac{1}{d + 1} λ, \frac{1}{d + 1} λ, \dots, \frac{1}{d + 1} λ, \frac{1}{2} μ (1 - λ) + \frac{1}{d + 1} λ)$ [2.2] for $λ, μ \in [0; 1] .$ It comprises the most relevant extreme scenarios for qualitative random variables:

$μ = λ = 0 :$ the one-point distribution $p = (1, 0, \dots, 0)$ or $f = (1, \dots, 1),$ respectively, i.e., the case of minimal qualitative dispersion (maximal consensus);
λ = 1: the uniform distribution $p = (\frac{1}{d + 1}, \dots, \frac{1}{d + 1})$ or $f = (\frac{1}{d + 1}, \dots, \frac{d}{d + 1}),$ respectively, i.e., the case of maximal nominal dispersion (maximal uncertainty);
μ = 1 and λ = 0: the extreme two-point distribution (polarized distribution) $p = (\frac{1}{2}, 0, \dots, 0, \frac{1}{2})$ or $f = (\frac{1}{2}, \dots, \frac{1}{2}),$ respectively, i.e., the case of maximal ordinal dispersion (maximal dissent).

Also, the “Lehmann approach” (Lehmann Citation1953; Klein and Doll Citation2021) should be mentioned as a possibility for achieving parametric ordinal models: starting from some baseline CDF f, one considers the CDF $f^{α}$ with component-wise exponent $α > 0 .$ But as mentioned before, it is more common to derive a parametric model for the ordinal X from some underlying quantitative model. One such approach was referred to as rank counts by Weiß (Citation2020), where the probabilities in p are assumed to be equal to those of a chosen parametric distribution for bounded counts with range ${0, \dots, d},$ such as a binomial (Bin), beta-binomial (BB), or k-inflated binomial (k-IB) distribution with upper bound d. Thus, the shape of p is controlled by one or two parameters. This can be formalized by writing X = s_I, where the random rank count I follows a discrete bounded-counts distribution. For example, if $I \sim k - IB (d, π, ω)$ with $π, ω \in [0; 1]$ and $k \in {0, \dots, d},$ then this implies that [2.3] $P (X = s_{j}) = ω \cdot 1 (j = k) + (1 - ω) \cdot (\begin{matrix} d \\ j \end{matrix}) π^{j} {(1 - π)}^{d - j},$ [2.3] where $1 (A)$ denotes the indicator function, which equals 1 (0) if A is true (false). Note that (2.3) reduces to the PMF of $Bin (d, π)$ if ω = 0. As an example, if k = 0 (zero inflation), then we get the one-point distribution $p = (1, 0, \dots, 0)$ for ω = 1, while $ω = \frac{1}{2}$ and π = 1 leads to the extreme two-point distribution.

Another approach to derive a parametric model for the ordinal X is the assumption of a real-valued latent variable Q (Agresti Citation2010, p. 11), which follows a specified continuous distribution; let $F_{Q} (x)$ denote the corresponding CDF. Then, threshold parameters $- \infty = η_{- 1} < η_{0} < \dots < η_{d - 1} < η_{d} = + \infty$ have to be specified, and one defines [2.4] $P (X = s_{j}) = F_{Q} (η_{j}) - F_{Q} (η_{j - 1}) for j = 0, \dots, d .$ [2.4]

Popular choices for F_Q are the (standard) logistic (Log) or normal (N) distribution, leading to the ordered logit or probit model, respectively. While Q is typically assumed to exist only virtually (thus “latent”), there are some applications where Q also has a physical meaning, e.g., if grouped data resulting from gauging are used instead of exact measurements, see Steiner, Geyer, and Wesolowsky (Citation1994, Citation1996a,Citation1996b).

3. Control charts for ordinal samples

This section provides a broad survey of existing control charts for i.i.d. ordinal samples, supplemented by a few new results and proposals. Doing this, we identified groups of more general monitoring concepts and classified the control charts into these groups. As the groups are not perfectly disjoint, we shall point out if a control chart could be assigned to another group as well.

3.1. Ordinal quadratic-form statistics

One of the earliest proposals for the monitoring of i.i.d. qualitative samples ${X_{t, i}}$ is a Shewhart-type control chart relying on Pearson’s goodness-of-fit (GoF) statistic [3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1]

The second representation shows that $X_{t}^{2}$ constitutes a particular type of quadratic form in $N_{t}$ or ${\hat{p}}_{t},$ respectively. The idea of monitoring $(X_{t}^{2})$ can be traced back to (Shewhart Citation1931, p. 329), while the first comprehensive discussion appears to be in Duncan (Citation1950). Ordinal applications of [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] are presented by Marcucci (Citation1985); Wardell and Candia (Citation1996); Edwards, Govindaraju, and Lai (Citation2007); Samimi, Aghaie, and Tarokh (Citation2010). Obviously, [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] is not limited to ordinal data, but can be used for qualitative data in general. However, its application to specifically ordinal data can be justified as detailed in Appendix B. There, it is shown that $X_{t}^{2}$ can be equivalently expressed as a quadratic form in the CDF ${\hat{f}}_{t},$ see (B.1), which, as explained in Section 2, accounts for the natural order among the categories through the accumulation. In other words: even if one defines the quadratic-form statistic (B.1) in an explicitly ordinal manner, we end up with the traditional Pearson statistic [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] anyway.

To establish a closer connection to ordinal data, the Pearson statistic [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] can be extended to a weighted quadratic-form statistic, where the weights account for the ordering among the categories. This was proposed by Wang, Su, and Xie (Citation2018), whose average cumulative data (ACD) chart relies on [3.2] ${ACD}_{t} = n^{- 1} (N_{t} - n p_{0}) ^{⊤} V (N_{t} - n p_{0}),$ [3.2] where V denotes a given weight matrix. Obviously, the Pearson statistic [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] follows if choosing $V = diag {(p_{0})}^{- 1} .$ Wang, Su, and Xie (Citation2018) suggest to use $V = L^{⊤} diag (w) L$ with weight vector w, where L has a triangular structure: the lower (upper) triangle is filled with 2 (0), and the main diagonal with 1. In this case, setting $C_{t, - 1} = f_{0, - 1} = 0,$ ${ACD}_{t}$ calculates as [3.3] ${ACD}_{t} = n^{- 1} \sum_{j = 0}^{d} w_{j} {(C_{t, j - 1} + C_{t, j} - n (f_{0, j - 1} + f_{0, j}))}^{2},$ [3.3] i.e., it is a quadratic form in the average cumulative proportions $\frac{1}{2} (f_{0, j - 1} + f_{0, j}),$ the so-called “ridits” (Agresti Citation2010, p. 10). This artificial word was created by Bross (Citation1958) in analogy to “logit” and “probit” (recall Section 2.2), where “rid” expresses “relative to an identified distribution”. The weight vector w might simply be chosen as $w = 1 = {(1, \dots, 1)}^{⊤}$ (Wang, Su, and Xie Citation2018, p. 119) if there is no reason for a more sophisticated weighting scheme.

Another weighted quadratic-form scheme is the univariate location-scale ordinal (ULSO) control chart recently proposed by Bai and Li (Citation2021), where the weights are derived from the latent-variable approach [Equation2.4[2.4] $P (X = s_{j}) = F_{Q} (η_{j}) - F_{Q} (η_{j - 1}) for j = 0, \dots, d .$ [2.4] ]. It can be expressed as that ACD chart [Equation3.2[3.2] ${ACD}_{t} = n^{- 1} (N_{t} - n p_{0}) ^{⊤} V (N_{t} - n p_{0}),$ [3.2] ], where [3.4] $V = Q^{⊤} {(Q (diag (p_{0}) - p_{0} p_{0}^{⊤}) Q^{⊤})}^{- 1} Q .$ [3.4]

If using the cumulative logit approach as recommended by (Bai and Li Citation2021, p. 941), the $2 \times (d + 1)$ -matrix $Q = (q_{k l})$ has the entries $q_{1 j} = f_{0, j - 1} + f_{0, j} - 1, q_{2 j} = p_{0, j}^{- 1} (η (f_{0, j}) - η (f_{0, j - 1})) for j = 0, \dots, d,$ where $η (z) = z (1 - z) \ln ((1 - z) / z)$ with $η (0) = η (1) = 0 .$ Thus, the “location scores” $q_{1 j}$ again rely on ridits, whereas the structure of the “scale scores” $q_{2 j}$ is a consequence of the cumulative logit approach (remind Section 2.2). Note that [Equation3.4[3.4] $V = Q^{⊤} {(Q (diag (p_{0}) - p_{0} p_{0}^{⊤}) Q^{⊤})}^{- 1} Q .$ [3.4] ] is well defined only if $d \geq 2,$ and that the ULSO statistic reduces to the ordinary Pearson statistic [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] if d = 2; see Appendix C for details. Thus, non-trivial ULSO charts are only available if $d \geq 3$ (i.e., if the range of X consists of at least four categories).

Finally, let us recall that the Shewhart versions of the quadratic-form control charts [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] to [Equation3.4[3.4] $V = Q^{⊤} {(Q (diag (p_{0}) - p_{0} p_{0}^{⊤}) Q^{⊤})}^{- 1} Q .$ [3.4] ] are easily equipped with an inherent memory by substituting the raw frequencies $N_{t}$ by the smoothed frequencies $N_{t}^{(λ)}$ according to [Equation2.1[2.1] $N_{t}^{(λ)} = λ N_{t} + (1 - λ) N_{t - 1}^{(λ)} for t = 1, 2, \dots, N_{0}^{(λ)} = n p_{0} .$ [2.1] ]. This was suggested by Wang, Su, and Xie (Citation2018); Bai and Li (Citation2021) for their ACD and ULSO charts, respectively, but it can also be applied to the Pearson statistic [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] as well as to the Shewhart-type charts being discussed in later sections.

3.2. Statistics based on weighted class counts

Several control charts rely on a type of weighted class count, i.e., a statistic of the form [3.5] $D_{t} = v_{0} N_{t, 0} + \dots + v_{d} N_{t, d},$ [3.5] which is plotted against a lower and upper CL (LCL and UCL, respectively). Statistic [Equation3.5[3.5] $D_{t} = v_{0} N_{t, 0} + \dots + v_{d} N_{t, d},$ [3.5] ] corresponds to taking the sample sum after having transformed the original ordinal range $S$ into a range of numerical scores, $V = {v_{0}, \dots, v_{d}} .$ There are two common ways of choosing the scores in practice: one may define the scores based on probabilistic arguments, as detailed below, or one chooses the scores based on “external reasoning”, which is first discussed in the present section. In a quality context, such scores are typically chosen to express the severity of defects, leading to a demerit control chart. Such demerit charts date back to Dodge (Citation1928); Dodge and Torrey (Citation1956) and have been further investigated by, among others, Jones, Woodall, and Conerly (Citation1999) and (Montgomery Citation2009, Section 7.3.3). In the latter two references, the default example refers to $d + 1 = 4$ levels of defect, ranging from class D (“not serious”) to A (“very serious”) with corresponding demerit values $1, 10, 50, 100 .$ Another example is reported by Nembhard and Nembhard (Citation2000), where $d + 1 = 3$ classes of nonconformity (minor, major, critical) are assigned the scores 1, 3, 10. Finally, in Wardell and Candia (Citation1996), the scores regarding customer satisfaction are simply chosen as the integers $1, \dots, d + 1$ (Likert scale) — this particular choice shall be discussed again in Section 3.3.

Although common in practice, the approach of choosing numerical scores for ordinal data (and thus implicitly using a metric scale) is criticized by (Agresti Citation2010, p. 10), because “often, an appropriate choice of scores is unclear” and the obtained outcome might be sensitive to the actual choice of the scores. Applied to the context of SPC, the question arises if there is a notable effect of this choice on the ARL performance of the demerit chart [Equation3.5[3.5] $D_{t} = v_{0} N_{t, 0} + \dots + v_{d} N_{t, d},$ [3.5] ].

Therefore, an alternative approach is to derive the weights $v_{0}, \dots, v_{d}$ in [Equation3.5[3.5] $D_{t} = v_{0} N_{t, 0} + \dots + v_{d} N_{t, d},$ [3.5] ] from probabilistic principles. There are several suggestions in the literature on how to choose these scores in this way. The so-called simple ordinal categorical (SOC) chart of Li, Tsung, and Zou (Citation2014) plots the absolute values of [Equation3.5[3.5] $D_{t} = v_{0} N_{t, 0} + \dots + v_{d} N_{t, d},$ [3.5] ], where the weights v_j are related to the ridits of the in-control distribution: [3.6] ${SOC}_{t} = | \sum_{j = 0}^{d} (f_{0, j - 1} + f_{0, j} - 1) N_{t, j} | with f_{0, - 1} : = 0.$ [3.6]

Obviously, the SOC chart [Equation3.6[3.6] ${SOC}_{t} = | \sum_{j = 0}^{d} (f_{0, j - 1} + f_{0, j} - 1) N_{t, j} | with f_{0, - 1} : = 0.$ [3.6] ] is related to the ACD chart [Equation3.3[3.3] ${ACD}_{t} = n^{- 1} \sum_{j = 0}^{d} w_{j} {(C_{t, j - 1} + C_{t, j} - n (f_{0, j - 1} + f_{0, j}))}^{2},$ [3.3] ] and the ULSO chart [Equation3.4[3.4] $V = Q^{⊤} {(Q (diag (p_{0}) - p_{0} p_{0}^{⊤}) Q^{⊤})}^{- 1} Q .$ [3.4] ], although these monitor quadratic-form statistics based on weighted class counts. Recall that [Equation3.5[3.5] $D_{t} = v_{0} N_{t, 0} + \dots + v_{d} N_{t, d},$ [3.5] ] to [Equation3.6[3.6] ${SOC}_{t} = | \sum_{j = 0}^{d} (f_{0, j - 1} + f_{0, j} - 1) N_{t, j} | with f_{0, - 1} : = 0.$ [3.6] ] are easily combined with the EWMA smoothing [Equation2.1[2.1] $N_{t}^{(λ)} = λ N_{t} + (1 - λ) N_{t - 1}^{(λ)} for t = 1, 2, \dots, N_{0}^{(λ)} = n p_{0} .$ [2.1] ] as well. Also the (nominal) control chart proposed by Perry (Citation2020) should be mentioned here, where the weights v_j in [Equation3.5[3.5] $D_{t} = v_{0} N_{t, 0} + \dots + v_{d} N_{t, d},$ [3.5] ] are equal to $1 / (n p_{0, j})$ for $j = 0, \dots, d,$ in analogy to the Pearson statistic [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ]. Furthermore, Perry (Citation2020) uses an EWMA approach, where the EWMA smoothing is not applied to the frequency vectors as in [Equation2.1[2.1] $N_{t}^{(λ)} = λ N_{t} + (1 - λ) N_{t - 1}^{(λ)} for t = 1, 2, \dots, N_{0}^{(λ)} = n p_{0} .$ [2.1] ], but to a standardized version of the weighted class count [Equation3.5[3.5] $D_{t} = v_{0} N_{t, 0} + \dots + v_{d} N_{t, d},$ [3.5] ].

Some control charts with probabilistically weighted class counts are inspired by a log-likelihood ratio (log-LR) approach. By contrast to the previous charts, not only the in-control distribution $p_{0}$ has to be specified, but also a relevant out-of-control scenario, say $p_{1} .$ Here, one can account for the ordinal nature of the data by choosing $p_{1}$ with respect to a parametric ordinal model, recall Section 2.2. In Steiner, Geyer, and Wesolowsky (Citation1994, Citation1996a,Citation1996b), for instance, a latent-variable approach [Equation2.4[2.4] $P (X = s_{j}) = F_{Q} (η_{j}) - F_{Q} (η_{j - 1}) for j = 0, \dots, d .$ [2.4] ] is used. As an example, the threshold parameters $η_{0}, \dots, η_{d - 1}$ could be chosen such that $p_{0}$ arises from a standard normal distribution, $N (0, 1),$ and $p_{1}$ is computed from the same $η_{0}, \dots, η_{d - 1}$ but for $N (μ, 1)$ with some anticipated $μ = 0 .$ Analogously, if using a rank-count approach like in [Equation2.3[2.3] $P (X = s_{j}) = ω \cdot 1 (j = k) + (1 - ω) \cdot (\begin{matrix} d \\ j \end{matrix}) π^{j} {(1 - π)}^{d - j},$ [2.3] ], one may use $π_{1} = π_{0}$ to distinguish $p_{1}$ from $p_{0} .$

Then, Steiner, Geyer, and Wesolowsky (Citation1994, Citation1996a) propose a (one- or two-sided) Shewhart monitoring of the log-LR statistics [3.7] $l R_{t} = \sum_{j = 0}^{d} N_{t, j} \ln (p_{1, j} / p_{0, j}),$ [3.7] while Steiner, Geyer, and Wesolowsky (Citation1996b); Ryan, Wells, and Woodall (Citation2011) consider a corresponding upper-sided CUSUM control chart: [3.8] $C_{t} = \max {0, l R_{t} + C_{t - 1}}, C_{0} = 0.$ [3.8]

As a related approach, we propose to adapt the concept of the Shiryaev–Roberts (SR) control chart (Shiryaev Citation1961; Roberts Citation1966) to the case of ordinal data. Together with $\exp (l R_{t}) = \prod_{j = 0}^{d} {(p_{1, j} / p_{0, j})}^{N_{t, j}},$ its recursive implementation is given by [3.9] $R_{t} = (R_{t - 1} + 1) \exp (l R_{t}), R_{0} = 0,$ [3.9] where the design recommendations of Ottenstreuer (Citation2022) are useful for the practical implementation of the SR chart.

3.3. Ordinal statistics

For monitoring quantitative real-valued processes, it is common to focus on relevant stochastic properties such as the process’ mean (e.g., $\bar{X}$ -chart) or variance (e.g., S²-chart), see Montgomery (Citation2009). Accordingly, it would be natural to apply analogous approaches also to ordinal samples data. Surprisingly, control charts based on such ordinal statistics have been rarely addressed in the literature so far. Let us start with a brief summary about important properties of an ordinal random variable X. The location of X is typically measured in terms of the mode or median, where only the median accounts for the natural order among the states. The mode, by contrast, which is also well-defined for nominal data, might not even be unique. For the dispersion of X, a large variety of measures have been proposed in the literature, see Kvålseth (Citation2011b) for a survey. As explained in Section 2.2, a crucial requirement for such ordinal dispersion measures is that any one-point distribution is mapped on the minimal value of the measure’s range, and the extreme two-point distribution on the maximal value. Probably the most well-known example is given by the index of ordinal variation, $IOV = \frac{4}{d} \sum_{i = 0}^{d - 1} f_{i} (1 - f_{i}),$ which has the range $[0; 1] .$ Finally, also measures of ordinal skewness are important for practice (whereas skewness is not meaningful for nominal data), such as those discussed by Klein and Doll (Citation2021). A simple example is given by $skew = (\frac{2}{d} \sum_{i = 0}^{d - 1} f_{i}) - 1$ with range $[- 1; 1],$ where $skew = 0$ is attained for a symmetric distribution, i.e., if $f_{j} = 1 - f_{d - 1 - j}$ for $j = 0, \dots, d - 1 .$

At first glance, it appears reasonable to simply monitor the sample counterparts of the aforementioned measures on a control chart. However, as the range of X is often quite small (low value of d), it will be difficult in practice to design a control chart based on the sample median or mode. Thus, for monitoring location, alternatives like the SOC chart [Equation3.6[3.6] ${SOC}_{t} = | \sum_{j = 0}^{d} (f_{0, j - 1} + f_{0, j} - 1) N_{t, j} | with f_{0, - 1} : = 0.$ [3.6] ] appear to be more promising. Control charts for ordinal dispersion, by contrast, are more easily implemented, and as explained by Weiß (Citation2021a), they are indeed particularly relevant for quality-related applications. Assume, for example, that s₀ refers to the best quality, and increasing states $s_{j} > s_{0}$ refer to increasing quality deteriorations. For a well-set process, the in-control probability for s₀ should be larger than the remaining probabilities (mode s₀), and $p_{0}$ should be reasonably close to the one-point distribution in s₀ (low dispersion). Quality deteriorations will move probability mass from s₀ to states $s_{j} > s_{0}$ such that the ordinal dispersion increases. Thus, monitoring for increases in dispersion appears to be relevant for many SPC applications. Accordingly, Bashkansky and Gadrich (Citation2011); Weiß (Citation2021a) use the sample IOVs computed as [3.10] ${IOV}_{t} = \frac{4}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j} (1 - {\hat{f}}_{t, j}),$ [3.10] which can also be combined with the EWMA smoothing [Equation2.1[2.1] $N_{t}^{(λ)} = λ N_{t} + (1 - λ) N_{t - 1}^{(λ)} for t = 1, 2, \dots, N_{0}^{(λ)} = n p_{0} .$ [2.1] ] for achieving a memory-type chart. The monitoring of alternative (sample) dispersion measures, such as those in Kvålseth (Citation2011b), has not been investigated so far. Also a control chart based on an ordinal skewness measure (Klein and Doll Citation2021), such as [3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] could be a relevant option. If $p_{0}$ is close to the one-point distribution in s₀, we are concerned with strong positive skewness, and this gets reduced by quality deteriorations. At this point, an important relation to the demerit chart [Equation3.5[3.5] $D_{t} = v_{0} N_{t, 0} + \dots + v_{d} N_{t, d},$ [3.5] ] in Section 3.2 should be pointed out. For the rank-count variable I with range ${0, \dots, d}$ and CDF f, recall the discussion in Section 2.2, it is well known that its mean can be expressed as $E [I] = d - \sum_{j = 0}^{d - 1} f_{j} .$ This result is commonly referred to as the “alternative expectation formula”; an early reference is (Karlin and Taylor Citation1975, p. 33). It can be used to rewrite [Equation3.11[3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] ] as $n \cdot {skew}_{t} = \sum_{j = 0}^{d} (1 - \frac{2}{d} j) N_{t, j} \Leftrightarrow \sum_{j = 0}^{d} (j + 1) N_{t, j} = n (1 + \frac{d}{2}) - n \frac{d}{2} \cdot {skew}_{t} .$

Hence, the ${skew}_{t}$ -chart is equivalent to a demerit chart with linear weighting scheme, such as the integer weights $1, \dots, d + 1$ suggested by Wardell and Candia (Citation1996). This, in turn, implies that these particular demerit weights can be justified probabilistically as accounting for the ordinal skewness of X.

3.4. Miscellaneous approaches

For the sake of completeness, we provide information and references on some further control charts for ordinal data, although these are not considered for our simulation study in Section 4. The “p-tree method” of Duran and Albin (Citation2009), for example, monitors conditional (ordinal) events of the form $X = s_{j} | X \geq s_{j} .$ This is achieved by running d control charts simultaneously, which plot the statistics [3.12] $N_{t, 0} / n and N_{t, j} / (n - C_{t, j - 1}) for j = 1, \dots, d - 1.$ [3.12]

But as our focus is on univariate control charts, we do not further consider the p-tree method here.

Tucker, Woodall, and Tsui (Citation2002) assume the ordinal data to follow a latent-variable model, recall (2.4), with location parameter μ. Their control chart plots the statistics $T_{t} = {\hat{μ}}_{t} / se ({\hat{μ}}_{t}),$ where ${\hat{μ}}_{t}$ is the maximum likelihood estimate of μ computed from the tth sample, and $se ({\hat{μ}}_{t})$ the estimated asymptotic standard error of ${\hat{μ}}_{t} .$ While theoretically appealing, the practical implementation is demanding as $T_{t}$ is computed by numerical methods.

Franceschini and Romano (Citation1999) consider ordinal variables being evaluated on a linguistic scale. For the tth sample, the so-called “ordered weighted average” is used as a measure of location, and the ordinal range as a measure of dispersion. Then, corresponding Shewhart charts are constructed. But as criticized by Franceschini, Galetto, and Varetto (Citation2005, p. 181), “the dynamics of the charts are poor and little information can be extracted about the process”. Therefore, Franceschini, Galetto, and Varetto (Citation2005) suggested to define an order relation between the ordinal samples according to a specified dominance criterion. Then, the tth sample is ranked according to its position in the ordered sample space, and this information is used for computing the monitored statistic. Here, a finer resolution for the sample statistics is achieved, but the practical difficulty arises from the fact that the size of the sample space (the number of equivalence classes) increases linearly in both d and n. Thus, an implementation appears feasible only for rather low sample sizes.

4. A simulation-based performance analysis

To provide a performance comparison of the control charts [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] to [Equation3.11[3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] ], where an overview is provided in , we define four different in-control scenarios $p_{0}$ for our simulation study that are inspired by real-data examples from the literature; namely from

Table 1. Control charts used in simulation study.

Download CSV Display Table

Bashkansky and Gadrich (Citation2011): conditions of patients visiting their Health Maintenance Organization, $d + 1 = 3$ categories;
Wang, Su, and Xie (Citation2018): quality level of white wine, $d + 1 = 4$ categories;
Wardell and Candia (Citation1996): patient satisfaction with predischarge information in a hospital, $d + 1 = 5$ categories; and
Li, Tsung, and Zou (Citation2014): quality of manufactured toothbrush heads, $d + 1 = 4$ categories.

To simplify the subsequent discussion, it is advisable to further classify these scenarios, because some of the control charts [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] to [Equation3.11[3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] ] implicitly assume a certain shape of the in-control distribution $p_{0} .$ For example, the IOV chart [Equation3.10[3.10] ${IOV}_{t} = \frac{4}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j} (1 - {\hat{f}}_{t, j}),$ [3.10] ] is tailor-made for detecting increases in dispersion, i.e., it is expected to perform well if quality deteriorations go along with increasing dispersion. This would be the case in a “high-quality situation”, where $p_{0, 0}$ is largest among all probabilities in $p_{0}$ (i.e., where the mode category corresponds the best quality level; recall that we always arrange the categories in $S$ according to increasing quality deterioration). Also the demerit schemes [Equation3.5[3.5] $D_{t} = v_{0} N_{t, 0} + \dots + v_{d} N_{t, d},$ [3.5] ] appear to be well-suited for this case, as they use increasing weights for increasing deterioration. On the other hand, the skew chart [Equation3.11[3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] ], for example, might work well also for a roughly symmetrical distribution, i.e., where medium quality levels are most frequent under in-control conditions. We refer to such a case as a “medium-quality situation”, i.e., if the mode category s_j satisfies $0 < j < d .$ According to this distinction, the examples of Wardell and Candia (Citation1996) and Li, Tsung, and Zou (Citation2014) are classified as high-quality situations (which are also skewed to the right). The examples of Bashkansky and Gadrich (Citation2011) and Wang, Su, and Xie (Citation2018), by contrast, are medium-quality situations, where the patient data in Bashkansky and Gadrich (Citation2011) are nearly symmetrically distributed, and the wine data of Wang, Su, and Xie (Citation2018) are skewed to the left. Besides simplifying the discussion of the subsequent results, the distinction between high-quality and medium-quality situations is also appealing for practice, as it allows easy entry into the selection of a suitable control chart.

By closely following these real-world applications, we ensure the practical relevance of the considered $p_{0},$ while their diverse properties shall help to reach a well-founded evaluation of the charts’ performance. To also get meaningful out-of-control situations, we embed these four scenarios into the most common parametric models for ordinal data, recall Section 2.2, namely a rank-counts approach using the $k - IB (d, π, ω)$ -distribution in [Equation2.3[2.3] $P (X = s_{j}) = ω \cdot 1 (j = k) + (1 - ω) \cdot (\begin{matrix} d \\ j \end{matrix}) π^{j} {(1 - π)}^{d - j},$ [2.3] ] as well as the logit and probit latent-variable approaches [Equation2.4[2.4] $P (X = s_{j}) = F_{Q} (η_{j}) - F_{Q} (η_{j - 1}) for j = 0, \dots, d .$ [2.4] ]. While the latter can be adapted to any choice of $p_{0}$ (due to having d threshold parameters $η_{0}, \dots, η_{d - 1}$ ), the more parsimonious $k - IB (d, π, ω)$ -distribution is less flexible. Therefore, the final in-control scenarios are defined by specifying a $k - IB (d, π, ω)$ -model, namely the medium-quality cases

Scenario 1 (inspired by Bashkansky and Gadrich (Citation2011)):
$1 - IB (2, 0.5, 0.7),$ i.e., $p_{0} = (0.075, 0.850, 0.075);$
Scenario 2 (inspired by Wang, Su, and Xie (Citation2018)):
$2 - IB (3, 0.5, 0.6),$ i.e., $p_{0} = (0.050, 0.150,$ $0.750, 0.050);$

and the high-quality cases

Scenario 3 (inspired by Wardell and Candia (Citation1996)):
$0 - IB (4, 0.3, 0.4),$ i.e., $p_{0} \approx (0.544, 0.247, 0.159,$ $0.045, 0.005);$
Scenario 4 (inspired by Li, Tsung, and Zou (Citation2014)):
$0 - IB (3, 0.4, 0.8),$ i.e., $p_{0} \approx (0.843, 0.086,$ $0.058, 0.013) .$

The out-of-control choices for p according to the $k - IB (d, π, ω)$ -model are generated by increasing the value of π, see the corresponding blocks in the first column of in Appendix A. In addition, we consider out-of-control p resulting from various (positive) location shifts in the logit and probit implementations of $p_{0},$ i.e., where $η_{0}, \dots, η_{d - 1}$ are chosen to match $p_{0}$ for $Q \sim Log (0, 1)$ and $Q \sim N (0, 1),$ respectively. In this way, we get a variety of out-of-control scenarios (see for an overview), which are not chosen arbitrarily but implied by well-established models for ordinal data. For all four scenarios, the sample size is chosen as n = 100 like in Bai and Li (Citation2021), and we also followed their choice for the EWMA smoothing parameter, namely $λ = 0.1 .$

Common to all out-of-control p is that they imply a deterioration in quality, such that p is more left-skewed than $p_{0} .$ Thus, the skew chart [Equation3.11[3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] ] is designed with a lower control limit, whereas all remaining charts have an upper limit as quality deteriorations lead to an increase in the monitored statistics. Recall that the skew chart is equivalent to a demerit chart with linearly increasing weights $1, \dots, d + 1 .$ Generally, the idea behind demerit charts is choosing the weights to express the severity of defects, but not according to some probabilistic reasoning. Therefore, we checked the literature for corresponding recommendations (being different from the linear weights behind the skew chart [Equation3.11[3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] ] to avoid a duplication of results) to define the charts labeled as “Demerit” in . For d = 2, we found the recommendation of Nembhard and Nembhard (Citation2000) to use 1, 3, 10, and for d = 3 the one of Jones, Woodall, and Conerly (Citation1999); Montgomery (Citation2009) to use $1, 10, 50, 100 .$ Note that in both cases, the weights increase faster than linearly. For this reason, we also selected super-linear weights for d = 4, namely, $1^{2}, \dots, 5^{2},$ where we did not find a recommendation in the literature. To summarize, while the columns “Skew” in –A.4 refer to demerit schemes with linear weights, the columns “Demerit” are based on super-linear weights.

As the performance measure, we consider the ARL discussed in Section 2.1, which is determined by simulation with 10⁶ replications to ensure a sufficiently good accuracy. The control limits of the charts are adjusted to give an in-control ARL (ARL₀) of $\approx 370,$ which is a common target value in the SPC literature. However, due to the discreteness of the observations and, thus, of the monitored statistics, it is usually not possible to meet this value exactly. In a few cases (mainly for the Shewhart version of the skew chart), there are notable deviations from 370, which must be taken into account in the evaluation of the corresponding out-of-control performance. But generally, the actual ARL₀-values are very close to 370, see the first segments of . Finally, note that for the CUSUM chart [Equation3.8[3.8] $C_{t} = \max {0, l R_{t} + C_{t - 1}}, C_{0} = 0.$ [3.8] ] and the SR chart [Equation3.9[3.9] $R_{t} = (R_{t - 1} + 1) \exp (l R_{t}), R_{0} = 0,$ [3.9] ], it is also necessary to specify an out-of-control target value $p_{1}$ for the PMF, recall Section 3.2. For each scenario, we have chosen $p_{1}$ as that PMF p that is implied by a logit model with a small location shift (as both methods are usually used for detecting small changes), namely 0.05 for Scenarios 1–2, 0.04 for 3, and 0.1 for 4.

4.1. ARL performance in medium-quality setting

Let us start our performance analyses with the medium-quality scenarios 1 and 2, where the corresponding ARL values are summarized in . Note that the ARLs of the ULSO chart [Equation3.4[3.4] $V = Q^{⊤} {(Q (diag (p_{0}) - p_{0} p_{0}^{⊤}) Q^{⊤})}^{- 1} Q .$ [3.4] ] are omitted in , because it is identical to the Pearson statistic [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] for d = 2, recall Appendix C. Furthermore, it is also not surprising that the memory-type charts (EWMA, CUSUM, and SR), all being listed in the lower part (labeled “EWMA”) of , outperform their Shewhart counterparts (upper part).

Recall that the CUSUM and SR charts are designed for $p_{1}$ according to the logistic distribution with a small location shift. Thus, it is not surprising that the CUSUM chart, due to its known optimality properties, has the lowest ARL at very small deviations from $p_{0},$ regardless of the specific generation of p, whereas SR has inferior performance. It is remarkable, however, that the EWMA-type skew chart is most sensitive to changes in every other case of Scenario 1 (and also performs very well in Scenario 2). This can be explained by the fact that $p_{0}$ is close to symmetry, and quality deteriorations immediately lead to negative skewness values. Just as clearly, the IOV chart turns out to be the worst choice in Scenario 1, in both the EWMA and Shewhart version. Across all shift types, the chart suffers from the small increase in variation, starting from a symmetric $p_{0}$ with already low variation. The performance improves for Scenario 2, although for large changes, the IOV chart still has the worst performance. Thus, the use of the IOV chart for medium-quality settings is not recommended. For small changes, the Shewhart-type ACD chart stands out in , with out-of-control ARLs being larger than the in-control ARL, an unfavorable behavior of a control method. Also X²-, ULSO, and SOC chart most often perform rather poorly. By contrast, the EWMA-type demerit chart (with super-linear scores $1, 10, 50, 100$ ) shows very good results for Scenario 2, with similar out-of-control ARLs as the EWMA-skew chart with its linear weights. So it seems that the actual choice of weights is not crucial in Scenario 2, except the fact that the Shewhart version of the demerit chart can be adjusted more closely to the target ARL₀. For the perfectly symmetric Scenario 1, the EWMA-skew chart is slightly better than the EWMA demerit one (but the demerit’s Shewhart version again better meets ARL₀), which is plausible as its linear weights arise from the measurement of skewness. Generally, the demerit and skew charts do rather well throughout , i.e., the demerit principle proves to be an effective tool for monitoring ordinal medium-quality observations, which is quite striking considering the simplicity of the method and the more or less arbitrary scores. There seems to be only little effect of the actual weighting scheme, as long as the demerit scores increase with increasing quality deterioration. At this point, it is worth noting that the probabilistic weights $\ln (p_{1, j} / p_{0, j})$ used for CUSUM and SR chart, recall [Equation3.7[3.7] $l R_{t} = \sum_{j = 0}^{d} N_{t, j} \ln (p_{1, j} / p_{0, j}),$ [3.7] ], are also increasing in j, namely nearly linear in Scenario 1 and super-linear in Scenario 2.

4.2. ARL performance in a high-quality setting

In the medium-quality setting of Section 4.1, we ended up with a surprisingly unique recommendation: although not necessarily being the optimal choice in any shift scenario, both demerit-type charts (especially the EWMA-skew chart) perform very well in any case. Now, let us investigate the ARLs in for the high-quality setting (Scenarios 3–4). First note that still the (EWMA) demerit-type charts are among the best charts in both . This time, however, also the IOV chart turns out to be a strong competitor, which is plausible as a high-quality situation corresponds to low ordinal dispersion, being quickly increased by quality deteriorations. By contrast, the X²-, ACD, ULSO, and SOC charts (especially their EWMA versions) often have clearly larger out-of-control ARLs than the corresponding demerit, skew, and IOV charts. The SR chart is generally worse than the CUSUM chart, and the CUSUM’s performance is quite sensitive to the actual out-of-control scenario: being designed based on logit assumptions (which now leads to sub-linearly increasing weights in both scenarios), it has competing performance under logit and probit specifications, but deteriorates under 0-IB specification. Altogether, the EWMA versions of demerit, skew, and IOV chart perform very well. It is not possible to derive a unique recommendation from this group that is superior in any out-of-control scenario. But it gets clear that any of these charts performs reasonably well in each scenario, even if not being the best choice.

To summarize, although some of the control charts [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] to [Equation3.11[3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] ] rely on quite sophisticated derivations, quality deteriorations appear to be best detected by using quite basic control statistics: a demerit-type chart (including the skew chart) in combination with EWMA smoothing always shows rather good performance, and for high-quality settings, also the (EWMA-)IOV chart is a good choice for process monitoring. Generally, the use of EWMA smoothing for PMF estimation is recommended: for the out-of-control scenarios in , we did not observe any case where the Shewhart ARL would be lower than the EWMA ARL.

5. An illustrative data example

In this section, we demonstrate the application of the control charts studied in Section 4 by means of a real-data example. We use a data set from manufacturing industry that was presented by Li, Tsung, and Zou (Citation2014), and where the corresponding in-control model was taken as an inspiration for the (high-quality) Scenario 4 in Section 4. Now, let us focus on the Phase-II data for illustration.

The data set consists of 30 samples with a unique sample size of n = 64, where each individual observation expresses the quality of a manufactured electric toothbrush head. Here, the quality is measured in terms of the excess plastic (known as “flash”), which may arise on the brush head when its two components are welded together. The extent of flash is classified into the $d + 1 = 4$ ordinal categories $s_{0} = “ slight ”, s_{1} = “ small ”, s_{2} = “ medium ”,$ and $s_{3} = “ large ” .$ These categories are ordered according to degrading quality, because increasing flash implies a higher risk of injury. (left) shows the sample frequencies $N_{t, 1}, N_{t, 2}, N_{t, 3}$ (corresponding to non-optimal quality) as a bar plot, whereas the absolute frequencies $N_{t, 0},$ referring to the best quality, result from the difference $n - N_{t, 1} - N_{t, 2} - N_{t, 3} .$ On the right-hand side of , the control limits of the charts and their corresponding in-control ARLs are displayed, which are again determined by simulations with 10⁶ replications. This time, the chart designs are based on the exact in-control PMF $p_{0} = (0.8631, 0.0804, 0.0357, 0.0208)$ as given in Li, Tsung, and Zou (Citation2014) (not on the “smoothed” version used for Scenario 4). Like in Section 4, the Shewhart version of the skew chart is the most inflexible in adjusting its ARL₀ to the target value of 370.

Despite its low ARL₀, the skew chart triggers an alarm only at time t = 25, just like the other Shewhart-type charts do, see in Appendix D. In fact, the 25th sample causing the alarm expresses the worst quality among all samples, see (left): the total frequency $N_{t, 1} + N_{t, 2} + N_{t, 3}$ of non-optimal quality as well as the frequency $N_{t, 3}$ of worst quality are maximal for t = 25. However, due to their lack of memory, the Shewhart charts of do not indicate that there seems to be some disturbing trend already before t = 25 (such a trend is also hardly visible from ). This trend gets clear from the memory-type charts in , i.e., if EWMA-smoothing with $λ = 0.1$ is added to the aforementioned charts, or if CUSUM and SR charts are considered (defining $p_{1}$ by a logit model with location shift 0.1). The statistics of all memory charts start to gradually increase (or decrease in the case of the skew chart) around t = 20. But unlike the Shewhart charts, only three memory charts, namely demerit, IOV, and skew, signal a change already at t = 25. The remaining memory charts trigger their first alarm with a delay between one and four samples. This illustrates the well-known dilemma of memory-type charts: they are able to detect a gradual deterioration or small shift in the process, but they may be slow in recognizing a sudden large shift (such as for t = 25). Shewhart charts, by contrast, typically show the opposite performance and are, thus, uniquely successful in the present data example. It is therefore all the more remarkable that the EWMA versions of the demerit, IOV, and skew chart are as quick as their Shewhart versions. This outstanding performance of the demerit and skew charts is consistent with our general findings in Section 4, and for the IOV chart, it follows from the fact that we are concerned with a high-quality situation (i.e., maximal in-control probability for the best quality level).

Figure 1. Left: Frequencies $N_{t, 1}, N_{t, 2}, N_{t, 3}$ of flash types $s_{1}, s_{2}, s_{3}$ on toothbrush heads, where $N_{t, 0} = 64 - N_{t, 1} - N_{t, 2} - N_{t, 3} .$ Right: Control limits of considered control charts, simulated ARL₀ in parentheses.

6. Conclusions and future research

In this paper, we provided an extensive survey of sample-based control charts for monitoring i.i.d. univariate ordinal data. The control charts in the literature were classified into identified groups of control statistics for comparisons and discussions. Some new results and proposals were given to supplement the existing work. A simulation study of the discussed control charts was completed for evaluating and comparing the performance under four diverse in-control scenarios. These four scenarios were inspired by real-world applications from the literature, and they were embedded into common parametric models for ordinal data to get a variety of out-of-control scenarios. Our simulation results indicate that demerit-type charts perform considerably well for monitoring ordinal medium-quality observations, with little influence from the actual weighting scheme. For the high-quality setting scenarios, again the demerit-type charts are among the best performing charts, with the IOV chart being a strong competitor. In conclusion, the demerit principle proves to be very effective in process monitoring applications with ordinal data, and a demerit-type chart (including the skew chart) in combination with the EWMA smoothing is recommended based on the cases studied.

There are several directions for future research on control charts for ordinal processes. First, ordinal control charts for individual observations (rather than samples taken from the process) should be developed. This would not only be relevant for automated production processes with 100% inspection, but also for a second future research direction, the monitoring of ordinal time series data. We conjecture that serial dependence between individual observations (or within samples taken from the process) will impact the performance of control charts being developed for i.i.d. data, such that a revised chart design or novel types of control charts are necessary in such a case. To our knowledge, Li, Xu, and Zhou (Citation2018); Li and Lu (Citation2022) are the only proposals on sequential methods for ordinal time series up to now. More articles are available on the monitoring of multivariate ordinal data, see Wang, Li, and Su (Citation2017); Bai and Li (Citation2021); Hakimi et al. (Citation2021) and the references therein, but an extensive comparative study is yet missing. Finally, as real-world ordinal time series often exhibit missing data, see Weiß (Citation2021b), it would be interesting to discuss how control charts should be adapted to account for missingness.

Acknowledgments

The authors thank the two referees for their useful comments on an earlier draft of this article. The authors are grateful to Professor Jian Li (Xi’an Jiaotong University, China) for providing the flash data discussed in Section 5.

Data availability statement

The data that support the findings of this study are available from the corresponding author, C.H. Weiß, upon reasonable request.

Disclosure statement

The authors report no conflict of interest.

Additional information

Funding

This work was supported by the Interne Forschungsförderung 2021 (IFF 2021) of Helmut Schmidt University.

Notes

1 Another type of qualitative data, which is not considered here, are nominal data, where we do not have such a natural order; corresponding SPC references can be found in Weiß (Citation2021a).

References

Agresti, A. 2010. Analysis of Ordinal Categorical Data. 2nd edition. Hoboken, New Jersey: John Wiley & Sons, Inc.
Google Scholar
Bai, K., and J. Li. 2021. Location-scale monitoring of ordinal categorical processes. Naval Research Logistics (NRL) 68 (7):937–50. doi: 10.1002/nav.21973.
Web of Science ®Google Scholar
Bashkansky, E., and T. Gadrich. 2011. Statistical quality control for ternary ordinal quality data. Applied Stochastic Models in Business and Industry 27 (6):586–99. doi: 10.1002/asmb.868.
Web of Science ®Google Scholar
Bross, I. D. J. 1958. How to use ridit analysis. Biometrics 14 (1):18–38. doi: 10.2307/2527727.
Web of Science ®Google Scholar
Dodge, H. F. 1928. A method of rating manufactured product. Bell System Technical Journal 7 (2):350–68. doi: 10.1002/j.1538-7305.1928.tb01229.x.
Google Scholar
Dodge, H.F., Torrey, M.N. 1956. A check inspection and demerit rating plan. Journal of Quality Technology 9(3), 146–153.
Google Scholar
Duncan, A. J. 1950. A chi-square chart for controlling a set of percentages. Industrial Quality Control 7:11–5.
Google Scholar
Duran, R. I., and S. L. Albin. 2009. Monitoring and accurately interpreting service processes with transactions that are classified in multiple categories. IIE Transactions 42 (2):136–45. doi: 10.1080/07408170903074908.
Web of Science ®Google Scholar
Edwards, H. P., K. Govindaraju, and C. D. Lai. 2007. A control chart procedure for monitoring university student grading. International Journal of Services Technology and Management 8 (4/5):344–54. doi: 10.1504/IJSTM.2007.013924.
Google Scholar
Franceschini, F., and D. Romano. 1999. Control chart for linguistic variables: A method based on the use of linguistic quantifiers. International Journal of Production Research 37 (16):3791–801. doi: 10.1080/002075499190059.
Web of Science ®Google Scholar
Franceschini, F., M. Galetto, and M. Varetto. 2005. Ordered samples control charts for ordinal variables. Quality and Reliability Engineering International 21 (2):177–95. doi: 10.1002/qre.614.
Web of Science ®Google Scholar
Hakimi, A., H. Farughi, A. Amiri, and J. Arkat. 2021. Phase II monitoring of the ordinal multivariate categorical processes. Advances in Industrial Engineering 55 (3):249–67.
Google Scholar
Jones, L. A., W. H. Woodall, and M. D. Conerly. 1999. Exact properties of demerit control charts. Journal of Quality Technology 31 (2):207–16. doi: 10.1080/00224065.1999.11979915.
Web of Science ®Google Scholar
Karlin, S., and H. M. Taylor. 1975. A First Course in Stochastic Processes. 2nd edition. New York: Academic Press.
Google Scholar
Kiesl, H. 2003. Ordinale Streuungsmaße—Theoretische Fundierung und statistische Anwendungen (in German). Lohmar, Cologne: Josef Eul Verlag.
Google Scholar
Klein, I., and M. Doll. 2021. Tests on asymmetry for ordered categorical variables. Journal of Applied Statistics 48 (7):1180–98. doi: 10.1080/02664763.2020.1757045.
PubMed Web of Science ®Google Scholar
Knoth, S. 2006. The art of evaluating monitoring schemes—How to measure the performance of control charts?. In Frontiers in Statistical Quality Control 8, eds. H.-J. Lenz & P.-T. Wilrich, 74–99. Heidelberg: Physica-Verlag.
Google Scholar
Kvålseth, T. O. 2011a. The lambda distribution and its applications to categorical summary measures. Advances and Applications in Statistics 24 (2):83–106.
Google Scholar
Kvålseth, T. O. 2011b. Variation for categorical variables. In International Encyclopedia of Statistical Science, ed. M. Lovric, 1642–5. Berlin: Springer.
Google Scholar
Lehmann, E. L. 1953. The power of rank tests. The Annals of Mathematical Statistics 24 (1):23–43. doi: 10.1214/aoms/1177729080.
Google Scholar
Li, M., and Q. Lu. 2022. Changepoint detection in autocorrelated ordinal categorical time series. Environmetrics 33 (7):e2752. doi: 10.1002/env.2752.
Web of Science ®Google Scholar
Li, J., F. Tsung, and C. Zou. 2014. A simple categorical chart for detecting location shifts with ordinal information. International Journal of Production Research 52 (2):550–62. doi: 10.1080/00207543.2013.838329.
Web of Science ®Google Scholar
Li, J., J. Xu, and Q. Zhou. 2018. Monitoring serially dependent categorical processes with ordinal information. IISE Transactions 50 (7):596–605. doi: 10.1080/24725854.2018.1429695.
Web of Science ®Google Scholar
Marcucci, M. 1985. Monitoring multinomial processes. Journal of Quality Technology 17 (2):86–91. doi: 10.1080/00224065.1985.11978941.
Web of Science ®Google Scholar
Montgomery, D. C. 2009. Introduction to Statistical Quality Control. 6th edition. New York: John Wiley & Sons, Inc.
Google Scholar
Nembhard, D. A., and H. B. Nembhard. 2000. A demerits control chart for autocorrelated data. Quality Engineering 13 (2):179–90. doi: 10.1080/08982110108918640.
Google Scholar
Ottenstreuer, S. 2022. The Shiryaev–Roberts control chart for Markovian count time series. Quality and Reliability Engineering International 38 (3):1207–25. doi: 10.1002/qre.2945.
Web of Science ®Google Scholar
Page, E. 1954. Continuous inspection schemes. Biometrika 41 (1-2):100–15. doi: 10.1093/biomet/41.1-2.100.
Web of Science ®Google Scholar
Perry, M. B. 2020. An EWMA control chart for categorical processes with applications to social network monitoring. Journal of Quality Technology 52 (2):182–97. doi: 10.1080/00224065.2019.1571343.
Web of Science ®Google Scholar
Roberts, S. W. 1959. Control chart tests based on geometric moving averages. Technometrics 1 (3):239–50. doi: 10.1080/00401706.1959.10489860.
Google Scholar
Roberts, S. W. 1966. A comparison of some control chart procedures. Technometrics 8 (3):411–30. doi: 10.1080/00401706.1966.10490374.
Web of Science ®Google Scholar
Ryan, A. G., L. J. Wells, and W. H. Woodall. 2011. Methods for monitoring multiple proportions when inspecting continuously. Journal of Quality Technology 43 (3):237–48. doi: 10.1080/00224065.2011.11917860.
Web of Science ®Google Scholar
Samimi, Y., A. Aghaie, and M. J. Tarokh. 2010. Analysis of ordered categorical data to develop control charts for monitoring customer loyalty. Applied Stochastic Models in Business and Industry 26 (6):668–88. doi: 10.1002/asmb.808.
Web of Science ®Google Scholar
Shewhart, W. A. 1931. Economic Control of Quality of Manufactured Product. New York: D. Van Nostrand Company, Inc.
Google Scholar
Shiryaev, A. N. 1961. The problem of the most rapid detection of a disturbance in a stationary process. Soviet Mathematics: Doklady 2:795–9.
Google Scholar
Steiner, S. H., P. L. Geyer, and G. O. Wesolowsky. 1994. Control charts based on grouped observations. International Journal of Production Research 32 (1):75–91. doi: 10.1080/00207549408956917.
Web of Science ®Google Scholar
Steiner, S. H., P. L. Geyer, and G. O. Wesolowsky. 1996a. Shewhart control charts to detect mean and standard deviation shifts based on grouped data. Quality and Reliability Engineering International 12 (5):345–53. doi: 10.1002/(SICI)1099-1638(199609)12:5<345::AID-QRE11>3.0.CO;2-M.
Web of Science ®Google Scholar
Steiner, S. H., P. L. Geyer, and G. O. Wesolowsky. 1996b. Grouped data-sequential probability ratio tests and cumulative sum control charts. Technometrics 38 (3):230–7. doi: 10.1080/00401706.1996.10484502.
Web of Science ®Google Scholar
Tucker, G. R., W. H. Woodall, and K.-L. Tsui. 2002. A control chart method for ordinal data. American Journal of Mathematical and Management Sciences 22 (1-2):31–48. doi: 10.1080/01966324.2002.10737574.
Google Scholar
Wang, J., J. Li, and Q. Su. 2017. Multivariate ordinal categorical process control based on log-linear modeling. Journal of Quality Technology 49 (2):108–22. doi: 10.1080/00224065.2017.11917983.
Web of Science ®Google Scholar
Wang, J., Q. Su, and M. Xie. 2018. A univariate procedure for monitoring location and dispersion with ordered categorical data. Communications in Statistics - Simulation and Computation 47 (1):115–28. doi: 10.1080/03610918.2017.1280159.
Web of Science ®Google Scholar
Wardell, D. G., and M. R. Candia. 1996. Statistical process monitoring of customer satisfaction survey data. Quality Management Journal 3 (4):36–50. doi: 10.1080/10686967.1996.11918757.
Google Scholar
Weiß, C. H. 2020. Distance-based analysis of ordinal data and ordinal time series. Journal of the American Statistical Association 115 (531):1189–200. doi: 10.1080/01621459.2019.1604370.
Web of Science ®Google Scholar
Weiß, C. H. 2021a. On approaches for monitoring categorical event series. In Control Charts and Machine Learning for Anomaly Detection in Manufacturing, Springer Series in Reliability Engineering, ed. K.P. Tran, 105–29. Cham: Springer Nature Switzerland AG.
Google Scholar
Weiß, C. H. 2021b. Analyzing categorical time series in the presence of missing observations. Statistics in Medicine 40 (21):4675–90. doi: 10.1002/sim.9089.
PubMed Web of Science ®Google Scholar

Appendix A.

Tabulated ARL results of Section 4

Table A.1. Scenario 1: ARL performance of control charts [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] to [Equation3.11[3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] ].

Display Table

Table A.2. Scenario 2: ARL performance of control charts [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] to [Equation3.11[3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] ].

Display Table

Table A.3. Scenario 3: ARL performance of control charts [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] to [Equation3.11[3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] ].

Display Table

Table A.4. Scenario 4: ARL performance of control charts [Equation3.1[3.1] $X_{t}^{2} = \sum_{j = 0}^{d} \frac{{(N_{t, j} - n p_{0, j})}^{2}}{n p_{0, j}} = n^{- 1} {(N_{t} - n p_{0})}^{⊤} diag {(p_{0})}^{- 1} (N_{t} - n p_{0}) .$ [3.1] ] to [Equation3.11[3.11] ${skew}_{t} = (\frac{2}{d} \sum_{j = 0}^{d - 1} {\hat{f}}_{t, j}) - 1,$ [3.11] ].

Display Table

Table A.5. In-control and out-of-control PMFs for Scenarios 1–4.

Display Table

Appendix B.

On a CDF-based Pearson-type GoF-Statistic

Let X be an ordinal random variable with PMF p such that $p_{j} > 0$ for all $j = 0, \dots, d$ (otherwise, the range $S$ could be reduced by removing the categories with probability zero); as before, let f be the corresponding CDF. If the sample CDF $\hat{f}$ is computed from n i.i.d. replicates of X, then its asymptotic distribution is known from (Kiesl Citation2003, p. 99): we have $\sqrt{n} (\hat{f} - f) \overset{d}{\to} N (0, Σ),$ where the covariance matrix $Σ = {(σ_{i, j})}_{i, j = 0, \dots, d - 1}$ has the entries $σ_{i, j} = f_{\min {i, j}} - f_{i} f_{j} .$ Thus, a Pearson-type GoF-statistic based on the sample CDF, as well as its asymptotic distribution, are immediately implied as (B.1) ${\tilde{X}}^{2} = n {(\hat{f} - f)}^{⊤} Σ^{- 1} (\hat{f} - f) \overset{d}{\to} χ_{d}^{2},$ (B.1) provided that $Σ^{- 1}$ exists. In what follows, we show that this CDF-based statistic ${\tilde{X}}^{2}$ is indeed identical to the ordinary Pearson statistic $X^{2}$ from (3.1).

Lemma B.1.

If $p_{j} > 0$ for all $j = 0, \dots, d$ , then the inverse matrix $Σ^{- 1} = {(s_{i j})}_{i, j = 0, \dots, d - 1}$ exists and is symmetric, i.e., s_ij = s_ji. Its entries s_ij for $j \geq i$ are given by $s_{i j} = {\begin{matrix} \frac{1}{p_{i}} + \frac{1}{p_{i + 1}} & if j = i \in {0, \dots, d - 1}, \\ - \frac{1}{p_{i + 1}} & if j = i + 1 \in {1, \dots, d - 1}, \\ 0 & otherwise . \end{matrix}$

Here, it is understood that $Σ^{- 1} = \frac{1}{p_{0}} + \frac{1}{p_{1}} \in (1; \infty)$ if d = 1.

Note that $Σ^{- 1}$ is a (symmetric) tridiagonal matrix according to Lemma B.1.

Proof.

We prove Lemma B.1 by showing that the given expression for $Σ^{- 1}$ satisfies $Σ Σ^{- 1} = I,$ where I denotes the d × d-identity matrix. If d = 1, then $σ_{00} s_{00} = f_{0} (1 - f_{0}) (\frac{1}{p_{0}} + \frac{1}{p_{1}}) = p_{0} p_{1} \frac{p_{1} + p_{0}}{p_{0} p_{1}} = 1.$

If d = 2, then $\begin{matrix} Σ Σ^{- 1} = (\begin{matrix} f_{0} (1 - f_{0}) & f_{0} (1 - f_{1}) \\ f_{0} (1 - f_{1}) & f_{1} (1 - f_{1}) \end{matrix}) (\begin{matrix} \frac{1}{p_{0}} + \frac{1}{p_{1}} & - \frac{1}{p_{1}} \\ - \frac{1}{p_{1}} & \frac{1}{p_{1}} + \frac{1}{p_{2}} \end{matrix}) \\ = (\begin{matrix} \frac{f_{0} (1 - f_{0})}{p_{0}} + \frac{f_{0} (f_{1} - f_{0})}{p_{1}} & \frac{f_{0} (1 - f_{1})}{p_{2}} + \frac{f_{0} (f_{0} - f_{1})}{p_{1}} \\ \frac{f_{0} (1 - f_{1})}{p_{0}} + \frac{(f_{0} - f_{1}) (1 - f_{1})}{p_{1}} & \frac{f_{1} (1 - f_{1})}{p_{2}} + \frac{(f_{1} - f_{0}) (1 - f_{1})}{p_{1}} \end{matrix}) \\ = (\begin{matrix} (1 - f_{0}) + f_{0} & f_{0} - f_{0} \\ (1 - f_{1}) - (1 - f_{1}) & f_{1} + (1 - f_{1}) \end{matrix}) = (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}) . \end{matrix}$

For $d \geq 3,$ the entries of $Σ Σ^{- 1},$ namely $\sum_{k = 0}^{d - 1} σ_{i k} s_{k j}$ for $i, j \in {0, \dots, d - 1},$ consist of at most three summands because of the tridiagonal structure of $Σ^{- 1} .$ This happens if $1 \leq j \leq d - 2,$ whereas $j \in {0, d - 1}$ leads to two summands of the same structure as in the case d = 2. Thus, let us focus on $1 \leq j \leq d - 2 :$ $\begin{matrix} \sum_{k = 0}^{d - 1} σ_{i k} s_{k j} = σ_{i, j - 1} s_{j - 1, j} + σ_{i j} s_{j j} + σ_{i, j + 1} s_{j + 1, j} \\ = σ_{i j} (\frac{1}{p_{j}} + \frac{1}{p_{j + 1}}) - σ_{i, j - 1} \frac{1}{p_{j}} - σ_{i, j + 1} \frac{1}{p_{j + 1}} \\ = \frac{1}{p_{j}} (σ_{i j} - σ_{i, j - 1}) + \frac{1}{p_{j + 1}} (σ_{i j} - σ_{i, j + 1}) \\ = \frac{1}{p_{j}} (f_{\min {i, j}} - f_{\min {i, j - 1}} - f_{i} (f_{j} - f_{j - 1})) \\ + \frac{1}{p_{j + 1}} (f_{\min {i, j}} - f_{\min {i, j + 1}} + f_{i} (f_{j + 1} - f_{j})) \\ = \frac{1}{p_{j}} (f_{\min {i, j}} - f_{\min {i, j - 1}}) + \frac{1}{p_{j + 1}} (f_{\min {i, j}} - f_{\min {i, j + 1}}) . \end{matrix}$

It remains to distinguish between diagonal (i = j) and non-diagonal ( $i = j$ ) elements: $\sum_{k = 0}^{d - 1} σ_{i k} s_{k j} = {\begin{matrix} \frac{1}{p_{j}} (f_{i} - f_{i}) + \frac{1}{p_{j + 1}} (f_{i} - f_{i}) = 0 & if i < j, \\ \frac{1}{p_{j}} (f_{j} - f_{j - 1}) + \frac{1}{p_{j + 1}} (f_{i} - f_{i}) = 1 & if i = j, \\ \frac{1}{p_{j}} (f_{j} - f_{j - 1}) + \frac{1}{p_{j + 1}} (f_{j} - f_{j + 1}) = 0 & if i > j . \end{matrix}$

This completes the proof of Lemma B.1. □

Using Lemma B.1, we can calculate the statistic ${\tilde{X}}^{2}$ according to (B.1).

Theorem B.2.

If $p_{j} > 0$ for all $j = 0, \dots, d$ , then the statistic ${\tilde{X}}^{2}$ according to (B.1) equals ${\tilde{X}}^{2} = n \sum_{j = 0}^{d} \frac{{({\hat{p}}_{j} - p_{j})}^{2}}{p_{j}} = \sum_{j = 0}^{d} \frac{{(N_{j} - n p_{j})}^{2}}{n p_{j}} = X^{2} .$

Proof.

Using the notations of Lemma B.1, we compute $\begin{matrix} {\tilde{X}}^{2} = n \sum_{i, j = 0}^{d - 1} ({\hat{f}}_{i} - f_{i}) s_{i j} ({\hat{f}}_{j} - f_{j}) \\ = n \sum_{i = 0}^{d - 1} ({\hat{f}}_{i} - f_{i})^{2} s_{i i} + 2 n \sum_{i = 0}^{d - 2} ({\hat{f}}_{i} - f_{i}) ({\hat{f}}_{i + 1} - f_{i + 1}) s_{i, i + 1} \\ = n {({\hat{f}}_{0} - f_{0})}^{2} \frac{1}{p_{0}} + n \sum_{i = 1}^{d - 1} ({\hat{f}}_{i} - f_{i})^{2} \frac{1}{p_{i}} + n {({\hat{f}}_{d - 1} - f_{d - 1})}^{2} \frac{1}{p_{d}} \\ + n \sum_{i = 0}^{d - 2} ({\hat{f}}_{i} - f_{i})^{2} \frac{1}{p_{i + 1}} - 2 n \sum_{i = 0}^{d - 2} ({\hat{f}}_{i} - f_{i}) ({\hat{f}}_{i + 1} - f_{i + 1}) \frac{1}{p_{i + 1}} . \end{matrix}$

Using the index transformation $\sum_{i = 1}^{d - 1} ({\hat{f}}_{i} - f_{i})^{2} \frac{1}{p_{i}} = \sum_{i = 0}^{d - 2} ({\hat{f}}_{i + 1} - f_{i + 1})^{2} \frac{1}{p_{i + 1}},$ and applying the binomial formula, it follows that $\begin{matrix} {\tilde{X}}^{2} = n {({\hat{f}}_{0} - f_{0})}^{2} \frac{1}{p_{0}} + n {({\hat{f}}_{d - 1} - f_{d - 1})}^{2} \frac{1}{p_{d}} \\ + n \sum_{i = 0}^{d - 2} (({\hat{f}}_{i + 1} - f_{i + 1}) - ({\hat{f}}_{i} - f_{i}))^{2} \frac{1}{p_{i + 1}} \\ = n {({\hat{f}}_{0} - f_{0})}^{2} \frac{1}{p_{0}} + n {({\hat{p}}_{d} - p_{d})}^{2} \frac{1}{p_{d}} + n \sum_{i = 0}^{d - 2} ({\hat{p}}_{i + 1} - p_{i + 1})^{2} \frac{1}{p_{i + 1}} . \end{matrix}$

This completes the proof of Theorem B.2. □

Appendix C.

Some notes on the ULSO chart

In what follows, let us abbreviate $P = diag (p_{0}) - p_{0} p_{0}^{⊤} .$

Proposition C.1.

If d = 1, then the matrix $Q P Q^{⊤}$ in (3.4) is not invertible.

Proof.

If d = 1, then $f_{0, 1} = 1$ and $f_{0, 0} = p_{0, 0} = 1 - p_{0, 1};$ recall that generally, we have $η (0) = η (1) = 0 .$ Hence, $Q = (\begin{matrix} p_{0, 0} - 1 & p_{0, 0} \\ \frac{η (f_{0, 0})}{p_{0, 0}} & - \frac{η (f_{0, 0})}{1 - p_{0, 0}} \end{matrix}), P = p_{0, 0} (1 - p_{0, 0}) (\begin{matrix} 1 & - 1 \\ - 1 & 1 \end{matrix}) .$

Thus, it follows that $Q P Q^{⊤} = (\begin{matrix} p_{0, 0} (1 - p_{0, 0}) & - η (f_{0, 0}) \\ - η (f_{0, 0}) & \frac{η {(f_{0, 0})}^{2}}{p_{0, 0} (1 - p_{0, 0})} \end{matrix}) .$

But the latter matrix has determinant 0 and is, thus, not invertible. □

Proposition C.2.

If d = 2, then the ULSO statistic (3.4) agrees with the Pearson statistic (3.1).

Proof.

If d = 2, then $f_{0, 2} = 1, f_{0, 1} = 1 - p_{0, 2}, f_{0, 0} = p_{0, 0},$ and $p_{0, 1} = 1 - p_{0, 0} - p_{0, 2} .$ Thus, $Q = (\begin{matrix} p_{0, 0} - 1 & p_{0, 0} - p_{0, 2} & 1 - p_{0, 2} \\ \frac{η (f_{0, 0})}{p_{0, 0}} & \frac{η (f_{0, 1}) - η (f_{0, 0})}{1 - p_{0, 0} - p_{0, 2}} & - \frac{η (f_{0, 1})}{p_{0, 2}} \end{matrix}),$ and $P = (\begin{matrix} p_{0, 0} (1 - p_{0, 0}) & - p_{0, 0} (1 - p_{0, 0} - p_{0, 2}) & - p_{0, 0} p_{0, 2} \\ - p_{0, 0} (1 - p_{0, 0} - p_{0, 2}) & (p_{0, 0} + p_{0, 2}) (1 - p_{0, 0} - p_{0, 2}) & - p_{0, 2} (1 - p_{0, 0} - p_{0, 2}) \\ - p_{0, 0} p_{0, 2} & - p_{0, 2} (1 - p_{0, 0} - p_{0, 2}) & p_{0, 2} (1 - p_{0, 2}) \end{matrix}) .$

This time, $Q P Q^{⊤}$ equals $(\begin{matrix} (1 - p_{0, 0}) (1 - p_{0, 2}) (p_{0, 0} + p_{0, 2}) & - η (f_{0, 1}) (1 - p_{0, 0}) - η (f_{0, 0}) (1 - p_{0, 2}) \\ - η (f_{0, 1}) (1 - p_{0, 0}) - η (f_{0, 0}) (1 - p_{0, 2}) & \frac{{(η (f_{0, 1}) p_{0, 0} + η (f_{0, 0}) p_{0, 2})}^{2} - η {(f_{0, 1})}^{2} p_{0, 0} - η {(f_{0, 0})}^{2} p_{0, 2}}{p_{0, 0} p_{0, 2} (p_{0, 0} + p_{0, 2} - 1)} \end{matrix})$ and has the non-zero determinant $\det (Q P Q^{⊤}) = \frac{{(η (f_{0, 0}) p_{0, 2} (1 - p_{0, 2}) - η (f_{0, 1}) p_{0, 0} (1 - p_{0, 0}))}^{2}}{p_{0, 0} p_{0, 2} (1 - p_{0, 0} - p_{0, 2})} .$

Defining E as the matrix of ones, after tedious calculations, we can express $V = Q^{⊤} {(Q P Q^{⊤})}^{- 1} Q$ in (3.4) as $V = (\begin{matrix} \frac{1}{p_{0, 0}} - 1 & - 1 & - 1 \\ - 1 & \frac{1}{1 - p_{0, 0} - p_{0, 2}} - 1 & - 1 \\ - 1 & - 1 & \frac{1}{p_{0, 2}} - 1 \end{matrix}) = diag {(p_{0})}^{- 1} - E .$

As $\sum_{j = 0}^{d} N_{t, j} = n = \sum_{j = 0}^{d} n p_{0, j},$ it follows that $(N_{t} - n p_{0}) ^{⊤} E (N_{t} - n p_{0}) = 0,$ such that $n^{- 1} (N_{t} - n p_{0}) ^{⊤} V (N_{t} - n p_{0})$ is equal to statistic (3.1). □

Appendix D.

Phase-II control charts of section 5

Figure D.1. Shewhart control charts applied to flash data, where alarm times highlighted by dashed line.

Figure D.2. Memory-type control charts (EWMA version of charts from , or CUSUM and SR) for flash data, where alarm times highlighted by dashed line.

A review and comparison of control charts for ordinal samples

Abstract

1. Introduction