Full article: Multiple comparisons of mean vectors with large dimension under general conditions

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

Multiple comparisons for two or more mean vectors are considered when the dimension of the vectors may exceed the sample size, the design may be unbalanced, populations need not be normal, and the true covariance matrices may be unequal. Pairwise comparisons, including comparisons with a control, and their linear combinations are considered. Under fairly general conditions, the asymptotic multivariate distribution of the vector of test statistics is derived whose quantiles can be used in multiple testing. Simulations are used to show the accuracy of the tests. Real data applications are also demonstrated.

KEYWORDS:

1. Introduction

The objective of this work is to present multiple comparisons for mean vectors in a multi-sample problem where the populations need not necessarily be normal, sample sizes and covariance matrices may be unequal, and the dimension of the vectors may exceed the sample sizes. Precisely, let $X_{i k} = (X_{i k 1}, \dots, X_{i k p})^{'} \sim F_{i}$ , $k = 1, \dots, n_{i}$ , be iid random vectors with $E (X_{i k}) = μ_{i} \in R^{p}$ , $Cov (X_{i k}) = Σ_{i} \in R_{> 0}^{p \times p}$ , $i = 1, \dots, g \geq 2$ , where $R_{> 0}^{p \times p}$ denotes the space of real, symmetric, positive-definite, $p \times p$ matrices and $F_{i}$ denotes the distribution function for ith population.

We are interested to develop multiple comparison procedures (MCP) or, correspondingly, simultaneous confidence intervals (SCI), for difference of mean vectors, by relaxing the usual linear model assumptions, e.g. normality and homoscedasticity. Thus, $F_{i}$ may be non-normal and $Σ_{i}$ may be unequal which, along with $n_{i}$ also allowed to be unequal (unbalanced design), implies a complete multi-sample Behrens-Fisher problem. Further, we allow p to be large, even $p ≫ n_{i}$ . These comparisons are of interest as a first post hoc investigation after a global MANOVA hypothesis of equality of all mean vectors is rejected; see Seber [Citation1] or Johnson and Wichern [Citation2].

The multivariate theory offers a number of solutions to this problem for the classical case, $p < n_{i}$ , particularly assuming normality and homoscedasticity. The global MANOVA hypotheses are mostly tested by the likelihood-ratio criterion such as Wikls' Λ and its rejection follows by finding out the mean vectors responsible for the global rejection. It commonly begins with a general strategy for a set of comparisons defined as linear combination, $a^{'} δ_{i j}$ , $a \in R^{p}$ , where $δ_{i j} = μ_{i} - μ_{j}$ , $i \neq j$ . A case of particular interest is of pairwise differences $δ_{i j}$ themselves which includes all possible differences as well as special cases such as comparisons with a control.

The classical case of such comparisons has been extensively investigated; see e.g. Krishnaiah [Citation3,Citation4], Wijsman [Citation5], Kropf [Citation6], Kropf and Läuter [Citation7], Westfall et al. [Citation8], Läuter et al. [Citation9], Conneely and Boehnke [Citation10], Westfall and Troendle [Citation11], Bretz et al. [Citation12], Dickhaus [Citation13], Goeman and Finos [Citation14], Goeman and Solari [Citation15], Guilbaud [Citation16,Citation17], where Dickhaus [Citation18] is a modern, comprehensive book length reference with exhaustive bibliography.

The classical methods for MCP or SCI do not work when $p ≫ n_{i}$ and need to be modified. The recent wave of high-dimensional data has motivated a thorough inquiry into new avenues for simultaneous inference which, already complicated enough as compared to global testing, is further exacerbated by the largeness of dimensionality. Of particular concern are the fields like genetics, microarray, agriculture, fMRI, psychology where analysing umpteen amounts of data has become a norm rather than exception; see e.g. Nichols and Hayasaka [Citation19] and Dickhaus [Citation18].

The multiple comparisons introduced in this paper are applicable for such high-dimensional data which, additionally, do not depend on usual assumptions such as normality and homoscedasticity. In fact, concerning normality, the tests can be used for any distribution with finite fourth moment across p-dimensional vector. A distinguishing feature of the proposed tests is that we exclusively derive asymptotic joint distribution of the entire vector of preliminary tests whose quantiles can be directly used to test any number of comparisons of g man vectors. Under a few, mild assumptions, the asymptotic covariance matrix turns out be of very simple form and particularly sparse, not only making the derivation of the limit distribution convenient but also enhancing the applicability of the proposed tests under fairly general conditions.

We begin in the next section with a concise notational set up, to be used throughout the paper, followed by the main tests and their properties. A simulation based evaluation is given in Section 3 and applications are given in Section 4. Section 5 summarizes the main points.

2. Test statistics and their properties

2.1. Notations and preliminary set up

Let the vectors $X_{i k} \in R^{p}$ , $k \in {1, \dots, n_{i}}$ , $i \in {1, \dots, g}$ , as defined above, be generated by a probability space ( $X$ , $A$ , $P_{θ}$ ) where the probability measure $P_{θ}$ is indexed with parameter $θ \in Θ$ and Θ is the parameter space, not necessarily finite. Then $X_{i} = (X_{1}^{'} \dots X_{n_{i}}^{'}) \in R^{n_{i} \times p}$ is the data matrix for ith sample and $X = (X_{1}^{'}, \dots, X_{n_{g}}^{'})^{'} \in R^{n \times p}$ , $n = \sum_{i = 1}^{g} n_{i}$ , with parameter space ${Γ, Ξ}$ , where $Γ = E (X) = (1_{n_{1}}^{'} \otimes μ_{1} | \dots | 1_{n_{g}}^{'} \otimes μ_{g})^{'}$ , $Ξ = Cov (X) = \oplus_{i = 1}^{g} (I_{n_{i}} \otimes Σ_{i})$ with $Cov (X_{i}) = I_{n_{i}} \otimes Σ_{i}$ using $Cov (X_{i k}) = Σ_{i}$ $\forall i$ , where ⊕ and ⊗ are the Kronecker sum and Kronecker product, respectively. Let ${\bar{X}}_{i} = \sum_{k = 1}^{n} X_{i k} / n_{i}$ and ${\hat{Σ}}_{i} = \sum_{k = 1}^{n} {\tilde{X}}_{i k} {\tilde{X}}_{j k}^{'} / (n_{i} - 1)$ be the usual unbiased estimators of $μ_{i}$ and $Σ_{i}$ with ${\tilde{X}}_{i k} = X_{i k} - {\bar{X}}_{i}$ , or, using the ith data matrix, $i = 1, \dots, g$ , (1) ${\bar{X}}_{i} = \frac{1}{n_{i}} X_{i}^{'} 1_{n_{i}}, {\hat{Σ}}_{i} = \frac{1}{n_{i} - 1} X_{i}^{'} C_{n_{i}} X_{i},$ (1) where $C_{n_{i}} = I_{n_{i}} - J_{n_{i}} / n_{i}$ is centering matrix, $I$ is identity matrix, $J = 11^{'}$ and $1$ a vector of 1 s.

Let $H = {H_{I} : I \in I}$ be a family of hypotheses, finite or infinite, with card ${I} = G$ , corresponding to families of distributions ${P_{θ} : θ \in Θ_{I}}$ with parameter space $Θ_{I}$ bifurcated into $Θ_{0, I}$ and $Θ_{1, I} = Θ_{I} ∖ Θ_{0, I}$ , according to $H_{I}$ being null ( $(H_{0, I}$ ) or alternative ( $H_{1, I}$ ) hypothesis, where $Θ_{0} \cup Θ_{1} = Θ$ , $Θ_{0} \cap Θ_{1} = \emptyset$ . A (non-randomized) test for each $H_{I}$ is carried out using a test statistic $T_{I}$ with its space $T_{I}$ , which similarly bifurcates the sample space into $X_{0, I}$ and $X_{1, I}$ , with a binary decision φ: $T_{I} \to {0, 1}$ where φ = 1 (0) when $H_{I}$ is rejected (accepted).

As usual, the power function $β (θ_{I} | Θ_{I})$ = α (size) if $Θ_{I} = Θ_{0, I}$ and $1 - β$ (power) if $Θ_{I} = Θ_{1, I}$ . For a sample $X \in X$ , $p_{I} = sup_{θ \in Θ_{0, I}} P (T_{o} \geq c_{α})$ is the p-value of $T_{I}$ with observed value $T_{o}$ and critical value $c_{α}$ . The problem of MCP pertains to simultaneously testing a set of G hypotheses $H_{0, I} : θ \in Θ_{0, I} vs . H_{1, I} : θ \in Θ_{1, I}, I \in I, card {I} = G .$ For pairwise comparisons of $μ_{i}$ , we have $θ = δ_{i j} = μ_{i} - μ_{j}$ , $i \neq j$ , with $G = (\binom{g}{2}) = g (g - 1) / 2$ , and for comparisons with a control, $θ = δ_{1 j} = μ_{1} - μ_{j}$ with G=g−1, $j = 2, \dots, g$ , assuming, without loss of generality, sample 1 as control. In either case, we essentially deal with a vector of test statistics $T \in R^{G}$ and corresponding vector of observed p-values, $p \in (0, 1)^{\otimes G}$ .

With several tests being carried out simultaneously, the most serious issue in multiple testing is to effectively control α, i.e. reduce the chance of false positives (FP). Let $I_{0} \subset I$ be the subset corresponding to the true null hypotheses, $H_{0} = {H_{0, I} : I \in I_{0}}$ , with card ${I_{0}} = G_{0} \leq G$ , and $R \subset I$ be the subset for which $H_{0, I}$ is rejected. Then $f_{m}$ = card ${R \cap I_{0}}$ refers to the set of FPs (rejected true hypotheses or type I errors), so that $r_{m}$ = card ${R ∖ I_{0}}$ is the index of true positives or TPs (rightly rejected null hypotheses or power of test). We, therefore, are interested to keep $f_{m}$ ( $r_{m})$ as small (large) as possible. Several error control procedures can be adopted, subject to research questions. For details, see e.g. Hochberg and Tamhane [Citation20], Bretz et al. [Citation12], Dickhaus [Citation18], Goeman and Solari [Citation15], Hemerik and Goeman [Citation21].

In practice, family-wise error control (in the strong sense), FWEs, is the most desired error control and will be our main target in the sequel. It is the proportion of all FPs, i.e. $P (f_{m} > 0)$ . The simplest way to control FWEs is through Bonferroni inequality which ensures $P (f_{m} > 0) \leq G_{0} α / G \leq α$ , where equality holds in most cases since $G_{0} = G$ , i.e. each of G tests has $α / G$ chance for FP. It offers an efficient control for small to moderate G but is obviously conservative (or has less power) as G becomes large. An alternative option is the false discovery rate, FDR = $E [{f_{m} / (f_{m} + r_{m})} 1_{{f_{m} + r_{m} \geq 1}}]$ with $1_{{\cdot}}$ as indicator function; see e.g. Dickhaus [Citation18, Ch. 1].

Among other notations used in the sequel, a vector $a \in R^{p}$ is a column vector with norm $∥ a ∥^{2}$ = $⟨ a, a ⟩$ and a matrix norm is Frobenius $∥ A ∥^{2} = tr (A^{2})$ . The test statistics are formulated as linear combinations of second-order U-statistics of symmetric (product) kernels, $h (\cdot) : R^{p} \mapsto R$ , defined as bilinear forms of independent vectors. With $h (\cdot)$ a measurable, possibly degenerate, square-integrable, $\int h^{2} d P < \infty$ , function, the set up conforms to a Hilbert space $L_{2} (H)$ equipped with inner product $⟨ \cdot, \cdot ⟩ : R^{p} \to R$ , so that $h (\cdot)$ , with an orthonormal decomposition, is a Hilbert-Schmidt kernel; see van der Vaart [Citation22] or Lee [Citation23]. This helps us study the properties of test statistics under flexible conditions, the subject of next section.

2.2. Test statistics and their properties

For the data set up in Section 2.1, let $T_{I} = T_{i j}$ be the test statistic for a (preliminary) hypothesis $H_{0, I} = H_{0 i j} : δ_{i j} = 0$ with $δ_{G} \in R^{G}$ the vector of all hypotheses to be simultaneously tested. Thus, for all pairwise differences, $δ_{i j} : μ_{i} - μ_{j}$ , i<j, with $G = g (g - 1) / 2$ , $δ_{G} = (δ_{11}, \dots, δ_{g - 1, g})^{'}$ where (2) $T_{G} = (T_{1}, \dots, T_{g - 1})^{'} = (T_{12}, \dots, T_{1 g}, T_{23}, \dots, T_{2 g}, \dots, T_{g - 2, g}, T_{g - 1, g})^{'},$ (2) is the vector of test statistics, a set of simultaneous tests for $H_{0} : δ_{G} = 0$ , with $T_{i} = (T_{i, i + 1}, \dots, T_{i g})^{'}$ , $i = 1, \dots, g - 1$ . Our strategy begins by defining $T_{i j}$ , a test statistic for $H_{0 i j}$ , valid for $p ≫ n_{i}$ where $F_{i}$ may be non-normal and $Σ_{i}$ may be unequal. The limit of $T_{i j}$ is derived under flexible conditions since the multiple tests heavily rest on the properties of $T_{i j}$ . Using these properties, we derive the joint distribution of $T_{G}$ to be used for MCP for any G. The most salient feature is that the effect of high-dimensionality, $p \to \infty$ , is taken care of in $T_{i j}$ , so that the limit of $T_{G}$ is mainly influenced by g or G. Now, to define $T_{i j}$ , consider $Q_{i j 0} = U_{i} + U_{j} - 2 U_{i j}$ where (3) $U_{i} = \frac{1}{n_{i} (n_{i} - 1)} \underset{k \neq r}{\sum_{k = 1}^{n_{i}} \sum_{r = 1}^{n_{i}}} h (X_{i k}, X_{i r}), U_{i j} = \frac{1}{n_{i} n_{j}} \sum_{k = 1}^{n_{i}} \sum_{l = 1}^{n_{j}} h (X_{i k}, X_{j l}),$ (3) are one- and two-sample U-statistics, respectively, with symmetric kernels $h (X_{i k}, X_{i r}) = X_{i k}^{'} X_{i r} / p$ , $h (X_{i k}, X_{j l}) = X_{i k}^{'} X_{j l} / p$ , $k, r = 1, \dots, n_{i}$ , $k \neq r$ , $l = 1, \dots, n_{j}$ , $i, j = 1, \dots, g$ , $i \neq j$ , $n_{i j} = n_{i} + n_{j}$ . Now $E (Q_{i j 0}) = ∥ δ_{i j} ∥^{2} = 0$ under $H_{0 i j}$ , $δ_{i j} = μ_{i} - μ_{j}$ , so that $Q_{i j 0}$ can be used to test $H_{0 i j}$ . For scaling and appropriate limit, also consider $Q_{i j 1} = Q_{i 1} + Q_{j 1}$ , $Q_{i 1} = (E_{i} - U_{i}) / n_{i}$ , $E_{i} = \sum_{k = 1}^{n_{i}} X_{i k}^{'} X_{i k} / n_{i}$ . Note that, $Q_{i 1} = tr ({\hat{Σ}}_{i}) / n_{i}$ ⇒ $Q_{i j 1} = tr ({\hat{Σ}}_{i j 0})$ , ${\hat{Σ}}_{i j 0} = {\hat{Σ}}_{i} / n_{i} + {\hat{Σ}}_{j} / n_{j}$ so that $E (Q_{i j 1}) = tr (Σ_{i j 0})$ , which is same under $H_{0 i j}$ and $H_{1 i j}$ , where $Σ_{i j 0} = Σ_{i} / n_{i} + Σ / n_{j}$ . Thus, writing $Q_{i j} = Q_{i j 1} + Q_{i j 0}$ , it follows that [see also Citation24] $E (Q_{i j}) = ∥ δ_{i j} ∥^{2} + tr (Σ_{i j 0}) = tr (Σ_{i j 0}) under H_{0 i j} .$ We thus define the two-sample test statistic for $H_{0 i j}$ as (4) $T_{i j} = 1 + \frac{n_{i j} Q_{i j 0}}{[n_{i j} Q_{i j 1} / p]} .$ (4) $T_{i j}$ is location-invariant so that we can assume $μ_{i} = 0$ $\forall i$ without loss of generality. $T_{i j}$ is defined in Ahmad [Citation25] as a modification of the Hotelling's two-sample $T^{2}$ statistic to test $H_{0 i j}$ for high-dimensional data under non-normality and heteroscedasticity. Recall $T^{2}$ = $(n_{i} n_{j} / n_{i j}) {\hat{δ}}_{i j}^{'} {\hat{Σ}}_{i j}^{- 1} {\hat{δ}}_{i j}$ where ${\hat{δ}}_{i j} = {\bar{X}}_{i} - {\bar{X}}_{j}$ and $\hat{Σ} = [(n_{i} - 1) {\hat{Σ}}_{i} + (n_{j} - 1) {\hat{Σ}}_{j}] / (n_{i} + n_{j} - 2)$ is pooled estimator of $Σ_{i} = Σ_{j} = Σ$ [Citation1, see e.g.]. The modification pertains to removing ${\hat{Σ}}^{- 1}$ , which does not exist when $p > n_{i}$ , and writing $∥ {\hat{δ}}_{i j} ∥^{2} = Q_{i j 1} + Q_{i j 0} = Q_{i j}$ since $∥ {\bar{X}}_{i} ∥^{2} = \sum_{k, r = 1}^{n_{i}} X_{i k}^{'} X_{i r} / n_{i}^{2} = (E_{i} - U_{i}) / n_{i} + U_{i}$ . Properties of $T_{i j}$ are studied under the following assumptions.

Assumption 2.1

$E (X_{i k s}^{4}) \leq γ < \infty$ , $i = 1, \dots, g$ , $\forall s = 1, \dots, p$ , $γ \in R^{+}$ .

Assumption 2.2

As $n_{i} \to \infty$ , $n_{i} / n \to ρ_{i} \in (0, \infty)$ , $i = 1, \dots, g$ .

Assumption 2.3

As $p \to \infty$ , $tr (Σ_{i}) / p = κ_{i} = O (1)$ , $i = 1, \dots, g$ .

Assumption 2.4

As $p \to \infty$ , $μ_{i}^{'} Σ_{k} μ_{j} / p^{2} = ψ_{i j}$ , $0 < ψ_{i j} < \infty$ , $i = 1, \dots, g$ , k=i or k=j.

The assumptions are stated for g samples for their further use in the sequel. Note that, by Assumption 2.3, $∥ Σ_{i} ∥^{2} / p^{2} = O (1)$ . If we let $λ_{i} \in R^{+}$ be the eigenvalues of $Σ_{i}$ , so that $ν_{i}$ be those of $Σ_{i} / p$ , $i \in {1, \dots, g}$ , then Assumption 2.3 and its consequence uniformly bound the first two moments of $ν_{i}$ . Assumption 2.1 is inevitably needed to compute moments of bilinear forms when normality is relaxed. Assumption 2.4 is only needed for distribution under the alternative.

Assumptions 2.2 and 2.3 are mild and frequently used in high-dimensional testing problems. In particular, Assumption 2.3 holds for many commonly used covariance structures. Consider, e.g. $Σ$ as compound symmetric (CS), $Σ = (1 - ρ) I + ρ J$ with $I$ as identity matrix, $J = 11^{'}$ , $1$ a vector of 1s, $- 1 / (p - 1) \leq ρ \leq 1$ . Then $tr (Σ^{r}) = O (p^{r})$ , r = 1, 2. Note that, unlike common practice in the literature, we need not assume similar bound for higher moments of the eigenvalues of $Σ$ , e.g. $tr (Σ^{2}) / p = O (1)$ which may collapse for many useful structures, including CS. Note also that CS belongs to spiked structures where a few eigenvalues dominate the rest, so that the proposed procedures hold for such structures as well. See also discussion after Assumption 2.6 below.

Under these assumptions, the limit of $T_{i j}$ , for $n,_{i}, p \to \infty$ , is given in Ahmad [Citation25]. First, $n_{i j} Q_{i j 1} / p ⟹ P ρ_{i}^{- 1} κ_{i} + ρ_{j}^{- 1} κ_{j} = \sum_{s = 1}^{\infty} (ρ_{i}^{- 1} ν_{s i} + ρ_{j}^{- 1} ν_{s j}) = K_{i j}$ , as $n_{i}, p \to \infty$ . The limit obviously approximates $E (Q_{i j 1}) = tr (Σ_{i j 0})$ and holds both under $H_{0 i j}$ and $H_{1 i j}$ . As $E (Q_{i j 0}) = ∥ δ_{i j} ∥^{2} = 0$ under $H_{0 i j}$ , the kernels of $U_{i}$ and $U_{i j}$ are degenerate, so that [Citation22] $n_{i} U_{i} ⟹ D \sum_{s = 1}^{\infty} ν_{i s} (z_{i s}^{2} - 1)$ , $\sqrt{n_{i} n_{j}} U_{i j} ⟹ D \sum_{s = 1}^{\infty} ν_{i s} z_{i s} z_{j s}$ , where $z_{i s} \sim N (0, 1)$ , iid. Then $n_{i j} Q_{i j 0} ⟹ D \sum_{s = 1}^{\infty} (ρ_{i}^{- 1} ν_{i s} z_{i s}^{2} + ρ_{j}^{- 1} ν_{i s} z_{j m}^{2} - 2 ρ_{i}^{- 1 / 2} ρ_{j}^{- 1 / 2} ν_{i j s} z_{i s} z_{j s}) - K_{i j}$ and, by Slutsky's lemma, (5) $T_{i j} ⟹ D \frac{1}{K_{i j}} \sum_{m = 1}^{\infty} (ρ_{i}^{- 1 / 2} ν_{i m}^{1 / 2} z_{i m} - ρ_{j}^{- 1 / 2} ν_{j m}^{1 / 2} z_{j m})^{2},$ (5) where the limiting moments, $E (T_{i j}) \approx 1$ , $Var (T_{i j}) \approx 2 \sum_{m = 1}^{\infty} (ρ_{1}^{- 1} ν_{1 m} + ρ_{2}^{- 1} ν_{2 m})^{2} / K_{i j}^{2}$ , approximate the first two moments of $χ_{f_{i j}}^{2} / f_{i j}$ , $f_{i j} = [tr (Ω_{0 i j})]^{2} / tr (Ω_{0 i j}^{2})$ , $Ω_{0 i j} = n Σ_{i j 0} / p$ . Thus $T_{i j} ⟹ D χ_{f_{i j}}^{2} / f_{i j}$ . The normal limit follows by an application of Hájek-Šidák Lemma [Citation26, p. 183]. The limit under $H_{1 i j}$ follows by the projection theory of U-statistics. We estimate $Var (T_{i j}) = σ_{T_{i j}}^{2}$ by using unbiased, consistent estimators of traces in $f_{i j}$ , i.e. $tr (Σ_{i}^{2})$ , $[tr (Σ_{i})]^{2}$ , $tr (Σ_{i} Σ_{j})$ , defined as $E_{2 i}$ = $η_{i} {(n_{i} - 1) (n_{i} - 2) tr ({\hat{Σ}}_{i}^{2})$ + $[tr ({\hat{Σ}}_{i})]^{2} - n_{i} Q_{i}}$ , $E_{3 i} = η_{i} {2 tr ({\hat{Σ}}_{i}^{2}) + (n_{i}^{2} - 3 n_{i} + 1) [tr ({\hat{Σ}}_{i})]^{2} - n_{i} Q_{i}}$ and $tr ({\hat{Σ}}_{1} {\hat{Σ}}_{2})$ , where $Q_{i} = \sum_{k = 1}^{n_{i}} ({\tilde{X}}_{i k}^{'} {\tilde{X}}_{i k})^{2} / (n_{i} - 1)$ , ${\tilde{X}}_{i} = X_{i k} - {\bar{X}}_{i}$ , $η_{i} = (n_{i} - 1) / [n_{i} (n_{i} - 2) (n_{i} - 3)]$ . The consistent estimator $\hat{Var} (T_{i j})$ can replace $Var (T_{i j})$ in $T_{i j}$ . Following theorem summarizes the limit. For proof and an extension to multi-sample case, see Ahmad [Citation25].

Theorem 2.5

For $T_{i j}$ in Equation (Equation4(4) $T_{i j} = 1 + \frac{n_{i j} Q_{i j 0}}{[n_{i j} Q_{i j 1} / p]} .$ (4) ), $(T_{i j} - E (T_{i j})) / σ_{T_{i j}} ⟹ D N (0, 1),$ $n_{i}, n_{j}, p \to \infty,$ under Assumptions 2.1–2.4. The limit remains valid by replacing $σ_{T_{i j}}^{2}$ with its consistent estimator defined above.

A few remarks concerning Theorem 2.5 will help us proceed further. First, the limit of $T_{i j}$ holds for any distribution with finite fourth moment. Second, the composition of $T_{i j}$ in terms of U-statistics helps us relax normality and obtain the limit conveniently as the kernels are simple bilinear forms of independent components. The accuracy of $T_{i j}$ for small or moderate $n_{i}$ and large p is shown through simulations in Ahmad [Citation25]. This also implies that the dimension p is taken care of in the limit of $T_{i j}$ , so that the extension to multiple comparisons will not be much influenced by p. Finally, as $Q_{i j 1}$ converges to $E (Q_{i j 1}) = tr (Σ_{i j 0})$ in probability, the limit of $T_{i j}$ mainly follows from $Q_{i j 0}$ . Thus, in extending the limit to $T_{G}$ , we mainly focus on $Q_{i j 0}$ . For this, note that (6) $\begin{aligned} Var (Q_{i j 0}) & = 2 ∥ Σ_{i j 0} ∥^{2} + 4 δ_{i j}^{'} Σ_{i j 0} δ_{i j} \end{aligned}$ (6) (7) $\begin{aligned} Cov (Q_{i j 0}, Q_{i j^{'} 0}) & = \frac{2}{n_{i}^{2}} ∥ Σ_{i} ∥^{2} + \frac{4}{n_{i}} δ_{i j}^{'} Σ_{i} δ_{i j^{'}} \end{aligned}$ (7) (8) $\begin{aligned} Cov (Q_{i j 0}, Q_{i^{'} j 0}) & = \frac{2}{n_{j}^{2}} ∥ Σ_{j} ∥^{2} + \frac{4}{n_{j}} δ_{i j}^{'} Σ_{j} δ_{i^{'} j} \end{aligned}$ (8) with $Cov (Q_{i j 0}, Q_{i^{'} j^{'} 0}) = 0$ for $i \neq i^{'}$ , $j \neq j^{'}$ (see Appendix) where, under $H_{0 i j}$ , (9) $Var (Q_{i j 0}) = 2 ∥ Σ_{i j 0} ∥^{2}, Cov (Q_{i j 0}, Q_{i j^{'} 0}) = \frac{2}{n_{i}^{2}} ∥ Σ_{i} ∥^{2}, Cov (Q_{i j 0}, Q_{i^{'} j 0}) = \frac{2}{n_{j}^{2}} ∥ Σ_{j} ∥^{2},$ (9) independent of $μ_{i}$ . Now, with $Q_{i 0} = (Q_{i j 0}, \dots, Q_{i g 0})^{'}$ , $i = 1, \dots, g - 1$ , consider the vector (10) $Q_{0} = (Q_{10}^{'}, \dots, Q_{g - 1, 0}^{'})^{'},$ (10) where $E (Q_{0}) = 0$ , $Cov (Q_{0}) = Λ = 2 (Λ_{i j} / p^{2})_{i, j = 1}^{G} \in R^{G \times G}$ , a partitioned matrix with diagonal and off-diagonal blocks $Cov (Q_{i 0})$ = $Λ_{i i} / p^{2} \in R^{(g - i) \times (g - i)}$ , $Cov (Q_{i 0}, Q_{j 0}) = Λ_{i j} / p^{2} \in R^{(g - i) \times (g - j)}$ , i.e. (11) $Λ_{i i} = \frac{1}{n_{i}^{2}} ∥ Σ_{i} ∥^{2} (J - I)_{g - i} + \oplus_{j = i + 1}^{g} ∥ Σ_{i j 0} ∥^{2}, Λ_{i j} = 0^{'} \frac{1}{n_{i}^{2}} ∥ Σ_{i} ∥^{2} 1_{g - i}^{'} \frac{1}{n_{j}^{2}} \oplus_{j = i + 2}^{g} ∥ Σ_{j} ∥^{2}$ (11) $i = 1, \dots, g - 1$ , $j = i + 1, \dots, g$ , $1$ is vector of 1s, $J = 11^{'}$ , $I$ is identity matrix, ⊕ is Kronecker sum and $0$ in $Λ_{i j}$ is of order $(j - i - 1) \times (g - j)$ with no zero row if j−i−1=0. A closer look at the structure of $Λ$ reveals several aspects which will simplify the computations that follow. Ignoring $p^{2}$ for simplicity, and denoting $a_{i} = ∥ Σ_{i} ∥^{2} / n_{i}^{2}$ , $a_{i j} = ∥ Σ_{i j 0} ∥^{2}$ , we can write (12) $Λ_{i i} = (\begin{matrix} a_{i, i + 1} & a_{i} & \dots & a_{i} \\ a_{i} & a_{i, i + 2} & \dots & a_{i} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ a_{i} & a_{i} & \dots & a_{i, g} \end{matrix}) .$ (12) For any given i, $Λ_{i i}$ has same off-diagonal element, $a_{i}$ , with diagonal elements $a_{i j}$ , where $Σ_{i j 0}$ = $Σ_{i} / n_{i} + Σ_{j} / n_{j}$ = $Cov ({\hat{δ}}_{i j})$ , j=i+1. For off-diagonal blocks $Λ_{i j}$ , $\begin{aligned} Λ_{12} & = (\begin{matrix} a_{2} & a_{2} & \dots & a_{2} \\ a_{3} & 0 & \dots & 0 \\ 0 & a_{4} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & a_{g} \end{matrix}), Λ_{13} = (\begin{matrix} 0 & 0 & \dots & 0 \\ a_{3} & a_{3} & \dots & a_{3} \\ a_{4} & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & a_{g} \end{matrix}), \\ Λ_{1, g - 2} & = (\begin{matrix} 0 & 0 \\ ⋮ & ⋮ \\ 0 & 0 \\ a_{g - 2} & a_{g - 2} \\ a_{g - 1} & 0 \\ 0 & a_{g} \end{matrix}), Λ_{1, g - 1} = (\begin{matrix} 0 \\ ⋮ \\ 0 \\ a_{g - 1} \\ a_{g} \end{matrix}) \\ Λ_{23} & = (\begin{matrix} a_{3} & a_{3} & \dots & a_{3} \\ a_{4} & 0 & \dots & 0 \\ 0 & a_{5} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & a_{g} \end{matrix}), Λ_{2, g - 2} = (\begin{matrix} 0 & 0 \\ ⋮ & ⋮ \\ 0 & 0 \\ a_{g - 2} & a_{g - 2} \\ a_{g - 1} & 0 \\ 0 & a_{g} \end{matrix}), \\ Λ_{2, g - 1} & = (\begin{matrix} 0 \\ ⋮ \\ 0 \\ a_{g - 1} \\ a_{g} \end{matrix}), Λ_{g - 2, g - 1} = (\begin{matrix} a_{g - 1} & 0 \\ 0 & a_{g} \end{matrix}) \end{aligned}$

The off-diagonal elements in $Λ_{i j}$ are mostly 0 and the number of (rows with) zeros increases with increasing j for every i, making $Λ$ an increasingly sparse matrix. However, the distinct non-zero elements in $Λ$ consist of a much smaller set (13) $\{tr (Σ_{i}^{2}), tr (Σ_{i} Σ_{j}), i, j = 1, \dots, g, i < j\},$ (13) with cardinality $C_{e} = g (g + 1) / 2$ . Thus, for any g, we only need to estimate $C_{e}$ out of $C_{T} = G (G + 1) / 2$ elements in order to estimate $Λ$ . For example, for g = 6, 9, 12, 15, 20 samples, $C_{e}$ = 21, 66, 78, 120, 210 whereas $C_{T}$ = 120, 1540, 2211, 5565, 18145, respectively. The consistent estimators of these traces are given before Theorem 2.5. Used as plug-in estimators, they lead to a consistent estimator, $\hat{Λ}$ , of $Λ$ . A further simplification follows from weak (mostly zero) off-diagonal elements as compared to diagonal ones, so that the following assumption holds trivially.

Assumption 2.6

$lim_{p \to \infty} ∥ Σ_{i} ∥^{2} / [{tr (Σ_{i} + Σ_{j})} {tr (Σ_{i} + Σ_{k})}] \to γ \in [0, 1)$ , $i \neq j \neq k = 1, \dots, g$ .

Although, Assumption 2.6 is kept flexible to adjust many covariance structures, it can be shown that the ratio indeed vanishes for most covariance structures, so that Assumption 2.6 encompasses many practical cases, including trivial ones e.g. $Σ \propto I$ ; see also Section 4. For the distribution of $T_{G}$ , consider the moments of $Q_{i j 0}$ in Equations (Equation6(6) $\begin{aligned} Var (Q_{i j 0}) & = 2 ∥ Σ_{i j 0} ∥^{2} + 4 δ_{i j}^{'} Σ_{i j 0} δ_{i j} \end{aligned}$ (6) )–(Equation9(9) $Var (Q_{i j 0}) = 2 ∥ Σ_{i j 0} ∥^{2}, Cov (Q_{i j 0}, Q_{i j^{'} 0}) = \frac{2}{n_{i}^{2}} ∥ Σ_{i} ∥^{2}, Cov (Q_{i j 0}, Q_{i^{'} j 0}) = \frac{2}{n_{j}^{2}} ∥ Σ_{j} ∥^{2},$ (9) ). Using the projection theory of U-statistics (Appendix), the projection of $Q_{i j 0}$ can be shown as ${\hat{Q}}_{i j 0} = 2 δ_{i j}^{'} \{({\bar{X}}_{i} - μ_{i}) - ({\bar{X}}_{j} - μ_{j})\} / p = 2 δ_{i j}^{'} \{({\bar{X}}_{i} - {\bar{X}}_{j}) - δ_{i j}\} / p,$ see [Citation25, Appendix B.2]. As ${\hat{Q}}_{i j 0}$ is composed of independent components and holds for any pair $(i, j)$ , the projection of $Q_{i 0}$ , hence of $Q_{0}$ , consists of sums of these independent components. Further, with $Q_{i j 1}$ converging to a constant in probability, the limit for $T_{G}$ follows conveniently by the Cramér-Wold device and Slutsky's lemma [Citation22]. Finally, using the plug-in consistent estimators of the elements of $Λ$ , the limit also extends to $\hat{Λ}$ . We have the following theorem.

Theorem 2.7

For $T_{G},$ the limit in Equation (Equation14(14) $a^{'} T_{G} ⟹ D N (a^{'} 1, a^{'} Λ a),$ (14) ) holds under Assumptions 2.1–2.6, as $n_{i}, p \to \infty$ . Further, the limit remains valid by replacing $Λ$ with its consistent estimator defined above.

As mentioned above, the off-diagonal elements in $Λ$ vanish under Assumption 2.6 for most covariance matrices, leaving $Λ$ diagonal. This makes the limit in Theorem 2.7 much easier to prove and simpler to use. In particular, with ${\hat{f}}_{i j}$ as the estimator of $f_{i j}$ , as discussed after Equation (Equation5(5) $T_{i j} ⟹ D \frac{1}{K_{i j}} \sum_{m = 1}^{\infty} (ρ_{i}^{- 1 / 2} ν_{i m}^{1 / 2} z_{i m} - ρ_{j}^{- 1 / 2} ν_{j m}^{1 / 2} z_{j m})^{2},$ (5) ), we can use the Chi-square limit with $Cov (T_{G}) \approx diag (2 / f_{12}, \dots, 2 / f_{g - 1, g})$ with $f_{i j}$ estimated as ${\hat{f}}_{i j}$ . Alternatively, the corresponding normal limit may be used. In fact, given the structure of the test statistics, and also because the normal limit follows through Chi-square limit, it has been observed that the Chi-square approximation mostly performs relatively better that the normal limit, and is thus strongly recommended for practical applications.

Note that, Theorem 2.7 implies that the limit also holds for any linear combination $a^{'} T_{G}$ , $a \in R^{G} ∖ {0}$ . With $E (T_{G}) \approx 1_{G}$ , we have, for $n_{i}, p \to \infty$ , (14) $a^{'} T_{G} ⟹ D N (a^{'} 1, a^{'} Λ a),$ (14) so that we can also test any linear combination $H_{0} : a^{'} δ_{G} = 0$ , particularly including any single $δ_{i j} = 0$ , using $\sqrt{2 / f_{i j}} (T_{i j} - 1) ⟹ D N (0, 1)$ . The corresponding 100(1 - α)% simultaneous confidence interval (SCI) for $a^{'} δ_{G}$ follows as (15) $a^{'} {\hat{T}}_{G} \mp z_{α / 2} \sqrt{a^{'} \hat{Λ} a},$ (15) where $z_{α / 2}$ is 100(α/2)% quantile of $N (0, 1)$ -distribution. Note that, the observed length of this confidence interval is $\hat{L} = 2 z_{α / 2} (a^{'} \hat{Λ} a)^{1 / 2}$ . By the consistency of $\hat{Λ}$ (Theorems 2.5-2.7) and the continuous mapping theorem, $E (\hat{L})$ converges to $a^{'} Λ a$ which, under the assumptions, is a finite value, assuming $∥ a ∥^{2} < \infty$ which holds conveniently.

The comparison of treatments with a control is a special case of all pairwise comparisons presented above. Let Sample 1 be treated as control, and the interest is to test it against all other samples, i.e. $H_{i 0} : δ_{1 i} = 0$ , $δ_{1 i} = μ_{1} - μ_{i}$ , $i = 2, \dots g$ . The vector of tests is (16) $T_{1} = (T_{12}, \dots, T_{1 g})^{'},$ (16) which is the first sub-vector of $T_{G}$ in Equation (Equation2(2) $T_{G} = (T_{1}, \dots, T_{g - 1})^{'} = (T_{12}, \dots, T_{1 g}, T_{23}, \dots, T_{2 g}, \dots, T_{g - 2, g}, T_{g - 1, g})^{'},$ (2) ). Using the related computations, we get $E (Q_{01}) = 0_{g - 1}$ , $Cov (Q_{01}) = Λ_{11}$ , the first diagonal block of $Λ$ , so that under the assumptions, $E (T_{1}) \approx 1_{g - 1}$ and, assuming zero off-diagonals, $Cov (T_{1}) = diag (2 / f_{12}, \dots, 2 / f_{1 g})$ . The multiple tests and corresponding confidence intervals follows from those given for $T_{G}$ above, without much changes.

3. Simulations

We do a simulation study to assess the performance of the proposed tests, in terms of their size control and power, and also their robustness to the violation of assumptions. We consider g = 3 and 6 samples and generate p-dimensional iid vectors from normal, uniform and exponential distributions. For g=3, we use $(n_{1}, n_{2}, n_{3}) = (10, 15, 20)$ , (20, 30, 40), (10, 30, 60) and (50, 75, 100), with $p \in {50, 300, 500, 1000}$ , where the last sample size triplet corresponds to large samples and penultimate triplet amounts to very unbalanced design. The other two triplets are used to show the accuracy of the tests for small to moderate sample sizes. We use three covariance structures, Compound Symmetry (CS), Autoregressive of order 1, AR(1), and unstructured (UN), defined, respectively, as $κ I + ρ J$ , $Cov (X_{i}, X_{j}) = κ ρ^{| i - j |}$ , $\forall k, l$ and $Σ = (σ_{i j})_{i, j = 1}^{d}$ with $σ_{i j} = 1 (1) d$ (i=j), $ρ_{i j} = (i - 1) / d$ (i>j), where $I$ is identity matrix and $J = 11^{'}$ is matrix of 1 s.

To include violation of homoscedasticity assumption, we combine the structures as (CS, AR(1, 0.5), AR(1, 0.7)), (AR(1, 0.5), AR(1, 0.7), UN), where 0.5 and 0.7 are ρ values used. We use $κ = 1$ for all cases. For g=6, we use $(n_{1}, n_{2}, \dots, n_{6}) = (10, 10, 10, 20, 20, 20)$ , (30, 40, 50, 30, 40, 50), (30, 40, 50, 60, 70, 80), with same covariance matrix combinations as used for g=3, repeated for first three and next three populations. Due to the close similarity of the results, we restrict the presentation of power to (CS, AR, AR) combination for g=3 and to normal and exponential distributions, with first two sample size sextuples, for g=6.

For both size and power, we use $α = 0.05$ . For g=3, we test all (three) pairwise hypotheses $δ_{i j} = 0$ , i<j, i,j=1, 2, 3, where for g=6, we do comparisons with (sample 1 as) control, that is, $H_{0} : δ_{1 j} = 0$ , $j = 2, \dots, 6$ . Moreover, for power, we add non-centrality parameter, defined as $ϑ = 0.2 (0.2) 1 q$ with $q = (1 / p, \dots, p / p)$ , to population 1 for both g = 3 and 6. This, for g = 3, affects tests for $δ_{12}$ and $δ_{13}$ , whereas for g = 6 and comparisons with control, it affects all tests. The p-values and power are estimated using the asymptotic distribution in Theorem 2.7, averaged over 1000 simulation runs.

For comparison, we also compute, under the same set up, size and power for the most commonly used multiple test procedure, namely max test, $T_{max}$ , with Bonferroni error control. We thus compute $T_{max} = max {T_{i j} : i, j = 1, \dots G, i < j}$ and use $α / G$ as nominal level to exercise Bonferroni control. Note that, both types of error control pertain to family-wise in the strong sense (FWEs); see Section 1. The estimated quantiles, $\hat{1 - α}$ and power, $\hat{1 - β}$ , are reported in Tables , respectively, for g=3 and 6.

Table 1. Estimated size of pairwise comparisons for g = 3: all distributions.

Display Table

Table 2. Estimated size of comparisons with a control for g=6: All distributions.

Display Table

Table 3. Estimated power of pairwise comparisons for g=3: All distributions.

Display Table

Table 4. Estimated power of comparisons with a control for g=6: All distributions.

Display Table

We observe an accurate size control by the proposed tests for both 3 and 6 samples, under all covariance structures and for all populations. The accuracy for exponential distribution as a serious non-normal case is particularly noticeable. Likewise is the case for the covariance structures involving CS, being highly spiked covariance matrix, with only two distinct eigenvalues. These results depict strong robustness of the tests against several violations of usual assumptions. Similar situation appears for power which steadily increases not only for increasing sample sizes but also for increasing dimension. Note the power converging quickly to 1 for sample sizes as small as 10 or 20, even for exponential distribution. Due to this, we reduce ϑ values for each p as soon as the power approaches its maximum value. For example, for p=500, power was already observed 1 for $ϑ = 0.4$ , hence not reported. We also note, in comparison, that $T_{max}$ often moves between being conservative to liberal and looses its stability, although it generally shows nice power.

To conclude, the proposed tests can be generally considered for most of practically used distributions and covariance structures, where the dimension may far exceed the sample size, and for a moderate number of independent samples. Note that, theoretically, the asymptotic covariance matrix of the vector of tests, $Λ$ , holds for any g, hence any G, but a large g is practically a rare phenomenon. In most cases, g is a moderate values like $g \leq$ 6 or 7, as compared to p which may run into thousands. In this context, the tests may find applicability in a wide array of practical problems. On the other hand, the largeness of g may, at least in a few special contexts, be of interest and is therefore being considered for a future work. It indeed needs a different sort of asymptotics to allow for $g \to \infty$ simultaneously with $p \to \infty$ .

4. Applications

We apply the proposed procedures to two data sets, heretofore called SRBCT and Species data, with g=4 and 5 samples, respectively. The first data set consists of small, round blue cell tumors (SRBCT) observed over four independent groups, including a normal group, with sizes $n_{1} = 29$ , $n_{2} = 25$ , $n_{3} = 11$ , $n_{4} = 18$ , with dimension p=2308 gene expressions. The second, species, data set consists of p=809 species counts of macrobenthos, observed from n=101 independent sites in five different regions, with sample sizes $n_{1} = 16$ , $n_{2} = 21$ , $n_{3} = 25$ , $n_{4} = 19$ , $n_{5} = 20$ , along a long transact of the Norwegian continental shelf.

We have $X = (X_{1}^{'}, \dots, X_{5}^{'})^{'} \in R^{n \times p}$ as complete data matrix with $X_{i} = (X_{i 1}^{'}, \dots, X_{i n_{i}}^{'})^{'} \in R^{n_{i} \times p}$ for ith sample, where $n_{i}$ and p are given above. Both data sets represent unbalanced one-way MANOVA designs with g=4 and 5 independent samples, with dimensions p=2308 and 809, and total sample size $n = \sum_{i = 1}^{5} n_{i} = 83$ and 101, respectively.

We begin by testing global hypotheses, i.e. $H_{0 g} : μ_{1} = \dots = μ_{g}$ vs $H_{15} : μ_{i} \neq μ_{j}$ for at least one pair $i \neq j$ , i, j = $1, \dots, g$ , with g=4 and 5, respectively. We use MANOVA test statistic proposed, under identical general conditions as used here, in Ahmad [Citation25]. The observed values of the test statistic, $T_{g}$ (see Equation 8 in the reference), for SRBCT data are 378.1604 and 45.7850, respectively, for Chi-square and normal approximations, with p-value virtually zero in each case. A detailed analysis of species data is already provided in Ahmad [Citation25, Sec. 5], by which $T_{g}$ = 180.4 and 40.61 for Chi-square and normal approximations, respectively, with p-values again zero. With global hypotheses strongly rejected, we expect to find vectors responsible for this rejection.

For multiple comparisons, we consider sample 1 as control and compare it with the remaining samples for Species data, i.e. we test $H_{01 j} : δ_{1 j} = 0$ , j=2, 3, 4, 5, with $G = 4$ , whereas we do all G=6 pairwise comparisons for SRBCT data, i.e. $H_{i j 0} : δ_{i j} = 0$ , i,j=1, 2, 3, 4, i<j. The vectors of test statistics for Species and SRBCT data, respectively, are computed as $T_{5} = (5.15, 12.24, 10.98, 10.36)^{'}, T_{6} = (5.17, 3.76, 5.32, 6.25, 5.43, 5.07)^{'},$ with the corresponding vectors of p-values $(0.004, 0, 0, 0)^{'}$ and $0_{6}$ . The results indicate all means, statistically, discernably different from each other at any reasonable nominal size. For further assessment, we also compute the $Λ$ matrix (see Equation Equation11(11) $Λ_{i i} = \frac{1}{n_{i}^{2}} ∥ Σ_{i} ∥^{2} (J - I)_{g - i} + \oplus_{j = i + 1}^{g} ∥ Σ_{i j 0} ∥^{2}, Λ_{i j} = 0^{'} \frac{1}{n_{i}^{2}} ∥ Σ_{i} ∥^{2} 1_{g - i}^{'} \frac{1}{n_{j}^{2}} \oplus_{j = i + 2}^{g} ∥ Σ_{j} ∥^{2}$ (11) ) for the two data sets, respectively, of order $4 \times 4$ and $5 \times 5$ , shown in Equations (Equation17(17) $\begin{aligned} \hat{Λ} & = (\begin{matrix} 2.326 & 0.032 & 0.068 & 0.055 \\ 0.032 & 3.459 & 0.163 & 0.131 \\ 0.068 & 0.163 & 2.846 & 0.275 \\ 0.055 & 0.131 & 0.275 & 4.177 \end{matrix}) \end{aligned}$ (17) ) and (Equation19(19) $\begin{aligned} \hat{Λ} & = (\begin{matrix} 15.559 & 0.014 & 0.022 & 0.019 & 0.028 & 0.000 \\ 0.014 & 7.272 & 0.016 & 0.090 & 0.000 & 0.096 \\ 0.022 & 0.016 & 5.748 & 0.000 & 0.036 & 0.026 \\ 0.019 & 0.090 & 0.000 & 8.358 & 0.018 & 0.014 \\ 0.028 & 0.000 & 0.036 & 0.018 & 6.695 & 0.023 \\ 0.000 & 0.096 & 0.026 & 0.014 & 0.023 & 5.655 \end{matrix}) \end{aligned}$ (19) ). It may be mentioned that the analysis reported above is based on Chi-square approximation which, as already discussed, has relatively better performance than the normal one, and the ratio in Assumption 2.6 is assumed to vanish, so that $\hat{Λ}$ are used as diagonal matrices. This can be easily witnessed from the matrices computed for the two data sets. It is clear that ignoring the off-diagonal elements does not cause much loss of information concerning the comparisons.

To expand more on this, and to highlight an additional important property of the proposed tests, ${\hat{Λ}}^{- 1}$ is also reported in each case; Equations (Equation18(18) $\begin{aligned} {\hat{Λ}}^{- 1} & = (\begin{matrix} 0.430 & - 0.003 & - 0.009 & - 0.005 \\ - 0.003 & 0.290 & - 0.016 & - 0.008 \\ - 0.009 & - 0.016 & 0.355 & - 0.023 \\ - 0.005 & - 0.008 & - 0.023 & 0.241 \end{matrix}) \end{aligned}$ (18) ) and (Equation20(20) $\begin{aligned} {\hat{Λ}}^{- 1} & = (\begin{matrix} 0.064 & 0.000 & 0.000 & 0.000 & 0.000 & 0.000 \\ 0.000 & 0.138 & 0.000 & - 0.002 & 0.000 & - 0.002 \\ 0.000 & 0.000 & 0.174 & 0.000 & - 0.001 & - 0.001 \\ 0.000 & - 0.002 & 0.000 & 0.120 & 0.000 & 0.000 \\ 0.000 & 0.000 & - 0.001 & 0.000 & 0.149 & - 0.001 \\ 0.000 & - 0.002 & - 0.001 & 0.000 & - 0.001 & 0.177 \end{matrix}) \end{aligned}$ (20) ). First, we observe that, estimated $Λ$ is a non-singular matrix, hence can be inverted, something that in fact can be shown for ${\hat{Λ}}_{G}$ in general. Second, this in turn implies that the tests can be defined as affine-invariant, using ${\hat{Λ}}^{- 1}$ . As we have not proved this inverse for the general case explicitly, it is left for a later work. Finally, we notice that the off-diagonal elements virtually vanish in the inverses. Thus, in affine-invariant form, the tests may be used even more safely under Assumption 2.6. (17) $\begin{aligned} \hat{Λ} & = (\begin{matrix} 2.326 & 0.032 & 0.068 & 0.055 \\ 0.032 & 3.459 & 0.163 & 0.131 \\ 0.068 & 0.163 & 2.846 & 0.275 \\ 0.055 & 0.131 & 0.275 & 4.177 \end{matrix}) \end{aligned}$ (17) (18) $\begin{aligned} {\hat{Λ}}^{- 1} & = (\begin{matrix} 0.430 & - 0.003 & - 0.009 & - 0.005 \\ - 0.003 & 0.290 & - 0.016 & - 0.008 \\ - 0.009 & - 0.016 & 0.355 & - 0.023 \\ - 0.005 & - 0.008 & - 0.023 & 0.241 \end{matrix}) \end{aligned}$ (18) (19) $\begin{aligned} \hat{Λ} & = (\begin{matrix} 15.559 & 0.014 & 0.022 & 0.019 & 0.028 & 0.000 \\ 0.014 & 7.272 & 0.016 & 0.090 & 0.000 & 0.096 \\ 0.022 & 0.016 & 5.748 & 0.000 & 0.036 & 0.026 \\ 0.019 & 0.090 & 0.000 & 8.358 & 0.018 & 0.014 \\ 0.028 & 0.000 & 0.036 & 0.018 & 6.695 & 0.023 \\ 0.000 & 0.096 & 0.026 & 0.014 & 0.023 & 5.655 \end{matrix}) \end{aligned}$ (19) (20) $\begin{aligned} {\hat{Λ}}^{- 1} & = (\begin{matrix} 0.064 & 0.000 & 0.000 & 0.000 & 0.000 & 0.000 \\ 0.000 & 0.138 & 0.000 & - 0.002 & 0.000 & - 0.002 \\ 0.000 & 0.000 & 0.174 & 0.000 & - 0.001 & - 0.001 \\ 0.000 & - 0.002 & 0.000 & 0.120 & 0.000 & 0.000 \\ 0.000 & 0.000 & - 0.001 & 0.000 & 0.149 & - 0.001 \\ 0.000 & - 0.002 & - 0.001 & 0.000 & - 0.001 & 0.177 \end{matrix}) \end{aligned}$ (20)

5. Discussion and conclusions

In the context of multi-sample multivariate problem, multiple comparisons of mean vectors with very large dimension, possibly much larger than the number of vectors in any sample, are considered. The case is of frequent interest, for example, as a first post hoc assessment of mean vectors after a global MANOVA hypothesis is rejected. All possible pairwise differences and comparisons with a control are treated. In particular, the joint asymptotic distribution, under $n_{i}, p \to \infty$ , is derived whose tail probabilities can be directly used to carry out the multiple tests. Simulations results are used to show the accuracy of the tests, and a comparison with max test is also given.

Following the objectives of the present work, as stated in Section 1, the proposed tests can be used in applied problems requiring simultaneous inference for two or more large mean vectors which might have been sampled from a non-normal distribution and may have unequal covariance matrices as well as the sample sizes. Whereas the test statistics are asymptotically approximated with Chi-square and Normal distributions, it is observed that the former provides relatively better accuracy than the later and is thus highly recommended for practical use.

Disclosure statement

No potential conflict of interest was reported by the author.

ORCID

M. Rauf Ahmad http://orcid.org/0000-0002-5362-5835

References

Seber GAF. Multivariate observations. New York (NY): Wiley.
Google Scholar
Johnson RA, DW Wichern. Applied multivariate statistical analysis. 6th ed. Upper Saddle River (NJ): Pearson Education; 2007.
Google Scholar
Krishnaiah PR. On the simultaneous ANOVA and MANOVA tests. Ann Inst Stat Math. 1965;17:35–53. doi: 10.1007/BF02868151
Web of Science ®Google Scholar
Krishnaiah PR. Simultaneous test procedures under general MANOVA models. In: Krishnaiah PR, editor. Multivariate analysis. Vol II. New York (NY): Academic Press; 1969. p. 121–143.
Google Scholar
Wijsman RA. Constructing all smallest simultaneous confidence sets in a given class with applications to MANOVA. An Stat. 1979;7(5):1003–1018. doi: 10.1214/aos/1176344784
Web of Science ®Google Scholar
Kropf S. Hochdimensionale multivariate Verfahren in der medizinischen Statistik. Aachen: Shaker; 2000.
Google Scholar
Kropf S, Läuter J. Multiple tests for different sets of variables using a data-driven ordering of hypotheses, with an application to gene expression data. Biom J. 2002;44:789–800. doi: 10.1002/1521-4036(200210)44:7<789::AID-BIMJ789>3.0.CO;2-#
Web of Science ®Google Scholar
Westfall P, Kropf S, Finos L. Weighted FWE-controlling methods in high-dimensional situations. In: Benjamini Y, Bretz F, Sarakr SK, editors. Recent developments in multiple comparison procedures. Vol. 47, IMS Lecture Notes and Monpgraph Series; 2004. p. 143–154.
Google Scholar
Läuter J, Glimm E, Eszlinger M. Search for relevant sets of variables in a high-dimensional setup keeping the familywise error rate. Statist Neerl. 2005;59(3):298–312. doi: 10.1111/j.1467-9574.2005.00290.x
Web of Science ®Google Scholar
Conneely KN, Boehnke M. So many correlated tests, so little time! Rapid adjustment of p values for multiple correlated tests. Amer J Human Genet. 2007;81:1158–1168. doi: 10.1086/522036
PubMed Web of Science ®Google Scholar
Westfall P, Troendle JF. Multiple testing with minimal assumptions. Biom J. 2008;50:745–755. doi: 10.1002/bimj.200710456
PubMed Web of Science ®Google Scholar
Bretz F, Hothorn T, Westfall P. Multiple comparisons using R. Boca Raton (FL): CRC Press; 2011.
Google Scholar
Dickhaus T. Simultaneous statistical inference in dynamic factor models. Berlin: Humboldt-Universitätzu; 2012. (Discussion paper, 2012-033.).
Google Scholar
Goeman J, Finos L. The inheritance procedure: Multiple testing of tree-structured hypotheses. Stat App Genet Molec Biol. 2012;11:1–18. doi: 10.1515/1544-6115.1554
Web of Science ®Google Scholar
Goeman J, Solari A. Multiple hypothesis testing in genomics. Stat Med. 2014;33:1946–1978. doi: 10.1002/sim.6082
PubMed Web of Science ®Google Scholar
Guilbaud O. Simultaneous confidence regions for closed tests, including Holms-, Hochberg-, and Hommel-related procedures. Biom J. 2012;54:317–342. doi: 10.1002/bimj.201100123
PubMed Web of Science ®Google Scholar
Guilbaud O. Sharper Confidence Intervals for Hochberg- and Hommel-Related Multiple Tests Based On an Extended Simes Inequality. Statist Biopharm Res. 2014;6:123–136. doi: 10.1080/19466315.2013.872999
Web of Science ®Google Scholar
Dickhaus T. Simultaneous statistical inference: with applications in the life sciences. New York (NY): Springer; 2014.
Google Scholar
Nichols T, Hayasaka S. Controlling the familywise error rate in functional neuroimaging: a comparative review. Stat Methods Med Res. 2003;12:419–446. doi: 10.1191/0962280203sm341ra
PubMed Web of Science ®Google Scholar
Hochberg Y, Tamhane AC. Multiple comparison procedures. New York (NY): Wiley; 1987.
Google Scholar
Hemerik J, Goeman J. False discovery proportion estimation by permutations: confidence for significance analysis of microarrays. J R Statist Soc B. 2018;80:137–155. doi: 10.1111/rssb.12238
Google Scholar
van der Vaart AW. Asymptotic statistics. Cambridge: Cambridge University Press; 1998.
Google Scholar
Lee AJ. U-statistics: theory and practice. Boca Raton (FL): CRC Press; 1990.
Google Scholar
Ahmad MR. Location-invariant multi-sample U-tests for covariance matrices with large dimension. Scand J Stat. 2017;44:500–523.
Web of Science ®Google Scholar
Ahmad MR. A unified approach to testing mean vectors with large dimension. AStA Adv Stat Anal. 2018.
Google Scholar
Jiang J. Large sample techniques for statistics. New York (NY): Springer; 2010.
Google Scholar
Koroljuk VS, Borovskich YV. Theory of U-statistics. Dordrecht: Kluwer Academic Press; 1994.
Google Scholar

Appendix. Some basic moments

Consider

U_{i}

with symmetric kernel

h (X_{i k}, X_{i r})

and conditional expectation (projection)

h_{c} (\cdot) = E [h (\cdot) | X_{1 k}, \dots X_{c})

, c=1,2 so that

h_{1} (X_{i k}) = E [h (\cdot) | X_{i k}]

h_{2} (\cdot) = h (\cdot)

with

Var [h_{i} (\cdot)] = ξ_{i}

, i=1,2. For

U_{i j}

with symmetric kernel

h (X_{i k}, X_{j l})

with

m_{1} = 1 = m_{2}

and with

c_{1}, c_{2} = 0, 1

, the conditional expectations are

h_{10} (X_{i k}) = E [h (\cdot) | X_{i k}]

h_{01} (X_{j l})

h_{11} (\cdot) = h (\cdot)

with corresponding variances

ξ_{10}, ξ_{01}

ξ_{11}

. Here,

h (\cdot)

is used when the arguments are evident from the context. Then, the moments of U-statistics follow as given, e.g., in Koroljuk and Borovskich [Citation27] or van der Vaart [Citation22]; see also Ahmad [Citation25, Appendix A]

Using these notations, $E (U_{i}) = μ_{i}^{'} μ_{i}$ , $E (U_{i j}) = μ_{i}^{'} μ_{j}$ , with $h (X_{i k}, X_{i r}) = X_{i k}^{'} X_{i r}$ , $h_{1} (X_{i k}) = μ_{i}^{'} X_{i k}$ , $ξ_{1} = Var [h_{i} (\cdot)] = μ_{i}^{'} Σ_{i} μ_{i}$ and $ξ_{2} = Var [h (\cdot)] = tr (Σ_{i}^{2}) + 2 μ_{i}^{'} Σ_{i} μ_{i}$ . For $U_{i j}$ , $h (X_{i k}, X_{j l}) = X_{i k}^{'} X_{j l}$ , with $h_{10} = μ_{j}^{'} X_{i k}$ , $h_{01} = μ_{i}^{'} X_{j l}$ , $ξ_{10} = Var [h_{10} (\cdot)] = μ_{j}^{'} Σ_{i} μ_{j}$ , $ξ_{10} = Var [h_{10} (\cdot)] = μ_{i}^{'} Σ_{j} μ_{i}$ , $h_{11} (\cdot) = h (\cdot)$ , $ξ_{11} = Var [h_{11} (\cdot)] = μ_{i}^{'} Σ_{j} μ_{i} + μ_{j}^{'} Σ_{i} μ_{j} + tr (Σ_{i} Σ_{j})$ . Now $Var (U_{i})$ = $2 [2 (n_{i} - 1) μ_{i}^{'} Σ_{i} μ_{i}$ + $tr (Σ_{i}^{2})] / n_{i} (n_{i} - 1)$ , $Var (U_{i j}) = [n_{i} μ_{i}^{'} Σ_{j} μ_{i} + n_{j} μ_{j}^{'} Σ_{i} μ_{j} + tr (Σ_{i} Σ_{j})] / n_{i} n_{j}$ , $Cov (U_{i}, U_{i j}) = 2 μ_{j}^{'} Σ_{i} μ_{i} / n_{i}$ , $Cov (U_{j}, U_{i j}) = 2 μ_{i}^{'} Σ_{j} μ_{j} / n_{j}$ , $Cov (U_{i j}, U_{i j^{'}}) = μ_{j}^{'} Σ_{i} μ_{j^{'}} / n_{i}$ , $Cov (U_{i j}, U_{i^{'} j})$ = $μ_{i}^{'} Σ_{j} μ_{i^{'}} / n_{j}$ , $i \neq j$ , $i \neq j^{'}$ , $i^{'} \neq j$ , where the remaining covariances vanish by independence.

Multiple comparisons of mean vectors with large dimension under general conditions

ABSTRACT

1. Introduction

2. Test statistics and their properties

2.1. Notations and preliminary set up

2.2. Test statistics and their properties

3. Simulations

Table 1. Estimated size of pairwise comparisons for g = 3: all distributions.

Table 2. Estimated size of comparisons with a control for g=6: All distributions.

Table 3. Estimated power of pairwise comparisons for g=3: All distributions.

Table 4. Estimated power of comparisons with a control for g=6: All distributions.

4. Applications

5. Discussion and conclusions

Disclosure statement

Related Research Data

References

Appendix. Some basic moments

Information for

Open access

Opportunities

Help and information

Multiple comparisons of mean vectors with large dimension under general conditions

ABSTRACT

1. Introduction

2. Test statistics and their properties

2.1. Notations and preliminary set up

2.2. Test statistics and their properties

3. Simulations

Table 1. Estimated size of pairwise comparisons for g = 3: all distributions.

Table 2. Estimated size of comparisons with a control for g=6: All distributions.

Table 3. Estimated power of pairwise comparisons for g=3: All distributions.

Table 4. Estimated power of comparisons with a control for g=6: All distributions.

4. Applications

5. Discussion and conclusions

Disclosure statement

ORCID

Related Research Data

References

Appendix. Some basic moments

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date