Full article: Properties of BLUEs in full versus small linear models

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

In this article we consider the partitioned linear model $M_{12} = {y, X_{1} β_{1} + X_{2} β_{2}, V}$ and the corresponding small model $M_{1} = {y, X_{1} β_{1}, V} .$ We focus on comparing the best linear unbiased estimators, BLUEs, of $X_{1} β_{1}$ under $M_{12}$ and $M_{1} .$ In other words, we are interested in the effect of adding regressors on the BLUEs. Particular attention is paid on the consistency of the model, that is, whether the realized value of the response vector y belongs to the column space of $(X_{1} : V)$ or $(X_{1} : X_{2} : V) .$

Keywords:

MATHEMATICS SUBJECT CLASSIFICATION:

1. Introduction

In this article we consider the partitioned linear model $y = X_{1} β_{1} + X_{2} β_{2} + ε$ and so-called small model (submodel) $y = X_{1} β_{1} + ε,$ or shortly $M_{12} = {y, X β, V} = {y, X_{1} β_{1} + X_{2} β_{2}, V}, M_{1} = {y, X_{1} β_{1}, V} .$ Here y is an n-dimensional observable response variable, and $ε$ is an unobservable random error with a known covariance matrix $cov (ε) = V = cov (y)$ and expectation $E (ε) = 0 .$ The matrix X is a known n × p matrix, that is, $X \in R^{n \times p},$ partitioned columnwise as $X = (X_{1} : X_{2}),$ $X_{i} \in R^{n \times p_{i}}, i = 1, 2.$ Vector $β = (β_{1}^{'}, β_{2}^{'})' \in R^{p}$ is a vector of fixed (but unknown) parameters; symbol $'$ stands for the transpose.

As for notation, $r (A),$ $A^{-},$ $A^{+},$ $C (A),$ $N (A),$ and $C {(A)}^{⊥},$ denote, respectively, the rank, a generalized inverse, the (unique) Moore–Penrose inverse, the column space, the null space, and the orthogonal complement of the column space of the matrix A. By $A^{⊥}$ we denote any matrix satisfying $C (A^{⊥}) = C {(A)}^{⊥} .$ Furthermore, we will write $P_{A} = P_{C (A)} = A A^{+} = A {(A' A)}^{-} A'$ to denote the orthogonal projector onto $C (A) .$ The orthogonal projector onto $C {(A)}^{⊥}$ is denoted as $Q_{A} = I_{a} - P_{A},$ where $I_{a}$ is the a × a identity matrix and a is the number of rows of A. We write shortly $M = I_{n} - P_{X}, M_{i} = I_{n} - P_{X_{i}}, i = 1, 2 .$ One obvious choice for $X^{⊥}$ is M.

When using generalized inverses it is essential to know whether the expressions are independent of the choice of the generalized inverses involved. The following lemma gives an important invariance condition; cf. Rao and Mitra (Citation1971, Lemma 2.2.4)

Lemma 1.1.

For nonnull matrices $A$ and $C$ the following holds: $A B^{-} C = A B^{+} C for all B^{-} \Leftrightarrow C (C) \subseteq C (B) & C (A') \subseteq C (B') .$

For a given linear model $M = {y, X β, V},$ let the set $W (M)$ of nonnegative definite matrices be defined as (1.1) $W (M) = {W \in R^{n \times n} : W = V + XUU' X', C (W) = C (X : V)} .$ (1.1) In Equation(1.1)(1.1) $W (M) = {W \in R^{n \times n} : W = V + XUU' X', C (W) = C (X : V)} .$ (1.1) , U can be any matrix comprising p rows as long as $C (W) = C (X : V)$ is satisfied. Lemma 1.2 collects together some important properties of the class $W (M);$ see, for example, Puntanen, Styan, and Isotalo (Citation2011, Prop. 12.1 and 15.2).

Lemma 1.2.

Consider the model $M = {y, X β, V}$ and let $W = V + XUU' X' \in W (M)$ . Then(1.2) $\begin{matrix} G_{12} = X {(X' W^{-} X)}^{-} X' W^{+} = P_{W} - VM {(MVM)}^{-} M P_{W} \\ = P_{W} - VM {(MVM)}^{+} = P_{W} - VM {(MVM)}^{+} M . \end{matrix}$ (1.2) Moreover, the following statements are equivalent:

$C (X : V) = C (W),$
$C (X) \subseteq C (W),$
$X^{'} W^{-} X$ is invariant for any choice of $W^{-},$
$C (X' W^{-} X) = C (X')$ for any choice of $W^{-},$
$X {(X' W^{-} X)}^{-} X' W^{-} X = X$ for any choices of $W^{-}$ and ${(X' W^{-} X)}^{-} .$

It is noteworthy that the matrix $G_{12}$ in Equation(1.2)(1.2) $\begin{matrix} G_{12} = X {(X' W^{-} X)}^{-} X' W^{+} = P_{W} - VM {(MVM)}^{-} M P_{W} \\ = P_{W} - VM {(MVM)}^{+} = P_{W} - VM {(MVM)}^{+} M . \end{matrix}$ (1.2) is invariant for the choice of the generalized inverses denoted as “ $^{-}$ ”, and it is independent of any choice of $W \in W (M) .$ Notice also that the invariance properties in (d) and (e) in Lemma 1.2 are valid for all choices of $W \in W (M) .$ It is clear that $V \in W (M)$ if and only if $C (X) \subseteq C (V) .$

In Lemma 1.2, the matrix W is nonnegative definite, denoted as $W \geq_{L} 0 .$ A corresponding version of Lemma 1.2 can be presented for $W = V + XTX'$ which may not be symmetric but satisfies $C (X : V) = C (W) .$

Corresponding to Equation(1.1)(1.1) $W (M) = {W \in R^{n \times n} : W = V + XUU' X', C (W) = C (X : V)} .$ (1.1) , we will say that $W_{i} \in W (M_{i})$ if there exist $U_{i}$ such that (1.3) $W_{i} = V + X_{i} U_{i} U_{i}^{'} X'_{i}, C (W_{i}) = C (X_{i} : V), i = 1, 2 .$ (1.3) For the partitioned linear model $M_{12}$ we will say that $W \in W (M_{12})$ if $W = V + X_{1} U_{1} U'_{1} X'_{1} + X_{2} U_{2} U'_{2} X'_{2},$ where $U_{1}$ and $U_{2}$ are defined as in Equation(1.3)(1.3) $W_{i} = V + X_{i} U_{i} U_{i}^{'} X'_{i}, C (W_{i}) = C (X_{i} : V), i = 1, 2 .$ (1.3) . For our considerations the actual choice of $U_{1}$ and $U_{2}$ does not matter as long as they satisfy Equation(1.3)(1.3) $W_{i} = V + X_{i} U_{i} U_{i}^{'} X'_{i}, C (W_{i}) = C (X_{i} : V), i = 1, 2 .$ (1.3) .

By the consistency of the model $M$ it is meant that y lies in $C (X : V)$ with probability 1. Hence we assume that under the consistent model $M$ the observed numerical value of y satisfies $y \in C (X : V) = C (X : VM) = C (X) \oplus C (VM) = C (X) ⊞ C (MV),$ where “⊕” refers to the direct sum and “ $⊞$ ” refers to the direct sum of orthogonal subspaces. For the equality $C (X : V) = C (X : VM),$ see Rao (Citation1974, Lemma 2.1).

For parts (a) and (b) of Lemma 1.3, see, for example, Puntanen, Styan, and Isotalo (Citation2011, Th. 8). and for part (c), see the rank rule of the matrix product of Marsaglia and Styan (Citation1974, Cor. 6.2). Claim (d) is straightforward to confirm.

Lemma 1.3.

Consider $X = (X_{1} : X_{2})$ and let $M_{2} = I_{n} - P_{X_{2}}$ . Then

$C (X_{1} : X_{2}) = C (X_{1} : M_{1} X_{2}),$
$M = I_{n} - P_{(X_{1} : X_{2})} = I_{n} - (P_{X_{2}} + P_{M_{2} X_{1}}) = M_{2} Q_{M_{2} X_{1}} = Q_{M_{2} X_{1}} M_{2},$
$r (M_{2} X_{1}) = r (X_{1}) - \dim C (X_{1}) \cap C (X_{2}),$
$C (X_{2}) \subseteq C (X_{1} : V) \Leftrightarrow C (M_{1} X_{2}) \subseteq C (M_{1} V) .$

For Lemma 1.4, see, for example, Puntanen, Styan, and Isotalo (Citation2011, p. 152).

Lemma 1.4.

For conformable matrices A and B the following three statements are equivalent: $(a) P_{A} - P_{B} is an orth . projector, (b) P_{A} - P_{B} \geq_{L} 0, (c) C (B) \subseteq C (A) .$ If any of the above conditions holds then $P_{A} - P_{B} = P_{C (A) \cap C {(B)}^{⊥}} = P_{(I - P_{B}) A} .$

Let A and B be arbitrary m × n matrices. Then, in the consistent linear model $M,$ the estimators $Ay$ and $By$ are said to be equal (with probability 1) if (1.4) $Ay = By for all y \in C (X : V) = C (X : VM) = C (W),$ (1.4) where $W \in W (M) .$ Thus, if A and B satisfy Equation(1.4)(1.4) $Ay = By for all y \in C (X : V) = C (X : VM) = C (W),$ (1.4) , then $A - B = C Q_{W}$ for some matrix C. It is crucial to notice that in Equation(1.4)(1.4) $Ay = By for all y \in C (X : V) = C (X : VM) = C (W),$ (1.4) we are dealing with the “statistical” equality of the estimators $Ay$ and $By .$ In Equation(1.4)(1.4) $Ay = By for all y \in C (X : V) = C (X : VM) = C (W),$ (1.4) y refers to a vector in $R^{n} .$ Thus we do not make any notational difference between a random vector and its observed value.

According to the well-known fundamental BLUE-equation, see Lemma 2.1 in Section 2, $Ay$ is the $BLUE$ of $X β$ if and only if $A (X : VM) = (X : 0) .$ Obviously $(A + N Q_{W}) y$ is another representation of $BLUE$ for any n × n matrix N. However, the equality $Ay = (A + N Q_{W}) y for all y \in C (W)$ holds when the model is consistent in the sense that $y \in C (W) .$ The properties of the $BLUE$ deserve particular attention when $C (X : V) = R^{n}$ does not hold: then there is an infinite number of multipliers B such that $By$ is $BLUE$ but for all such multipliers the vector $By$ itself is unique once the response y has been observed. The case of two linear models, $B_{i} = {y, X β, V_{i}}, i = 1, 2,$ is extensively studied by Mitra and Moore (Citation1973). They ask, for example, when is a specific linear representation of the $BLUE$ of $μ = X β$ under $B_{1}$ also a $BLUE$ under $B_{2},$ and when is the $BLUE$ of $μ = X β$ under $B_{1}$ irrespective of the linear representation used in its expression, also a $BLUE$ under $B_{2} .$

The purpose of this paper is to consider the models $M_{1}$ and $M_{12}$ in the spirit of Mitra and Moore (Citation1973). We pick up particular fixed representations for the $BLUE$ s of $μ_{1} = X_{1} β_{1}$ under these two models, say $G_{1} y$ and $G_{1 #} y,$ and study the conditions under which they are equal for all values of $y \in C (X_{1} : X_{2} : V)$ or $y \in C (X_{1} : V),$ that is, (1.5) $G_{1} W_{1} = G_{1 #} W_{1}, or G_{1} W = G_{1 #} W .$ (1.5) Moreover, we review the conditions under which Equation(1.5)(1.5) $G_{1} W_{1} = G_{1 #} W_{1}, or G_{1} W = G_{1 #} W .$ (1.5) holds for all representations of the $BLUE$ s, not only for fixed $G_{1}$ and $G_{1 #} .$ Some related considerations were made by Haslett, Markiewicz, and Puntanen (Citation2020) when these models are supplemented with the new unobservable random vector $y_{*},$ coming from $y_{*} = K β_{1} + ε_{*},$ where the covariance matrix of $y_{*}$ is known as well as the cross-covariance matrix between $y_{*}$ and y.

The well-known (or pretty well-known) results are given as Lemmas, while the new (or at least not so well-known) results are represented as Propositions.

2. The fundamental BLUE equations

A linear statistic $By$ is said to be linear unbiased estimator, LUE, for the parametric function $K β$ in $M_{12}$ if its expectation is equal to $K β,$ which happens if and only if $K' = X' B';$ in this case $K β$ is said to be estimable. The LUE $By$ is the best linear unbiased estimator, $BLUE,$ of estimable $K β$ if $By$ has the smallest covariance matrix in the Löwner sense among all LUEs of $K β :$ $cov (By) \leq_{L} cov (B_{#} y) for all B_{#} : B_{#} X = K .$ It is well known that $μ_{1} = X_{1} β_{1}$ is estimable under $M_{12}$ if and only if $C (X_{1}) \cap C (X_{2}) = {0}, i . e ., r (M_{2} X_{1}) = r (X_{1}) .$ For Lemma 2.1, characterizing the $BLUE,$ see, for example, Rao (Citation1973, p. 282).

Lemma 2.1.

Consider the model $M_{12}$ where $η = K β$ is estimable. Then

$Ay = BLUE (X β) \Leftrightarrow A (X : VM) = (X : 0),$ that is, $A \in {P_{μ | M_{12}}},$
$By = BLUE (K β) \Leftrightarrow B (X : VM) = (K : 0),$ that is, $B \in {P_{η | M_{12}}} .$
In particular, if $μ_{1} = X_{1} β_{1}$ is estimable,
$Cy = BLUE (μ_{1}) \Leftrightarrow C (X_{1} : X_{2} : VM) = (X_{1} : 0 : 0),$ that is, $C \in {P_{μ_{1} | M_{12}}} .$

Of course, under the model $M_{1}$ we have $Dy = BLUE (μ_{1}) \Leftrightarrow D (X_{1} : V M_{1}) = (X_{1} : 0), i . e ., D \in {P_{μ_{1} | M_{1}}} .$ To indicate that $A \in {P_{μ | M_{12}}}$ we will also use notations $Ay = \tilde{μ} (M_{12}) = BLUE (X β | M_{12}), Ay \in {BLUE (X β | M_{12})} .$ Using Lemma 1.2 we can obtain, for example, the following well-known solution to A in Lemma 2.1: $X {(X' W^{-} X)}^{-} X' W^{-} \in {P_{μ | M_{12}}},$ where $W \in W (M_{12})$ and we can freely choose the generalized inverses involved. Expression $X {(X' W^{-} X')}^{-} X' W^{-}$ is not necessarily unique with respect to the choice of $W^{-}$ but by Lemma 1.2, the matrix $G_{12} = X {(X' W^{-} X')}^{-} X' W^{+} = P_{W} - VM {(MVM)}^{-} M P_{W}$ is unique whatever choices of $W^{-}$ and ${(X' W^{-} X')}^{-}$ we have and moreover, $G_{12}$ does not depend on the choice of $W \in W (M_{12}) .$ The general solution for A in Lemma 2.1, can be expressed, for example, as $G_{0} = G_{12} + N Q_{W}, where N \in R^{n \times n} is free to vary,$ and $Q_{W} = I_{n} - P_{W} .$ Thus the solution for A (as well as for B and C) in Lemma 2.1 is unique if and only if $C (X : V) = R^{n} .$

Consider then the estimation of $μ_{1} = X_{1} β_{1}$ under $M_{12}$ assuming that $μ_{1}$ is estimable. Premultiplying the model $M_{12}$ by $M_{2}$ yields the reduced model $M_{12 \cdot 2} = {M_{2} y, M_{2} X_{1} β_{1}, M_{2} V M_{2}} .$ Now the well-known Frisch–Waugh–Lovell theorem, see, for example, Groß and Puntanen (Citation2000, Sec. 6), states that the $BLUE$ s of $μ_{1}$ under $M_{12}$ and $M_{12 \cdot 2}$ coincide. To obtain an explicit expression for the $BLUE$ of $M_{2} X_{1} β_{1}$ under $M_{12 \cdot 2}$ we need a W-matrix in $M_{12 \cdot 2} .$ Now any matrix of the form $M_{2} V M_{2} + M_{2} X_{1} T_{1} T_{1}^{'} X_{1}^{'} M_{2} = M_{2} (V + X_{1} T_{1} T_{1}^{'} X_{1}^{'}) M_{2}$ satisfying (2.1) $C [M_{2} (V : X_{1} T_{1})] = C [M_{2} (V : X_{1})] = C (M_{2} W_{1}),$ (2.1) is a W-matrix in $M_{12 \cdot 2} .$ Choosing $T_{1} = U_{1}$ as in Equation(1.3)(1.3) $W_{i} = V + X_{i} U_{i} U_{i}^{'} X'_{i}, C (W_{i}) = C (X_{i} : V), i = 1, 2 .$ (1.3) we have $M_{2} W M_{2} = M_{2} W_{1} M_{2} \in W (M_{12 \cdot 2}) .$ Thus the $BLUE$ of $M_{2} X_{1} β_{1}$ under $M_{12 \cdot 2}$ can be expressed as $BLUE (M_{2} X_{1} β_{1} | M_{12 \cdot 2}) = M_{2} X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2} y,$ where ${\dot{M}}_{2} = M_{2} {(M_{2} W_{1} M_{2})}^{-} M_{2} .$

We observe that Equation(2.1)(2.1) $C [M_{2} (V : X_{1} T_{1})] = C [M_{2} (V : X_{1})] = C (M_{2} W_{1}),$ (2.1) holds for $T_{1} = 0$ if and only if $C (M_{2} X_{1}) \subseteq C (M_{2} V),$ that is, see part (d) of Lemma 1.3, (2.2) $C (X_{1}) \subseteq C (X_{2} : V) .$ (2.2) Our conclusion: If Equation(2.2)(2.2) $C (X_{1}) \subseteq C (X_{2} : V) .$ (2.2) holds, then the $BLUE$ of $M_{2} X_{1} β_{1}$ under $M_{12 \cdot 2}$ can be expressed as (2.3) $BLUE (M_{2} X_{1} β_{1} | M_{12 \cdot 2}) = M_{2} X_{1} {(X_{1}^{'} {\dot{M}}_{2 V} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2 V} y,$ (2.3) where ${\dot{M}}_{2 V} = M_{2} {(M_{2} V M_{2})}^{-} M_{2} .$ Actually, it can be shown that Equation(2.2)(2.2) $C (X_{1}) \subseteq C (X_{2} : V) .$ (2.2) is also a necessary condition for Equation(2.3)(2.3) $BLUE (M_{2} X_{1} β_{1} | M_{12 \cdot 2}) = M_{2} X_{1} {(X_{1}^{'} {\dot{M}}_{2 V} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2 V} y,$ (2.3) . It is obvious that under the estimability of $μ_{1}$ we have (2.4a) $BLUE (μ_{1} | M_{12 \cdot 2}) = BLUE (μ_{1} | M_{12}) = X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2} y,$ (2.4a) (2.4b) $BLUE (μ_{2} | M_{12 \cdot 1}) = BLUE (μ_{2} | M_{12}) = X_{2} {(X_{2}^{'} {\dot{M}}_{1} X_{2})}^{-} X_{2}^{'} {\dot{M}}_{1} y,$ (2.4b) where ${\dot{M}}_{i} = M_{i} {(M_{i} W M_{i})}^{-} M_{i}, i = 1, 2.$

An alternative expression for the $BLUE$ of $μ_{1}$ can be obtained by premultiplying the fundamental BLUE-equation $X {(X' W^{-} X)}^{-} X' W^{-} (X_{1} : X_{2} : VM) = (X_{1} : X_{2} : 0)$ by $M_{2},$ yielding (2.5) $(M_{2} X_{1} : 0) {(X' W^{-} X)}^{-} X' W^{-} (X_{1} : X_{2} : VM) = (M_{2} X_{1} : 0 : 0) .$ (2.5) Because $r (M_{2} X_{1}) = r (X_{1}),$ we can, by the rank cancelation rule of Marsaglia and Styan (Citation1974), cancel $M_{2}$ in Equation(2.5)(2.5) $(M_{2} X_{1} : 0) {(X' W^{-} X)}^{-} X' W^{-} (X_{1} : X_{2} : VM) = (M_{2} X_{1} : 0 : 0) .$ (2.5) and thus an alternative expression for Equation(2.4a)(2.4a) $BLUE (μ_{1} | M_{12 \cdot 2}) = BLUE (μ_{1} | M_{12}) = X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2} y,$ (2.4a) is ${\tilde{μ}}_{1} (M_{12}) = (X_{1} : 0) {(X' W^{-} X)}^{-} X' W^{-} y .$ Now we should pay attention to numerous generalized inverses appearing in the representations of the $BLUE$ s. Namely, when the observable response y belongs to a “correct” subspace of $R^{n},$ then there is no problem with the generalized inverses. In the next section we will consider particular unique representations of the multipliers of y and study the equality of the relevant estimators taking the space where y belongs into account.

3. Some useful matrix results

Let us denote $\begin{matrix} G_{1 #} = X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2}, D_{1 #} = (X_{1} : 0) {(X' W^{-} X)}^{-} X' W^{+}, \\ G_{2 #} = X_{2} {(X_{2}^{'} {\dot{M}}_{1} X_{2})}^{-} X_{2}^{'} {\dot{M}}_{1}, D_{2 #} = (0 : X_{2}) {(X' W^{-} X)}^{-} X' W^{+}, \end{matrix}$ where ${\dot{M}}_{1}$ and ${\dot{M}}_{2}$ are now unique (once W is given) matrices defined as $\begin{matrix} {\dot{M}}_{1} = M_{1} {(M_{1} W M_{1})}^{+} M_{1} = M_{1} {(M_{1} W_{2} M_{1})}^{+} M_{1}, \\ {\dot{M}}_{2} = M_{2} {(M_{2} W M_{2})}^{+} M_{2} = M_{2} {(M_{2} W_{1} M_{2})}^{+} M_{2} . \end{matrix}$ It is noteworthy that the following types of equalities hold: $M_{1} {(M_{1} W M_{1})}^{+} M_{1} = M_{1} {(M_{1} W M_{1})}^{+} = {(M_{1} W M_{1})}^{+} .$ Now under the estimability of $μ_{1} = X_{1} β_{1}$ we have $\begin{matrix} {\tilde{μ}}_{1} (M_{12}) = X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2} y = (X_{1} : 0) {(X' W^{-} X)}^{-} X' W^{+} y, \\ {\tilde{μ}}_{2} (M_{12}) = X_{2} {(X_{2}^{'} {\dot{M}}_{1} X_{2})}^{-} X_{2}^{'} {\dot{M}}_{1} y = (0 : X_{2}) {(X' W^{-} X)}^{-} X' W^{+} y, \end{matrix}$ and $\tilde{μ} (M_{12}) = (G_{1 #} + G_{2 #}) y = (D_{1 #} + D_{2 #}) y for all y \in C (W) .$ Because $G_{1 #}$ and $D_{1 #}$ belong to ${P_{μ_{1} | M_{12}}},$ they satisfy the equation (3.3) $G_{1 #} W = D_{1 #} W .$ (3.3) Next we show that we also have (3.4) $G_{1 #} Q_{W} = D_{1 #} Q_{W} .$ (3.4) We immediately observe that $D_{1 #} Q_{W} = 0$ and what remains is to show that $G_{1 #} Q_{W} = 0 .$ Now the equation $G_{1 #} Q_{W} = X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2} Q_{W} = 0$ holds if and only if (3.5) $X_{1}^{'} {\dot{M}}_{2} Q_{W} = 0, i . e ., C ({\dot{M}}_{2} X_{1}) \subseteq C (W) .$ (3.5) Clearly Equation(3.5)(3.5) $X_{1}^{'} {\dot{M}}_{2} Q_{W} = 0, i . e ., C ({\dot{M}}_{2} X_{1}) \subseteq C (W) .$ (3.5) holds because $C ({\dot{M}}_{2} X_{1}) \subseteq C ({\dot{M}}_{2}) = C [{(M_{2} W_{1} M_{2})}^{+}] = C (M_{2} W_{1}) \subseteq C (W),$ where the last inclusion follows from $C (X_{1} : X_{2} : V) = C [X_{2} : M_{2} (X_{1} : V)] = C (X_{2} : M_{2} W_{1}) .$ Combining Equation(3.3)(3.3) $G_{1 #} W = D_{1 #} W .$ (3.3) and Equation(3.4)(3.4) $G_{1 #} Q_{W} = D_{1 #} Q_{W} .$ (3.4) gives the following result.

Proposition 3.1.

Assume that $μ_{1}$ is estimable under $M_{12}$ . Then(3.6) $G_{1 #} = X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2} = (X_{1} : 0) {(X' W^{-} X)}^{-} X' W^{+} = D_{1 #},$ (3.6) where ${\dot{M}}_{2} = M_{2} {(M_{2} W_{1} M_{2})}^{+} M_{2} .$ Moreover, the expressions in Equation(3.6)(3.6) $G_{1 #} = X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2} = (X_{1} : 0) {(X' W^{-} X)}^{-} X' W^{+} = D_{1 #},$ (3.6) are invariant for any choices of generalized inverses ${(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-},$ $W^{-},$ and ${(X' W^{-} X)}^{-}$ as well as for the choice of $W \in W (M_{12}) .$ Corresponding equality holds between $G_{2 #}$ and $D_{2 #} .$ Moreover, $G_{12} = X {(X' W^{-} X')}^{-} X' W^{+} = G_{1 #} + G_{2 #} = D_{1 #} + D_{2 #} .$

We will also need the following proposition.

Proposition 3.2.

Denote $G_{1 #} = X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2},$ where ${\dot{M}}_{2} = M_{2} {(M_{2} W_{1} M_{2})}^{+} M_{2} .$ Then

$C (X_{1}^{'} {\dot{M}}_{2} W) = C (X_{1}^{'} {\dot{M}}_{2} X_{1}) = C (X_{1}^{'} M_{2}),$
$r (W_{1} {\dot{M}}_{2} X_{1}) = r (W {\dot{M}}_{2} X_{1}) = r (X_{1}^{'} {\dot{M}}_{2} X_{1}) = r (M_{2} X_{1}),$
$C (W G_{1 #}^{'}) = C [W {\dot{M}}_{2} X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'}] = C (W {\dot{M}}_{2} X_{1}),$
$C (G_{1 #} W) = C (X_{1}' M_{2}) .$
In particular, when $μ_{1}$ is estimable under $M_{12},$ we have
$C (X_{1}^{'} {\dot{M}}_{2} W) = C (X_{1}^{'} {\dot{M}}_{2} X_{1}) = C (G_{1 #} W) = C (X_{1}^{'}) .$

Proof.

Property (b) comes from the following: (3.7) $\begin{matrix} r (M_{2} X_{1}) \geq r (W_{1} {\dot{M}}_{2} X_{1}) = r [W_{1} M_{2} {(M_{2} W_{1} M_{2})}^{-} M_{2} X_{1}] \\ \geq r [M_{2} W_{1} M_{2} {(M_{2} W_{1} M_{2})}^{-} M_{2} X_{1}] \\ = r (M_{2} X_{1}) . \end{matrix}$ (3.7) The last equality in Equation(3.7)(3.7) $\begin{matrix} r (M_{2} X_{1}) \geq r (W_{1} {\dot{M}}_{2} X_{1}) = r [W_{1} M_{2} {(M_{2} W_{1} M_{2})}^{-} M_{2} X_{1}] \\ \geq r [M_{2} W_{1} M_{2} {(M_{2} W_{1} M_{2})}^{-} M_{2} X_{1}] \\ = r (M_{2} X_{1}) . \end{matrix}$ (3.7) follows from the fact that $C (M_{2} X_{1}) \subseteq C (M_{2} W_{1}) .$ The other statements can be confirmed in the corresponding way. □

Proposition 3.3 appears to be useful for our BLUE-considerations and it also provides some interesting linear algebraic matrix results. By $A^{1 / 2}$ we refer to the nonnegative definite square root of a nonnegative definite matrix A and $A^{+ 1 / 2} = {(A^{1 / 2})}^{+}$ so that $A^{1 / 2} A^{+ 1 / 2} = P_{A} .$

Proposition 3.3.

The following five statements hold:

$C {(W^{+} X)}^{⊥} = C (WM : Q_{W}) = C (VM : Q_{W}),$
$C {(W_{1}^{+} X_{1})}^{⊥} = C (W_{1} M_{1} : Q_{W_{1}}) = C (V M_{1} : Q_{W_{1}}),$
$P_{W^{1 / 2} M_{2}} = P_{W} - P_{W^{+ 1 / 2} X_{2}},$
$P_{W} {\dot{M}}_{2} P_{W} = W^{+} - W^{+} X_{2} {(X_{2}^{'} W^{+} X_{2})}^{-} X_{2}^{'} W^{+},$
$W {\dot{M}}_{2} X_{1} = W_{1} {\dot{M}}_{2} X_{1} = [I_{n} - X_{2} {(X_{2}^{'} W^{+} X_{2})}^{-} X_{2}^{'} W^{+}] X_{1} .$
The following three statements are equivalent:
$r (X_{2}) = \dim C (W_{1}) \cap C (X_{2}) + \dim C {(W_{1})}^{⊥} \cap C (X_{2}),$
$r (W_{1}) = r (W_{1} M_{2}) + r (W_{1} X_{2}),$
$P_{W_{1}^{1 / 2} M_{2}} = P_{W_{1}} - P_{W_{1}^{+ 1 / 2} X_{2}} .$
If any of the conditions (f)–(h) holds, then
$P_{W_{1}} {\dot{M}}_{2} P_{W_{1}} = W_{1}^{+} - W_{1}^{+} X_{2} {(X_{2}^{'} W_{1}^{+} X_{2})}^{-} X_{2}^{'} W_{1}^{+},$
$W {\dot{M}}_{2} X_{1} = W_{1} {\dot{M}}_{2} X_{1} = [P_{W_{1}} - P_{W_{1}} X_{2} {(X_{2}^{'} W_{1}^{+} X_{2})}^{-} X_{2}^{'} W_{1}^{+}] X_{1} .$
If $C (X_{2}) \subseteq C (W_{1}),$ then
$W {\dot{M}}_{2} X_{1} = W_{1} {\dot{M}}_{2} X_{1} = [I_{n} - X_{2} {(X_{2}^{'} W_{1}^{+} X_{2})}^{-} X_{2}^{'} W_{1}^{+}] X_{1} .$

Proof.

The first five statements (a)–(e) appear in Markiewicz and Puntanen (Citation2019, Sec. 4). The claim (h), that is, $P_{W_{1}^{1 / 2} M_{2}} = P_{W_{1}} - P_{W_{1}^{+ 1 / 2} X_{2}},$ holds if and only if, see Lemma 1.4, (3.8) $C (W_{1}^{1 / 2} M_{2}) = C {(W_{1}^{+ 1 / 2} X_{2} : Q_{W_{1}})}^{⊥} = C {(W_{1}^{+ 1 / 2} X_{2})}^{⊥} \cap C (W_{1}) .$ (3.8) Now Equation(3.8)(3.8) $C (W_{1}^{1 / 2} M_{2}) = C {(W_{1}^{+ 1 / 2} X_{2} : Q_{W_{1}})}^{⊥} = C {(W_{1}^{+ 1 / 2} X_{2})}^{⊥} \cap C (W_{1}) .$ (3.8) holds if and only if $r (W_{1}^{1 / 2} M_{2}) = n - r (W_{1}^{+ 1 / 2} X_{2} : Q_{W_{1}}),$ that is, $r (W_{1}) = r (W_{1} M_{2}) + r (W_{1} X_{2}),$ which further is equivalent to (f). Clearly (f) holds, for example, when $C (X_{2}) \subseteq C (W_{1}) .$

Assuming that (f) holds we can write (3.9) $\begin{matrix} P_{W_{1}} {\dot{M}}_{2} P_{W_{1}} = P_{W_{1}} M_{2} {(M_{2} W_{1} M_{2})}^{+} M_{2} P_{W_{1}} \\ = W_{1}^{+ 1 / 2} P_{W_{1}^{1 / 2} M_{2}} W_{1}^{+ 1 / 2} \\ = W_{1}^{+ 1 / 2} (P_{W_{1}} - P_{W_{1}^{+ 1 / 2} X_{2}}) W_{1}^{+ 1 / 2} \\ = W_{1}^{+} - W_{1}^{+} X_{2} {(X_{2}^{'} W_{1}^{+} X_{2})}^{-} X_{2}^{'} W_{1}^{+} . \end{matrix}$ (3.9) From Equation(3.9)(3.9) $\begin{matrix} P_{W_{1}} {\dot{M}}_{2} P_{W_{1}} = P_{W_{1}} M_{2} {(M_{2} W_{1} M_{2})}^{+} M_{2} P_{W_{1}} \\ = W_{1}^{+ 1 / 2} P_{W_{1}^{1 / 2} M_{2}} W_{1}^{+ 1 / 2} \\ = W_{1}^{+ 1 / 2} (P_{W_{1}} - P_{W_{1}^{+ 1 / 2} X_{2}}) W_{1}^{+ 1 / 2} \\ = W_{1}^{+} - W_{1}^{+} X_{2} {(X_{2}^{'} W_{1}^{+} X_{2})}^{-} X_{2}^{'} W_{1}^{+} . \end{matrix}$ (3.9) it follows that $\begin{matrix} W_{1} {\dot{M}}_{2} X_{1} = W_{1} [W_{1}^{+} - W_{1}^{+} X_{2} {(X_{2}^{'} W_{1}^{+} X_{2})}^{-} X_{2}^{'} W_{1}^{+}] X_{1} \\ = [P_{W_{1}} - P_{W_{1}} X_{2} {(X_{2}^{'} W_{1}^{+} X_{2})}^{-} X_{2}^{'} W_{1}^{+}] X_{1}, \end{matrix}$ and hence, supposing that $C (X_{2}) \subseteq C (W_{1}),$ we obtain (k): $W_{1} {\dot{M}}_{2} X_{1} = [I_{n} - X_{2} {(X_{2}^{'} W_{1}^{+} X_{2})}^{-} X_{2}^{'} W_{1}^{+}] X_{1} .$ Thus the proof is completed. □

4. Difference of the BLUEs under the full and small model

Next we introduce a particular expression for the difference $(G_{1} - G_{1 #}) y$ which is valid for all $y \in C (W) .$

Proposition 4.1.

Consider the models $M_{12}$ and $M_{1}$ and suppose that $μ_{1} = X_{1} β_{1}$ is estimable under $M_{12}$ . Using the earlier notation, we have for all $y \in C (W)$ :(4.1) $\begin{matrix} (G_{1} - G_{1 #}) y = G_{1} G_{2 #} y \\ = X_{1} {(X_{1}^{'} W_{1}^{+} X_{1})}^{-} X_{1}^{'} W_{1}^{+} \cdot X_{2} {(X_{2}^{'} {\dot{M}}_{1} X_{2})}^{-} X_{2}^{'} {\dot{M}}_{1} y . \end{matrix}$ (4.1)

Proof.

It is clear that $G_{1} G_{1 #} = G_{1 #} .$ Premultiplying $G_{12} = P_{W} - VM {(MVM)}^{-} M P_{W}$ by $G_{1}$ we observe that $G_{1} G_{12} = G_{1}$ as $G_{1} V M = 0 .$ Thus we have (4.2) $G_{1} - G_{1 #} = G_{1} (G_{12} - G_{1 #}) = G_{1} G_{2 #} .$ (4.2) The claim Equation(4.1)(4.1) $\begin{matrix} (G_{1} - G_{1 #}) y = G_{1} G_{2 #} y \\ = X_{1} {(X_{1}^{'} W_{1}^{+} X_{1})}^{-} X_{1}^{'} W_{1}^{+} \cdot X_{2} {(X_{2}^{'} {\dot{M}}_{1} X_{2})}^{-} X_{2}^{'} {\dot{M}}_{1} y . \end{matrix}$ (4.1) follows from Equation(4.2)(4.2) $G_{1} - G_{1 #} = G_{1} (G_{12} - G_{1 #}) = G_{1} G_{2 #} .$ (4.2) . □

Proposition 4.1 was proved by Haslett and Puntanen (Citation2010, Lemma 3.1) in the situation when $C (X_{2}) \subseteq C (X_{1} : V) = C (W_{1}) .$ Using different formulation and proof, it appears also in Werner and Yapar (Citation1996, Th. 2.3). See also Sengupta and Jammalamadaka (Citation2003, Ch. 9) and Güler, Puntanen, and Özdemir (Citation2014). In the full rank model, that is, when X has full column rank and $V$ is positive definite, it appears, for example, in Haslett (Citation1996).

Remark 4.1.

We might be tempted to express the equality $G_{1} y = G_{1 #} y$ as (4.3) ${\tilde{μ}}_{1} (M_{1}) = {\tilde{μ}}_{1} (M_{12}), i . e ., BLUE (μ_{1} | M_{1}) = BLUE (μ_{1} | M_{12}) .$ (4.3) However, the notation used in Equation(4.3)(4.3) ${\tilde{μ}}_{1} (M_{1}) = {\tilde{μ}}_{1} (M_{12}), i . e ., BLUE (μ_{1} | M_{1}) = BLUE (μ_{1} | M_{12}) .$ (4.3) can be problematic when the possible values of the response vector y are taken into account. It is clear that $G_{1} y$ is the $BLUE$ of $μ_{1}$ under $M_{1}$ and we may write shortly $G_{1} y = {\tilde{μ}}_{1} (M_{1}) .$ Now, there might be another estimator $Ay$ for which we can also write $Ay = {\tilde{μ}}_{1} (M_{1})$ but, however, $Ay$ and $G_{1} y$ may have different numerical observed values. The numerical value of the $BLUE$ under $M_{1}$ is unique if and only if y lies in $C (W_{1}) .$ □

Notice that in above considerations all the matrices $G_{1},$ $G_{12}$ and so on. are fixed. Let us check whether Equation(4.1)(4.1) $\begin{matrix} (G_{1} - G_{1 #}) y = G_{1} G_{2 #} y \\ = X_{1} {(X_{1}^{'} W_{1}^{+} X_{1})}^{-} X_{1}^{'} W_{1}^{+} \cdot X_{2} {(X_{2}^{'} {\dot{M}}_{1} X_{2})}^{-} X_{2}^{'} {\dot{M}}_{1} y . \end{matrix}$ (4.1) holds for arbitrary $H_{1} \in {P_{μ_{1} | M_{1}}}, H_{12} \in {P_{μ | M_{12}}}$ and so on.

Corollary 4.1.

Let us denote $\begin{matrix} H_{1} = G_{1} + N_{1} Q_{W_{1}}, H_{12} = G_{12} + N_{2} Q_{W}, \\ H_{1 #} = G_{1 #} + N_{3} Q_{W}, H_{2 #} = G_{2 #} + N_{4} Q_{W}, \end{matrix}$ where the matrices $N_{1}, \dots, N_{4}$ are free to vary. Then

$(H_{1} - H_{1 #}) y = G_{1} G_{2 #} y + N_{1} Q_{W_{1}} y$ for all $y \in C (W),$
$(H_{1} - H_{1 #}) y = H_{1} H_{2 #} y$ for all $y \in C (W) .$
Moreover, the following two statements are equivalent:
$C (X_{2}) \subseteq C (W_{1}),$
$(H_{1} - H_{1 #}) y = G_{1} G_{2 #} y$ for all $y \in C (W) .$

Proof.

In view of $\begin{matrix} (H_{1} - H_{1 #}) W = (G_{1} + N_{1} Q_{W_{1}} - G_{1 #} - N_{3} Q_{W}) W \\ = (G_{1} + N_{1} Q_{W_{1}} - G_{1 #}) W \\ = (G_{1} - G_{1 #}) W + N_{1} Q_{W_{1}} W \\ = G_{1} G_{2 #} W + N_{1} Q_{W_{1}} W, \end{matrix}$ the statement (a) holds. We observe that $\begin{matrix} H_{1} H_{2 #} W = (G_{1} + N_{1} Q_{W_{1}}) G_{2 #} W \\ = G_{1} G_{2 #} W + N_{1} Q_{W_{1}} G_{2 #} W . \end{matrix}$ Thus the statement (b), that is, the equality $(H_{1} - H_{1 #}) W = H_{1} H_{2 #} W$ holds if and only if (4.5) $Q_{W_{1}} W = Q_{W_{1}} G_{2 #} W .$ (4.5) Replacing W with $(X_{1} : X_{2} : VM)$ in Equation(4.5)(4.5) $Q_{W_{1}} W = Q_{W_{1}} G_{2 #} W .$ (4.5) we observe that Equation(4.5)(4.5) $Q_{W_{1}} W = Q_{W_{1}} G_{2 #} W .$ (4.5) indeed holds. The equivalence of (c) and (d) is obvious. □

Proposition 4.2.

Consider the models $M_{12}$ and $M_{1}$ and suppose that $μ_{1} = X_{1} β_{1}$ is estimable under $M_{12}$ . Then the following statements are equivalent:

$G_{1} y = G_{1 #} y$ for all $y \in C (W),$ that is, $G_{1} W = G_{1 #} W,$
$G_{1} y = G_{1 #} y$ for all $y \in C (X_{1} : X_{2}),$
$G_{1} y = G_{1 #} y$ for all $y \in R^{n},$ that is, $G_{1} = G_{1 #},$
$G_{1} \in {P_{μ_{1} | M_{12}}},$ that is, $G_{1} y \in {BLUE (μ_{1} | M_{12})},$
$X_{1}^{'} W_{1}^{+} X_{2} = 0,$
$G_{1} X_{2} = 0,$
$C (X_{2}) \subseteq C {(W_{1}^{+} X_{1})}^{⊥} = C (W_{1} M_{1} : Q_{W_{1}}) = C (V M_{1} : Q_{W_{1}}) .$

Proof.

Consider the statement (a) which is obviously equivalent to (d): (4.6) $G_{1} (X_{1} : X_{2} : VM) = G_{1 #} (X_{1} : X_{2} : VM) .$ (4.6) Now $G_{1} V M = G_{1} V M_{1} Q_{M_{1} X_{2}} = 0$ and hence Equation(4.6)(4.6) $G_{1} (X_{1} : X_{2} : VM) = G_{1 #} (X_{1} : X_{2} : VM) .$ (4.6) holds if and only if (4.7) $(X_{1} : G_{1} X_{2} : 0) = (X_{1} : 0 : 0),$ (4.7) that is, $G_{1} X_{2} = X_{1} {(X_{1}^{'} W_{1}^{-} X_{1})}^{-} X_{1}^{'} W_{1}^{+} X_{2} = 0,$ which is equivalent to $X_{1}^{'} W_{1}^{+} X_{2} = 0 .$ The equivalence between (a) and (b) follows from the equivalence between Equation(4.6)(4.6) $G_{1} (X_{1} : X_{2} : VM) = G_{1 #} (X_{1} : X_{2} : VM) .$ (4.6) and Equation(4.7)(4.7) $(X_{1} : G_{1} X_{2} : 0) = (X_{1} : 0 : 0),$ (4.7) .

To prove that (a) and (c) are equivalent we need to show that $G_{1} Q_{W} = G_{1 #} Q_{W} .$ It is clear that $G_{1} Q_{W} = 0 .$ Similarly, $G_{1 #} Q_{W} = D_{1 #} Q_{W} = 0 .$ Thus (a) is equivalent to (c). The claim (g) follows from part (b) of Proposition 3.3. □

Remark 4.2.

Clearly (a) in Proposition 4.2 is equivalent to $(i) G_{1} (X_{1} : X_{2}) = G_{1 #} (X_{1} : X_{2}) = (X_{1} : 0) and (ii) G_{1} V = G_{1 #} V,$ that is, (i) $G_{1} X_{2} = 0$ and (ii) $G_{1} V = G_{1 #} V .$ Here is a question: where does the condition (ii) vanish in Proposition 4.2?

In view of Proposition 4.2, the condition (i) implies that $G_{1} = G_{1 #},$ and hence trivially (ii) holds, that is, $G_{1} V = G_{1 #} V .$ However, (ii) does not imply (i). Moreover, the condition (ii) implies that $cov (G_{1} y) = cov (G_{1 #} y)$ which by Proposition 4.3 (see below) is equivalent to $X_{1}^{'} W^{+} X_{2} = 0 .$ Thus we can conclude that $X_{1}^{'} W_{1}^{+} X_{2} = 0 \Rightarrow X_{1}^{'} W^{+} X_{2} = 0 .$ □

In Propositions 4.3–4.5 we assume that $μ_{1} = X_{1} β_{1}$ is estimable under $M_{12} .$

Proposition 4.3.

The following statements are equivalent:

$G_{1} y = G_{1 #} y$ for all $y \in C (W_{1}),$ that is, $G_{1} W_{1} = G_{1 #} W_{1},$
$G_{1 #} \in {P_{μ_{1} | M_{1}}},$ that is, $G_{1 #} y \in {BLUE (μ_{1} | M_{1})},$
${BLUE (μ_{1} | M_{12})} \subseteq {BLUE (μ_{1} | M_{1})},$ that is, ${P_{μ_{1} | M_{12}}} \subseteq {P_{μ_{1} | M_{1}}},$
$(H_{1} - H_{1 #}) W_{1} = 0$ for all $H_{1} \in {P_{μ_{1} | M_{1}}}, H_{1 #} \in {P_{μ_{1} | M_{12}}},$
$G_{1 #} V M_{1} = 0,$
$C (W {\dot{M}}_{2} X_{1}) = C (W_{1} {\dot{M}}_{2} X_{1}) = C (X_{1}),$
$W {\dot{M}}_{2} X_{1} = W_{1} {\dot{M}}_{2} X_{1} = X_{1},$
$X_{1}^{'} W^{+} X_{2} = 0,$
$G_{1} V = G_{1 #} V,$
$cov (G_{1 #} y - G_{1} y) = 0,$
$cov (G_{1} y) = cov (G_{1 #} y) .$
Moreover, we always have
$cov (G_{1 #} y - G_{1} y) = cov (G_{1 #} y) - cov (G_{1} y),$
$cov (G_{1} y) \leq_{L} cov (G_{1 #} y),$
$X_{1}^{'} W_{1}^{+} X_{2} = 0 \Rightarrow X_{1}^{'} W^{+} X_{2} = 0 .$

Proof.

It is clear that (b) is simply an alternative expression for (a) and similarly (d) for (c). The claim (a) holds if and only if $G_{1} (X_{1} : V M_{1}) = G_{1 #} (X_{1} : V M_{1}) = (X_{1} : 0),$ which gives (e): $G_{1 #} V M_{1} = 0,$ that is, (4.8) $X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2} V M_{1} = 0 .$ (4.8) Premultiplying Equation(4.8)(4.8) $X_{1} {(X_{1}^{'} {\dot{M}}_{2} X_{1})}^{-} X_{1}^{'} {\dot{M}}_{2} V M_{1} = 0 .$ (4.8) by $X_{1}^{'} {\dot{M}}_{2}$ yields $X_{1}^{'} {\dot{M}}_{2} V M_{1} = X_{1}^{'} {\dot{M}}_{2} W M_{1} = X_{1}^{'} {\dot{M}}_{2} W_{1} M_{1} = 0,$ that is, $C (W {\dot{M}}_{2} X_{1}) \subseteq C (X_{1}) .$ In view of Proposition 3.2, we have $r (W {\dot{M}}_{2} X_{1}) = r (X_{1})$ and hence $C (W {\dot{M}}_{2} X_{1}) \subseteq C (X_{1})$ becomes (4.9) $C (W {\dot{M}}_{2} X_{1}) = C (W_{1} {\dot{M}}_{2} X_{1}) = C (X_{1}) .$ (4.9) Thus we have shown that (e) and (f) are equivalent. Equality Equation(4.9)(4.9) $C (W {\dot{M}}_{2} X_{1}) = C (W_{1} {\dot{M}}_{2} X_{1}) = C (X_{1}) .$ (4.9) implies (4.10) $X_{1}^{'} W^{+} X_{2} = 0,$ (4.10) that is, (f) implies (h). In view of part (e) of Proposition 3.3 we have (4.11) $W {\dot{M}}_{2} X_{1} = [I_{n} - X_{2} {(X_{2}^{'} W^{+} X_{2})}^{-} X_{2}^{'} W^{+}] X_{1} .$ (4.11) Substituting Equation(4.10)(4.10) $X_{1}^{'} W^{+} X_{2} = 0,$ (4.10) into Equation(4.11)(4.11) $W {\dot{M}}_{2} X_{1} = [I_{n} - X_{2} {(X_{2}^{'} W^{+} X_{2})}^{-} X_{2}^{'} W^{+}] X_{1} .$ (4.11) we observe that (h) implies (g), and so far we have confirmed the equivalence between (a) and any of (e)–(h).

The statement (c) holds if and only if $(G_{1 #} + N_{2} Q_{W}) (X_{1} : V M_{1}) = (X_{1} : 0) for all N_{2} \in R^{n \times n},$ that is, $(G_{1 #} + N_{2} Q_{W}) V M_{1} = 0 for all N_{2} \in R^{n \times n},$ which holds if and only if $G_{1 #} V M_{1} = 0 .$ Thus (c) and (e) are equivalent.

The claim (a) holds if and only if $G_{1} (X_{1} : V) = G_{1 #} (X_{1} : V),$ which is precisely (l): $G_{1} V = G_{1 #} V .$ It is clear that (i) is equivalent to (j). Consider then $cov (G_{1 #} - G_{1}) y = G_{1 #} V G_{1 #}^{'} + G_{1} V G_{1}^{'} - G_{1 #} V G_{1}^{'} - G_{1} V G_{1 #}^{'} .$ Notice that $G_{1} T_{1} = G_{1 #} T_{1} = T_{1},$ where $T_{1} = X_{1} U_{1} U_{1}^{'} X_{1}^{'}$ and hence $\begin{matrix} G_{1} V G_{1}^{'} = G_{1} (W_{1} - T_{1}) G_{1}^{'} = X_{1} {(X'_{1} W_{1}^{+} X_{1})}^{-} X_{1}^{'} G_{1}^{'} - T_{1} \\ = X_{1} {(X'_{1} W_{1}^{+} X_{1})}^{-} X_{1}^{'} - T_{1}, \end{matrix}$ and $\begin{matrix} G_{1} V G_{1 #}^{'} = G_{1} (W_{1} - T_{1}) G_{1 #}^{'} = G_{1} W_{1} G_{1 #}^{'} - T_{1} \\ = X_{1} {(X'_{1} W_{1}^{-} X_{1})}^{-} X_{1}^{'} G_{1 #}^{'} - T_{1} \\ = X_{1} {(X'_{1} W_{1}^{+} X_{1})}^{-} X_{1}^{'} - T_{1} = G_{1} V G_{1}^{'} . \end{matrix}$ Thus $cov (G_{1 #} y - G_{1} y) = cov (G_{1 #} y) - cov (G_{1} y),$ and so (l) and (m) hold. Statement (l) obviously confirms the equivalence between (j) and (k). Property (n) is obvious. See also Remark 4.1. □

Next we consider the condition under which an arbitrary matrix from the set ${P_{μ_{1} | M_{1}}}$ provides the $BLUE$ for $μ_{1}$ under $M_{12} .$

Proposition 4.4.

The following statements are equivalent:

${BLUE (μ_{1} | M_{1})} \subseteq {BLUE (μ_{1} | M_{12})},$ that is, ${P_{μ_{1} | M_{1}}} \subseteq {P_{μ_{1} | M_{12}}},$
$(H_{1} - H_{1 #}) W = 0$ for all $H_{1} \in {P_{μ_{1} | M_{1}}}, H_{1 #} \in {P_{μ_{1} | M_{12}}},$
$C (X_{2}) \subseteq C (X_{1} : V M_{1}),$ that is, $C (W_{1}) = C (W),$ and $X_{1}^{'} W_{1}^{+} X_{2} = 0,$
$C (X_{2}) \subseteq C (V M_{1}),$
$C (X_{2} : VM) \subseteq C (V M_{1}),$
${BLUE (μ_{1} | M_{1})} = {BLUE (μ_{1} | M_{12})},$ that is, ${P_{μ_{1} | M_{1}}} = {P_{μ_{1} | M_{12}}},$
$C (X_{2} : VM) = C (V M_{1}) .$

Proof.

Notice first that (b) is simply an alternative way to express (a). The statement (a) holds if and only if $(G_{1} + N_{1} Q_{W_{1}}) (X_{1} : X_{2} : VM) = (X_{1} : 0 : 0) for all N_{1} \in R^{n \times n},$ that is, $(G_{1} + N_{1} Q_{W_{1}}) X_{2} = 0 for all N_{1} \in R^{n \times n},$ which holds if and only if $Q_{W_{1}} X_{2} = 0$ and $G_{1} X_{2} = 0,$ which is precisely (c). Moreover, (c) implies that (4.12) $X_{2} = X_{1} A + V M_{1} B$ (4.12) for some A and B and (4.13) $X_{1}^{'} W_{1}^{+} (X_{1} A + V M_{1} B) = X_{1}^{'} W_{1}^{+} X_{1} A = 0 .$ (4.13) Now Equation(4.13)(4.13) $X_{1}^{'} W_{1}^{+} (X_{1} A + V M_{1} B) = X_{1}^{'} W_{1}^{+} X_{1} A = 0 .$ (4.13) implies that $W_{1}^{+} X_{1} A = 0,$ which further implies that $X_{1} A = 0,$ so that by Equation(4.12)(4.12) $X_{2} = X_{1} A + V M_{1} B$ (4.12) we get (d). The claim (d) obviously implies (c). The equivalence between (d) and (e) is obvious because $C (VM) \subseteq C (V M_{1}) .$

It is clear that (f) implies (b). Thus to confirm the equivalence of (b) and (f) we have to show that (4.14) $(b) \Rightarrow {BLUE (μ_{1} | M_{12})} \subseteq {BLUE (μ_{1} | M_{1})} .$ (4.14) This follows at once from Proposition 4.3 by noting that the right-hand side of Equation(4.14)(4.14) $(b) \Rightarrow {BLUE (μ_{1} | M_{12})} \subseteq {BLUE (μ_{1} | M_{1})} .$ (4.14) means that $(H_{1} - H_{1 #}) W_{1} = 0 .$ The equivalence between (f) and (g) follows by combining part (d) of Proposition 4.4 and (k) of Proposition 4.3. □

Our next task is to find necessary and sufficient conditions for $G_{1} y = G_{1 #} y for all y \in C (W)$ when the inclusion $C (X_{2}) \subseteq C (X_{1} : V)$ holds.

Proposition 4.5.

Consider the models $M_{12}$ and $M_{1}$ and suppose that(4.15) $C (X_{2}) \subseteq C (X_{1} : V) = C (W_{1}), i . e ., C (W_{1}) = C (W) .$ (4.15) Then the following statements are equivalent:

$G_{1} W_{1} = G_{1 #} W_{1},$
$H_{1} W_{1} = H_{1 #} W_{1}$ for all $H_{1}$ and $H_{1 #},$
${BLUE (μ_{1} | M_{12})} \subseteq {BLUE (μ_{1} | M_{1})},$ that is, ${P_{μ_{1} | M_{12}}} \subseteq {P_{μ_{1} | M_{1}}},$
${BLUE (μ_{1} | M_{1})} \subseteq {BLUE (μ_{1} | M_{12})},$ that is, ${P_{μ_{1} | M_{1}}} \subseteq {P_{μ_{1} | M_{12}}},$
$BLUE (μ_{1} | M_{1}) = BLUE (μ_{1} | M_{12})$ with probability 1,
$X_{1}^{'} W_{1}^{+} X_{2} = 0,$
$C (X_{2}) \subseteq C (V M_{1}),$
$X_{1} C^{12} X_{2}^{'} = 0,$ where $C^{12}$ is defined as(4.16) ${(X' W_{1}^{+} X)}^{+} = {(\begin{matrix} X_{1}^{'} W_{1}^{+} X_{1} & X_{1}^{'} W_{1}^{+} X_{2} \\ X_{2}^{'} W_{1}^{+} X_{1} & X_{2}^{'} W_{1}^{+} X_{2} \end{matrix})}^{+} = (\begin{matrix} C^{11} & C^{12} \\ C^{21} & C^{22} \end{matrix}) .$ (4.16)

Proof.

The equivalence between (a)–(g) is obvious. Consider then part (h). Now we have (4.17) $D_{1 #} = (X_{1} : 0) {(X' W^{-} X)}^{-} X' W^{+} = (X_{1} : 0) {(X' W_{1}^{-} X)}^{-} X' W_{1}^{+} .$ (4.17) Hence (a) holds, under Equation(4.15)(4.15) $C (X_{2}) \subseteq C (X_{1} : V) = C (W_{1}), i . e ., C (W_{1}) = C (W) .$ (4.15) , if and only if $G_{1} W_{1} = D_{1 #} W_{1},$ that is, $G_{1} (X_{1} : V M_{1}) = D_{1 #} (X_{1} : V M_{1}) = (X_{1} : 0),$ that is, (4.18) $D_{1 #} V M_{1} = D_{1 #} W_{1} M_{1} = 0 .$ (4.18) Using Equation(4.17)(4.17) $D_{1 #} = (X_{1} : 0) {(X' W^{-} X)}^{-} X' W^{+} = (X_{1} : 0) {(X' W_{1}^{-} X)}^{-} X' W_{1}^{+} .$ (4.17) the equality Equation(4.18)(4.18) $D_{1 #} V M_{1} = D_{1 #} W_{1} M_{1} = 0 .$ (4.18) becomes (4.19) $\begin{matrix} D_{1 #} V M_{1} = (X_{1} : 0) {(X' W_{1}^{-} X)}^{-} X' W_{1}^{+} W_{1} M_{1} \\ = (X_{1} : 0) {(X' W_{1}^{-} X)}^{-} X' P_{W_{1}} M_{1} \\ = (X_{1} : 0) {(X' W_{1}^{-} X)}^{-} X' M_{1} \\ = (X_{1} : 0) {(X' W_{1}^{-} X)}^{-} (0 : M_{1} X_{2})' \\ = X_{1} C^{12} X_{2}^{'} M_{1} = 0, \end{matrix}$ (4.19) where $C^{12}$ is defined in Equation(4.16)(4.16) ${(X' W_{1}^{+} X)}^{+} = {(\begin{matrix} X_{1}^{'} W_{1}^{+} X_{1} & X_{1}^{'} W_{1}^{+} X_{2} \\ X_{2}^{'} W_{1}^{+} X_{1} & X_{2}^{'} W_{1}^{+} X_{2} \end{matrix})}^{+} = (\begin{matrix} C^{11} & C^{12} \\ C^{21} & C^{22} \end{matrix}) .$ (4.16) . In light of $r (X_{2}^{'} M_{1}) = r (X_{2}),$ we can cancel $M_{1}$ in the last expression in Equation(4.19)(4.19) $\begin{matrix} D_{1 #} V M_{1} = (X_{1} : 0) {(X' W_{1}^{-} X)}^{-} X' W_{1}^{+} W_{1} M_{1} \\ = (X_{1} : 0) {(X' W_{1}^{-} X)}^{-} X' P_{W_{1}} M_{1} \\ = (X_{1} : 0) {(X' W_{1}^{-} X)}^{-} X' M_{1} \\ = (X_{1} : 0) {(X' W_{1}^{-} X)}^{-} (0 : M_{1} X_{2})' \\ = X_{1} C^{12} X_{2}^{'} M_{1} = 0, \end{matrix}$ (4.19) . This proves the equivalence between (a) and (h). □

5. Conclusions

In this article we consider the partitioned linear model $M_{12} = {y, X_{1} β_{1} + X_{2} β_{2}, V}$ and the corresponding small model $M_{1} = {y, X_{1} β_{1}, V} .$ We focus on comparing the BLUEs of $μ_{1} = X_{1} β_{1}$ under $M_{12}$ and $M_{1} .$ The observed numerical value of the $BLUE$ is unique under the model $M_{1}$ if the $M_{1}$ is consistent in the sense that $y \in C (X_{1} : V)$ and the same uniqueness concerns the full model in the respective way. But now there may be some problems if we write (5.1) $BLUE (X_{1} β_{1} | M_{1}) = BLUE (X_{1} β_{1} | M_{12}) .$ (5.1) What is the meaning of the above equality? It is not fully clear because we know that under $M_{1}$ the values of y vary over $C (X_{1} : V)$ but under $M_{12}$ the values of y vary over $C (X_{1} : X_{2} : V)$ and these column spaces may be different. However, if $C (X_{1} : V) = C (X_{1} : X_{2} : V)$ there is no difficulties to interpret the equality Equation(5.1)(5.1) $BLUE (X_{1} β_{1} | M_{1}) = BLUE (X_{1} β_{1} | M_{12}) .$ (5.1) , which means that $Ay = By for all y \in C (X_{1} : V),$ where $Ay \in {BLUE (μ_{1} | M_{1})}$ and $By \in {BLUE (μ_{1} | M_{12})} .$

We consider the resulting problems by picking up particular fixed expressions for the $BLUE$ s of $μ_{1} = X_{1} β_{1}$ under these two models, and study the conditions under which they are equal for all values of $y \in C (X_{1} : X_{2} : V)$ or $y \in C (X_{1} : V) .$ Moreover, we review the conditions under which all representations of the $BLUE$ s in one model continue to be valid in the other model. Some related considerations, using different approach, have been made by Lu et al. (Citation2015), Tian (Citation2013), and Tian and Zhang (Citation2016).

Acknowledgements

Part of this research was done during meeting of an International Research Group on Multivariate and Mixed Linear Models in the Mathematical Research and Conference Center, Bȩdlewo, Poland, in November 2019 and February 2020. Thanks go to the anonymous referee for constructive remarks.

References

Groß, J., and S. Puntanen. 2000. Estimation under a general partitioned linear model. Linear Algebra and Its Applications 321, 131–44. doi:10.1016/S0024-3795(00)00028-8.
Web of Science ®Google Scholar
Güler, N., S. Puntanen, and H. Özdemir. 2014. On the BLUEs in two linear models via C.R. Rao’s Pandora’s Box. Communications in Statistics - Theory and Methods, 5, 43, 921–31. doi:10.1080/03610926.2013.826366.
Web of Science ®Google Scholar
Haslett, S. J. 1996. Updating linear models with dependent errors to include additional data and/or parameters. Linear Algebra and Its Applications 237(238):329–49.
Web of Science ®Google Scholar
Haslett, S. J., A. Markiewicz, and S. Puntanen. 2020. Properties of BLUEs and BLUPs in full vs. small linear models with new observations. In Recent developments in multivariate and random matrix analysis: Festschrift in honour of Dietrich von Rosen, eds. T. Holgersson and M. Singull, 123–46. Cham: Springer.
Google Scholar
Haslett, S. J., and S. Puntanen. 2010. Effect of adding regressors on the equality of the BLUEs under two linear models. Journal of Statistical Planning and Inference 140, 104–10. doi:10.1016/j.jspi.2009.06.010.
Web of Science ®Google Scholar
Lu, C., S. Gan, and Y. Tian. 2015. Some remarks on general linear model with new regressors. Statistics & Probability Letters, 97, 16–24. doi:10.1016/j.spl.2014.10.015.
Web of Science ®Google Scholar
Markiewicz, A, and S. Puntanen. 2019. Further properties of the linear sufficiency in the partitioned linear model. In Matrices, statistics and big data, eds. S. E. Ahmed, F. Carvalho and S. Puntanen, 1–22. Cham: Springer.
Google Scholar
Marsaglia, G, and G. P. H. Styan. 1974. Equalities and inequalities for ranks of matrices. Linear Multilinear Algebra 2:269–92.
Google Scholar
Mitra, S. K, and B. J. Moore. 1973. Gauss–Markov estimation with an incorrect dispersion matrix. Sankhyā Series A 35:139–52.
Google Scholar
Puntanen, S., G. P. H. Styan, and J. Isotalo. 2011. Matrix tricks for linear statistical models: our personal top twenty. Heidelberg: Springer.
Google Scholar
Rao, C. R. 1973. Representations of best linear estimators in the Gauss–Markoff model with a singular dispersion matrix. Journal of Multivariate Analysis 3, 276–92. doi:10.1016/0047-259X(73)90042-0.
Google Scholar
Rao, C. R. 1974. Projectors, generalized inverses and the BLUE’s. Journal of the Royal Statistical Society: Series B 36:442–8.
Google Scholar
Rao, C. R, and S. K. Mitra. 1971. Generalized inverse of matrices and its applications. New York: Wiley.
Google Scholar
Sengupta, D, and S. R. Jammalamadaka. 2003. Linear models: An integrated approach. River Edge: World Scientific.
Google Scholar
Tian, Y. 2013. On properties of BLUEs under general linear regression models. Journal of Statistical Planning and Inference 43, 771–82. doi:10.1016/j.jspi.2012.10.005.
Google Scholar
Tian, Y., and X. Zhang. 2016. On connections among OLSEs and BLUEs of whole and partial parameters under a general linear model. Statistics & Probability Letters 112, 105–12. doi:10.1016/j.spl.2016.01.019.
Web of Science ®Google Scholar
Werner, H.J., and C. Yapar. 1996. A BLUE decomposition in the general linear regression model. Linear Algebra and its Applications 237-238, 395–404. doi:10.1016/0024-3795(95)00542-0.
Web of Science ®Google Scholar

Properties of BLUEs in full versus small linear models

Abstract

1. Introduction

2. The fundamental BLUE equations

3. Some useful matrix results

4. Difference of the BLUEs under the full and small model

5. Conclusions

Acknowledgements

References

Information for

Open access

Opportunities

Help and information

Properties of BLUEs in full versus small linear models

Abstract

1. Introduction

2. The fundamental BLUE equations

3. Some useful matrix results

4. Difference of the BLUEs under the full and small model

5. Conclusions

Acknowledgements

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date