Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

This paper concerns with optimal designs for a wide class of nonlinear models with information driven by the linear predictor. The aim of this study is to generate an R-optimal design which minimizes the product of the main diagonal entries of the inverse of the Fisher information matrix at certain values of the parameters. An equivalence theorem for the locally R-optimal designs is provided in terms of the intensity function. Analytic solutions for the locally saturated R-optimal designs are derived for the models having linear predictors with and without intercept, respectively. The particle swarm optimization method has been employed to generate locally non-saturated R-optimal designs. Numerical examples are presented for illustration of the locally R-optimal designs for Poisson regression models and proportional hazards regression models.

Keywords:

1. Introduction

Generalized linear models (GLMs) have been used quite effectively in statistical modelling but the associated design issues are undoubtedly challenging, since the intensity function within their information matrices depends on the value of the linear predictor, which means that the optimal designs are related to the unknown parameters. Following Konstantinou et al. (Citation2014), we note that the information matrix of the proportional hazards regression models used in survival analysis also has the same feature when type I and random censoring are considered, and their intensity functions, which are similar to Poisson regression models and negative binomial regression models, are strictly monotonic rather than a symmetric structure that appears in the logistic and probit models. In this paper, we focus on such models with monotonic intensity functions.

In this direction, there is increasing interest in determining optimal designs under various criteria, especially for models with multiple covariates. Initial work was done by Konstantinou et al. (Citation2014) who provided analytical results of D- and c-optimal designs for this class of models with only one covariate. Subsequently, Schmidt and Schwabe (Citation2015) extended the results concerning D-optimality to one-dimensional discrete design space. For multiple regression, Schmidt and Schwabe (Citation2017) determined D-optimal designs by identifying a complete subclass, which contains the results of Russell et al. (Citation2009) for the Poisson regression model as a special case. Radloff and Schwabe (Citation2019) gave a construction method of D-optimal designs when the design region is a k-dimensional ball. Recently, Schmidt (Citation2019) systematically characterized c-, L- and $Φ_{k}$ -optimal designs for models with a single covariate and for multiple regression with an arbitrary number of covariates.

It should be noted that the linear predictor in the previously mentioned literature always includes an intercept term. The intercept in GLMs and proportional hazards regression models with censoring, respectively, characterizes the expected mean and expected survival time when all the explanatory variables are equal to zero. In this case, the linear predictor reflects the influence of all the unobserved fixed variables in these models. As pointed out by Idais and Schwabe (Citation2021), when the intercept is significantly zero, i.e., the average impact of all the unobserved fixed variables is significantly zero, one may claim that the model includes probably most variables which explain the outcome. For the gamma models without intercept, Idais and Schwabe (Citation2021) obtained some explicit solutions of D- and A-optimal designs in some multi-linear cases, including the two-factor model with interaction.

This paper aims to provide a characterization of R-optimality for multiple regression models with and without intercept. It is well known that the R-optimality criterion proposed by Dette (Citation1997) has a nice statistical interpretation, namely minimizing the volume of the Bonferroni rectangular confidence region of the regression parameters. Moreover, it satisfies an extremely useful invariance property which allows an easy calculation of optimal designs on many linearly transformed design spaces. This optimality has been frequently applied to the cases of multi-response experiments, multi-factor experiments and mixture experiments, see, e.g., X. Liu and Yue (Citation2020), P. Liu et al. (Citation2022) and Hao et al. (Citation2021) for some recent references.

In general, the dependence of designs on parameters whose values are unknown a priori occurs in nonlinear regression models. Hence, utilizing a pre-specified parameter value we can obtain the so-called locally optimal designs in accordance with Chernoff (Citation1953). In this paper, we concentrate on the construction of locally optimal designs and take R-optimal designs to replace locally R-optimal designs for simplicity. Some general notations and a brief introduction of the R-criterion are presented in Section 2. In Sections 3 and 4, we analytically and numerically determine R-optimal designs for models having an intensity function which only depends on the value of the linear predictor with and without intercept, respectively. A brief discussion is given in Section 5. All proofs are included in the Appendix.

2. Model specification and R-optimality criterion

Throughout the paper, we focus on a class of nonlinear multiple regression models with information driven by the linear predictor defined on a given design region $X$ , and consider the approximate designs ξ of the form $ξ = {\begin{array}{cccc} x_{1} & x_{2} & \dots & x_{m} \\ ω_{1} & ω_{2} & \dots & ω_{m} \end{array}}, x_{i} \in X, 0 ⩽ ω_{i} ⩽ 1, \sum_{i = 1}^{m} ω_{i} = 1$ or simply $ξ = {x_{i}; ω_{i}}_{i = 1}^{m}$ (see Silvey, Citation1980, p. 15). The Fisher information matrix of ξ with independent observations is assumed to be (1) $M (ξ, β) = \int_{X} Q (f (x)^{⊤} β) f (x) f (x)^{⊤} d ξ (x) = \sum_{i = 1}^{m} ω_{i} Q (f (x_{i})^{⊤} β) f (x_{i}) f (x_{i})^{⊤},$ (1) where $Q ≢ 0$ is the intensity/efficiency function (see Fedorov, Citation1972, p. 39) which only depends on the value of the linear predictor, $f$ is a $p \times 1$ vector of known regression functions, and $β \in R^{p}$ denotes the vector of p unknown parameters.

This kind of information matrix is common in the widely used generalized linear models, while it may also arise in other models such as the exponential regression models in proportional hazards parametrization with various censoring, including type I and random censoring (see Konstantinou et al., Citation2014) as well as other censoring distributions (see Schmidt & Schwabe, Citation2017). Following Konstantinou et al. (Citation2014), we further assume that the intensity function Q satisfies the following conditions.

(A1)	$Q (θ)$ is positive for all $θ \in R$ and twice continuously differentiable.
(A2)	The derivative $Q^{'} (θ)$ is positive for all $θ \in R$ .
(A3)	The second derivative $g^{''} (θ)$ of the function $g (θ) = 1 / Q (θ)$ is injective.
(A4)	The function $Q (θ) / Q^{'} (θ)$ is an increasing function.

It is clear that the intensity functions induced by Poisson regression models, negative binomial regression models and proportional hazards regression models with type I and random censoring abide by all the conditions above.

In what follows, we concentrate on the R-optimality of designs which minimizes the product of the diagonal entries of the inverse of the Fisher information matrix. A design $ξ^{*} \in Ξ$ is called R-optimal if it minimizes (2) $ψ (ξ, β) = \prod_{j = 1}^{p} {(M (ξ, β)^{- 1})}_{j j} = \prod_{j = 1}^{p} e_{j, p}^{⊤} M (ξ, β)^{- 1} e_{j, p}$ (2) over Ξ, where Ξ is the set of all designs with a non-singular information matrix on $X$ , and $e_{j, p}$ denotes the jth unit vector in $R^{p}$ . An important tool in optimal design theory is equivalence theorems that not only provide a characterization of the optimal design but also are the basis of many algorithms for their numerical construction (see, e.g., Yang et al., Citation2013; Freise et al., Citation2021). The following result gives the equivalence theorem for R-optimality.

Theorem 2.1

A design $ξ^{*} \in Ξ$ is R-optimal if and only if (3) $ϕ (x, β) = Q (f (x)^{⊤} β) f (x)^{⊤} M (ξ^{*}, β)^{- 1} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ^{*}, β)^{- 1} e_{j, p}}) M (ξ^{*}, β)^{- 1} f (x) ⩽ p$ (3) holds for all $x \in X$ . Moreover, it is equal at the support points of the design $ξ^{*}$ .

In Sections 3 and 4 the minimally supported designs (i.e., the so-called saturated designs) will appear as candidates for determining R-optimal designs. A design $ξ = {x_{i}; ω_{i}}_{i = 1}^{m}$ has minimal support if the number of support points is equal to the number of parameters, i.e., m = p. Let $D_{ω}$ = $d i a g (ω_{1}, \dots, ω_{m})$ , $\tilde{X}$ = $Q^{1 / 2} X$ with $X = (f (x_{1}), \dots, f (x_{m}))^{⊤}$ and $Q^{1 / 2}$ = $d i a g (\sqrt{Q (f (x_{1})^{⊤} β)},$ $\dots, \sqrt{Q (f (x_{m})^{⊤} β)})$ . Accordingly, the information matrix (Equation1(1) $M (ξ, β) = \int_{X} Q (f (x)^{⊤} β) f (x) f (x)^{⊤} d ξ (x) = \sum_{i = 1}^{m} ω_{i} Q (f (x_{i})^{⊤} β) f (x_{i}) f (x_{i})^{⊤},$ (1) ) for the design ξ can be decomposed as $M (ξ, β)$ = ${\tilde{X}}^{⊤} D_{ω} \tilde{X}$ . Furthermore, if the design ξ is saturated and the regression vectors located at the rows of $X$ are linearly independent, the following result exhibits the optimal weights of a design with minimal support for the R-optimality criterion.

Lemma 2.1

Pukelsheim & Torsney, Citation1993

The R-optimal weights for a saturated design ξ are given by (4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) where $s_{11}, \dots, s_{p p}$ are the diagonal entries of the matrix (5) $S = {({\tilde{X}}^{- 1})}^{⊤} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ, β)^{- 1} e_{j, p}}) {\tilde{X}}^{- 1} .$ (5)

Remark 2.1

Since the diagonal entries $s_{j j}$ of the matrix $S$ depend on the weights, a fixed point iteration procedure is available to determine a solution of optimal weights. In step r + 1, the weight $ω_{j}^{(r + 1)}$ is equivalent to $\sqrt{s_{j j} (ω_{1}^{(r)}, \dots, ω_{p}^{(r)}) / p}$ under a given initial weight vector $(ω_{1}^{(0)}, \dots, ω_{p}^{(0)})^{⊤}$ .

3. R-optimal designs for models with intercept

In this section, we will describe how to construct R-optimal designs for multiple regression models with information matrices of the form (Equation1(1) $M (ξ, β) = \int_{X} Q (f (x)^{⊤} β) f (x) f (x)^{⊤} d ξ (x) = \sum_{i = 1}^{m} ω_{i} Q (f (x_{i})^{⊤} β) f (x_{i}) f (x_{i})^{⊤},$ (1) ) and satisfying the assumptions (A1)–(A4). More precisely, multiple regression with additive linear effects of the covariates in the linear predictor that contains an intercept term will be considered.

We consider the multi-linear case which has $f = (1, x^{⊤})^{⊤}$ with $x = (x_{1}, \dots, x_{p - 1})^{⊤} \in X \subset R^{p - 1}$ and denote the parameter vector $β = (β_{0}, β_{1}, \dots, β_{p - 1})^{⊤}$ for convenience. Suppose that the design region $X$ is a multi-dimensional polyhedron. By applying the complete class results described in Theorem 2 and Lemma 1 of Schmidt (Citation2019), R-optimal designs can be found from the complete subclass, which has at most two support points on each edge of $X$ when the hyperplanes $H_{η} = {x \in R^{p - 1} : f (x)^{⊤} β = η}$ are assumed to be bounded on $X$ for all $η \in R$ ,

The following theorem provides an approach to generate R-optimal designs on the rectangular design region for the model under consideration, the proof of which can be established by using the similar arguments as in Theorem 9 in Schmidt (Citation2019) via the fact $(\sum_{j = 1}^{p} \sqrt{s_{j j}})^{2} = \sqrt{p}$ . It should be pointed out that using the same reasoning as in the proof of Theorems 8 and 9 in Schmidt (Citation2019) the uniqueness of the solution of the common system of equations described in Theorem 3.1 follows. Here the case p = 2 with one covariate is also satisfied, so $p ⩾ 2$ .

Theorem 3.1

Let $X = [u_{1}, v_{1}] \times \dots \times [u_{p - 1}, v_{p - 1}]$ and let the assumptions (A1)–(A4) be satisfied for a model with information matrices of the form (Equation1(1) $M (ξ, β) = \int_{X} Q (f (x)^{⊤} β) f (x) f (x)^{⊤} d ξ (x) = \sum_{i = 1}^{m} ω_{i} Q (f (x_{i})^{⊤} β) f (x_{i}) f (x_{i})^{⊤},$ (1) ). Let $β_{i} \neq 0$ for $i = 1, \dots, p - 1$ . Define $a_{i} = v_{i}$ if $β_{i} > 0$ , and $a_{i} = u_{i}$ if $β_{i} < 0$ . Let $s_{i j}$ $(i, j = 1, \dots, p)$ denote the elements of the matrix $S$ in (Equation5(5) $S = {({\tilde{X}}^{- 1})}^{⊤} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ, β)^{- 1} e_{j, p}}) {\tilde{X}}^{- 1} .$ (5) ) evaluated at a design with support points $a - (x_{1} / β_{1}) e_{1, p - 1}, \dots, a - (x_{p - 1} / β_{p - 1}) e_{p - 1, p - 1}$ and $a = (a_{1}, \dots, a_{p - 1})^{⊤}$ . If a solution exists, let $x_{i}^{*} \in (0, \infty)$ and the weights $ω_{i}^{*}$ for $i = 1, \dots, p - 1$ be the unique solutions of the common system of Equations (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ) and (Equation6(6) $x_{i} - 2 \frac{Q (f (a)^{⊤} β - x_{i})}{Q^{'} (f (a)^{⊤} β - x_{i})} [1 - \sqrt{\frac{Q (f (a)^{⊤} β - x_{i})}{Q (f (a)^{⊤} β)}} \frac{s_{1, i + 1}}{\sqrt{s_{11} s_{i + 1, i + 1}}}] = 0.$ (6) ) given by (6) $x_{i} - 2 \frac{Q (f (a)^{⊤} β - x_{i})}{Q^{'} (f (a)^{⊤} β - x_{i})} [1 - \sqrt{\frac{Q (f (a)^{⊤} β - x_{i})}{Q (f (a)^{⊤} β)}} \frac{s_{1, i + 1}}{\sqrt{s_{11} s_{i + 1, i + 1}}}] = 0.$ (6) If $x_{i}^{*} ⩽ | β_{i} | (v_{i} - u_{i})$ holds for $i = 1, \dots, p - 1$ , then the design $ξ^{*} = {\begin{matrix} a - (x_{1}^{*} / β_{1}) e_{1, p - 1} & \dots & a - (x_{p - 1}^{*} / β_{p - 1}) e_{p - 1, p - 1} & a \\ ω_{1}^{*} & \dots & ω_{p - 1}^{*} & ω_{p}^{*} \end{matrix}}$ is a unique R-optimal design.

Remark 3.1

Let $X = [u_{1}, v_{1}] \times \dots \times [u_{p - 1}, v_{p - 1}]$ , when some of the parameters $β_{1}, \dots, β_{p - 1}$ are equal to zero. It follows from Lemma 1 of Schmidt (Citation2019) that the two endpoints of the corresponding edges must be the support points of optimal design, and the number of support points is possibly over p. This means that we need to use some numerical algorithms to generate the R-optimal design, such as the commonly used particle swarm optimization (PSO) method, see Section 4.2 for details.

The following two examples illustrate the results in Theorem 3.1 and Remark 3.1, and exhibit the performance of different designs in terms of the Bonferroni confidence intervals and the relative efficiencies of designs. In Example 3.1 the first-order Poisson regression model with intercept will be considered, and the difference of the averaged Bonferroni confidence intervals derived by R- and D-optimal designs as well as the balanced design $ξ_{b}$ will be investigated by a numerical simulation. Example 3.2 considers the Poisson regression model with two covariants which was discussed in Schmidt (Citation2019), and the relative efficiency of R-optimal designs will be compared with other designs. In both of the examples, the intensity function Q is given by $Q (θ) = \exp (θ)$ for $θ \in R$ .

Example 3.1

Consider the first-order Poisson regression model with intercept and let $X = [0, 5]$ . If we set $β = (6, - 1)^{⊤}$ and $β = (1, 1)^{⊤}$ , the R- and D-optimal designs can be calculated numerically by Theorem 3.1 and Theorem 2 of Konstantinou et al. (Citation2014). For example, under the R-optimality criterion with $β = (6, - 1)^{⊤}$ we obtain $x_{1}^{*} = 2.1886$ and $ω_{2}^{*} = 0.5431$ by solving the Equations (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ) and (Equation6(6) $x_{i} - 2 \frac{Q (f (a)^{⊤} β - x_{i})}{Q^{'} (f (a)^{⊤} β - x_{i})} [1 - \sqrt{\frac{Q (f (a)^{⊤} β - x_{i})}{Q (f (a)^{⊤} β)}} \frac{s_{1, i + 1}}{\sqrt{s_{11} s_{i + 1, i + 1}}}] = 0.$ (6) ). In Table , the results on optimal designs for both cases are reported. Furthermore, we carry out a simulation study in order to assess the difference of the Bonferroni confidence intervals derived by different designs. The averaged Bonferroni confidence intervals under different sample sizes are calculated by 10,000 simulation runs, which are shown in Table . We find from Table that for both cases the coverage probabilities perform in a similar manner and much close to 0.95 as the sample size increases, and the width of the averaged Bonferroni confidence intervals obtained from R-optimal designs is narrower comparing with D-optimal designs and the balanced design $ξ_{b}$ when there exists a serious loss of R-efficiency.

Table 1. The simulation results obtained from the locally R-, D-optimal designs and the balanced design on $X = [0, 5]$ for the first-order Poisson regression model with intercept.

Display Table

Example 3.2

Assume that the linear predictor in the considered Poisson regression model includes an intercept term. Let $X = [0, 5]^{2}$ and $β = (0, - 1, - 1)^{⊤}$ . A numerical calculation yields immediately $x_{1}^{*} = x_{2}^{*} = 2.1785$ , $ω_{1}^{*} = ω_{2}^{*} = 0.3060$ and $ω_{3}^{*} = 0.3880$ by the Equations (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ) and (Equation6(6) $x_{i} - 2 \frac{Q (f (a)^{⊤} β - x_{i})}{Q^{'} (f (a)^{⊤} β - x_{i})} [1 - \sqrt{\frac{Q (f (a)^{⊤} β - x_{i})}{Q (f (a)^{⊤} β)}} \frac{s_{1, i + 1}}{\sqrt{s_{11} s_{i + 1, i + 1}}}] = 0.$ (6) ). The R-optimal design is given in Table . The PSO method was used to find the corresponding D-, R-, and A-optimal designs when $β = (0, - 1, 0)^{⊤}$ . The resulting designs are listed in Table . In this case, we find that the endpoints $(0, 0)$ and $(0, 5)$ must be the support points of optimal designs. Figure displays the plot of the function $ϕ (x, β)$ defined in (Equation3(3) $ϕ (x, β) = Q (f (x)^{⊤} β) f (x)^{⊤} M (ξ^{*}, β)^{- 1} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ^{*}, β)^{- 1} e_{j, p}}) M (ξ^{*}, β)^{- 1} f (x) ⩽ p$ (3) ) for the cases $β = (0, - 1, - 1)^{⊤}$ and $β = (0, - 1, 0)^{⊤}$ , which indicates that the function ϕ attains its maximum 3 at each support point.

Figure 1. Plot of the function ϕ in (Equation3(3) $ϕ (x, β) = Q (f (x)^{⊤} β) f (x)^{⊤} M (ξ^{*}, β)^{- 1} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ^{*}, β)^{- 1} e_{j, p}}) M (ξ^{*}, β)^{- 1} f (x) ⩽ p$ (3) ) for the R-optimal design on $X = [0, 5]^{2}$ for the Poisson regression model discussed in Example 3.2: (a) for $β = (0, - 1, - 1)^{⊤}$ and (b) for $β = (0, - 1, 0)^{⊤}$ .

Figure 1. Plot of the function ϕ in (Equation3(3) ϕ(x,β)=Q(f(x)⊤β)f(x)⊤M(ξ∗,β)−1(∑j=1pej,pej,p⊤ej,p⊤M(ξ∗,β)−1ej,p)M(ξ∗,β)−1f(x)⩽p(3) ) for the R-optimal design on X=[0,5]2 for the Poisson regression model discussed in Example 3.2: (a) for β=(0,−1,−1)⊤ and (b) for β=(0,−1,0)⊤.

Table 2. Comparison of R-optimal design with D- and A-optimal designs on $X = [0, 5]^{2}$ for the Poisson regression model discussed in Example 3.2.

Display Table

To compare the performance of different designs, such as the common D- and A-optimality, we may calculate the related efficiency that usually defines the value of the criterion function for the optimal design relative to the value of the criterion function of a design and can thus take values between 0 and 1. For instance, the R-efficiency is defined as ${E f f}_{R} (ξ) = ψ (ξ^{*}, β) / ψ (ξ, β)$ . The results are summarized in Table . For the case $β = (0, - 1, - 1)^{⊤}$ , all the designs have relatively high efficiencies regarding the other optimality criteria. This occurs primarily because the designs have a similar structure. For the case $β = (0, - 1, 0)^{⊤}$ , we observe that the R-optimal design still has relatively high D- and A-efficiencies, but the A-optimal design has a large loss of R-efficiency.

Theorem 3.2

Let $X = [0, \infty)^{p - 1}$ and let the assumptions (A1)–(A4) be satisfied for models with information matrices of the form (Equation1(1) $M (ξ, β) = \int_{X} Q (f (x)^{⊤} β) f (x) f (x)^{⊤} d ξ (x) = \sum_{i = 1}^{m} ω_{i} Q (f (x_{i})^{⊤} β) f (x_{i}) f (x_{i})^{⊤},$ (1) ). If $β_{i} < 0$ for all $i = 1, \dots, p - 1$ , then the solutions $x_{i}^{*}$ and $ω_{i}^{*}$ from the system of Equations (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ) and (Equation6(6) $x_{i} - 2 \frac{Q (f (a)^{⊤} β - x_{i})}{Q^{'} (f (a)^{⊤} β - x_{i})} [1 - \sqrt{\frac{Q (f (a)^{⊤} β - x_{i})}{Q (f (a)^{⊤} β)}} \frac{s_{1, i + 1}}{\sqrt{s_{11} s_{i + 1, i + 1}}}] = 0.$ (6) ) do not depend on the parameter vector $(β_{1}, \dots, β_{p - 1})^{⊤}$ , that is, the R-optimal weights are unchanged for different $(β_{1}, \dots, β_{p - 1})^{⊤}$ when $β_{0}$ is fixed.

Remark 3.2

The optimal weights of saturated designs for L- and $Φ_{k}$ -optimality criteria, except for D-optimality, depend on the parameters $(β_{1}, \dots, β_{p - 1})^{⊤}$ in the same settings of Theorem 3.2.

To further illustrate Theorem 3.2, we consider the proportional hazards regression models with two types of censoring. Under type I censoring with a fixed censoring time c the intensity function Q is given by $Q (θ) = 1 - \exp (- c \exp (θ))$ . For random censoring the intensity function $Q (θ)$ equals to $1 - [1 - \exp (- c \exp (θ))] / [c \exp (θ)]$ if the censoring times are assumed to follow a uniform distribution $U (0, c)$ (see Konstantinou et al., Citation2014). For both the above-mentioned intensity functions the variable θ belongs to $R$ .

Example 3.3

For the proportional hazards regression models with type I and random censoring Schmidt (Citation2019) discussed the behaviour of c- and $Φ_{k}$ -optimal designs for different parameter values. Their results are consistent with Remark 3.2. Analogous to Schmidt (Citation2019), we investigate the performance of R-optimal designs on the design region $X = [0, 3]^{p - 1}$ by comparing with a balanced design, say $ξ_{b}$ , which is supported at all vertices of $X$ with equal weights. First, the censoring time c as an unknown parameter should be determined previously, which is closely related to the amount of censoring q. Here q is also called the overall probability of censoring and given by $q = 1 - \sum ω_{i} P (Y_{j} < C)$ , in which $Y_{i}$ is the survival time and C is the censoring distribution (see Kalish & Harrington, Citation1988). For the two-covariate case, we choose c = 32 for type I censoring and c = 69 for random censoring such that q is, respectively, equal to 60% for $β = (3, - 2.5, - 2.5)^{⊤}$ , 71% for $β = (3, - 3, - 3)^{⊤}$ and $75 %$ for $β = (3, - 4, - 4)^{⊤}$ when the balanced design $ξ_{b}$ on $X = [0, 3]^{2}$ had been used. The R-optimal designs on $X = [0, 3]^{2}$ and R-efficiencies of $ξ_{b}$ are summarized in Table . We observe from Table that the R-optimal designs shift the support points on the edges towards the vertex $(0, 0)^{⊤}$ and the R-efficiencies of the balanced design $ξ_{b}$ decrease with increasing amounts of censoring. Note also that the R-efficiency of $ξ_{b}$ is quite low for these censoring scenarios. In addition, the R-optimal weights are the same for each model by Theorem 3.2.

Table 3. R-optimal designs on $X = [0, 3]^{2}$ for proportional hazards regression models for different $β$ and R-efficiencies of the balanced design $ξ_{b}$ .

Display Table

4. R-optimal designs for models without intercept

In the present section we turn to discuss the multi-linear case without intercept, where the vector of the regressor functions is specified as $f = (x_{1}, \dots, x_{p})^{⊤} \in X$ , $p ⩾ 2$ , and the parameter vector is denoted by $β = (β_{1}, \dots, β_{p})^{⊤}$ . The regression model without an intercept is common and it usually arises from the physical characteristics of the variables measured. The design issue for this model has attracted considerable attention in the literature (see, e.g., Idais & Schwabe, Citation2021; Li et al., Citation2005 and the references cited there).

Let $X = [u_{1}, v_{1}] \times \dots \times [u_{p}, v_{p}]$ . Then the support points of R-optimal designs are also at the edges of $X$ by Theorem 2 in Schmidt (Citation2019). Moreover, from the proof of Theorem 8 in Schmidt (Citation2019) the support points of an R-optimal design $ξ^{*}$ must be given by $a - (x_{1} / β_{1}) e_{1, p - 1}, \dots, a - (x_{p} / β_{p}) e_{p, p}$ and $a = (a_{1}, \dots, a_{p})^{⊤}$ with $a_{i} = v_{i}$ if $β_{i} > 0$ and $a_{i} = u_{i}$ if $β_{i} < 0$ for $i = 1, \dots, p$ . As a result, the optimality condition for approximate designs in Theorem 2.1 can be simplified to verify whether it is satisfied at the boundary of the design region, i.e., the design $ξ^{*}$ is R-optimal if and only if (7) $h_{i} (x_{i}, ξ^{*}) = ℓ_{i} (x_{i}, ξ^{*}) - \frac{p}{Q (a^{⊤} β + β_{i} (x_{i} - a_{i}))} ⩽ 0$ (7) holds for all $x_{i}$ , $i = 1, \dots, p$ , where the function $ℓ_{i} (x_{i}, ξ^{*})$ is given by $ℓ_{i} (x_{i}, ξ^{*}) = (a + (x_{i} - a_{i}) e_{i, p})^{⊤} M (ξ^{*}, β)^{- 1} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ^{*}, β)^{- 1} e_{j, p}}) M (ξ^{*}, β)^{- 1} (a + (x_{i} - a_{i}) e_{i, p}) .$

4.1. Some theoretical results on saturated designs

In this subsection, we will provide two types of saturated R-optimal designs for the model considered by distinguishing whether the vertex $a$ is a support point. The first result reveals designs which include the support point $a$ , where the condition $a_{τ} \neq 0$ , $τ \in {1, \dots, p}$ , described in Theorem 4.1 is to guarantee the design $ξ_{τ}^{*}$ in (Equation9(9) $ξ_{τ}^{*} = {\begin{matrix} a - (x_{1}^{*} / β_{1}) e_{1, p} & \dots & a - (x_{τ}^{*} / β_{τ}) e_{τ, p} & \dots & a - (x_{p}^{*} / β_{p}) e_{p, p} \\ ω_{1}^{*} & \dots & ω_{τ}^{*} & \dots & ω_{p}^{*} \end{matrix}}$ (9) ) with a non-singular information matrix.

Theorem 4.1

Let the assumptions (A1)–(A4) be satisfied for models with information matrices of the form (Equation1(1) $M (ξ, β) = \int_{X} Q (f (x)^{⊤} β) f (x) f (x)^{⊤} d ξ (x) = \sum_{i = 1}^{m} ω_{i} Q (f (x_{i})^{⊤} β) f (x_{i}) f (x_{i})^{⊤},$ (1) ) and without intercept. Let $s_{i j}$ $(i, j = 1, \dots, p)$ be the elements of the matrix $S$ in (Equation5(5) $S = {({\tilde{X}}^{- 1})}^{⊤} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ, β)^{- 1} e_{j, p}}) {\tilde{X}}^{- 1} .$ (5) ) evaluated at a design of the form ${a - (x_{i} / β_{i}) e_{i, p}; ω_{i}}_{i = 1}^{p}$ . For a given index τ $(τ \in {1, \dots, p})$ with $a_{τ} \neq 0$ , if a solution exists, let $x_{i}^{*} \in (0, \infty)$ and the weights $ω_{i}^{*}$ be the unique solutions of the common system of Equations (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ) and (Equation8(8) $x_{i} - 2 \frac{Q (a^{⊤} β - x_{i})}{Q^{'} (a^{⊤} β - x_{i})} [1 - \sqrt{\frac{Q (a^{⊤} β - x_{i})}{Q (a^{⊤} β)}} \frac{s_{τ i}}{\sqrt{s_{τ τ} s_{i i}}}] = 0$ (8) ) given by (8) $x_{i} - 2 \frac{Q (a^{⊤} β - x_{i})}{Q^{'} (a^{⊤} β - x_{i})} [1 - \sqrt{\frac{Q (a^{⊤} β - x_{i})}{Q (a^{⊤} β)}} \frac{s_{τ i}}{\sqrt{s_{τ τ} s_{i i}}}] = 0$ (8) for all i $(\neq τ)$ . If $0 < x_{i}^{*} ⩽ | β_{i} | (v_{i} - u_{i})$ holds, then the design (9) $ξ_{τ}^{*} = {\begin{matrix} a - (x_{1}^{*} / β_{1}) e_{1, p} & \dots & a - (x_{τ}^{*} / β_{τ}) e_{τ, p} & \dots & a - (x_{p}^{*} / β_{p}) e_{p, p} \\ ω_{1}^{*} & \dots & ω_{τ}^{*} & \dots & ω_{p}^{*} \end{matrix}}$ (9) with $x_{τ}^{*} = 0$ will be R-optimal on $X$ provided that the condition $h_{τ} (x_{τ}, ξ_{τ}^{*}) < 0$ $(o r ϕ (a + (x_{τ} - a_{τ}) e_{τ, p}, β) < p)$ for $x_{τ} \in [u_{τ}, v_{τ}] ∖ {a_{τ}}$ is satisfied, where $h_{τ} (\cdot, \cdot)$ is defined as in (Equation7(7) $h_{i} (x_{i}, ξ^{*}) = ℓ_{i} (x_{i}, ξ^{*}) - \frac{p}{Q (a^{⊤} β + β_{i} (x_{i} - a_{i}))} ⩽ 0$ (7) ) and $ϕ (\cdot, \cdot)$ as in (Equation3(3) $ϕ (x, β) = Q (f (x)^{⊤} β) f (x)^{⊤} M (ξ^{*}, β)^{- 1} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ^{*}, β)^{- 1} e_{j, p}}) M (ξ^{*}, β)^{- 1} f (x) ⩽ p$ (3) ).

Example 4.1

Consider the Poisson regression model with two covariates and without intercept, where the design region is $X = [0, 5]^{2}$ . To find the R-optimal design we first fix $β = (- 0.5, 0.5)^{⊤}$ . According to Theorem 4.1 we can only specify $τ = 2$ , i.e., $x_{2}^{*} = 0$ . By solving the common system of Equations (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ) and (Equation10(10) $x_{i} - 2 \frac{Q (a^{⊤} β - x_{i})}{Q^{'} (a^{⊤} β - x_{i})} [\frac{1}{1 - z^{⊤} 1_{p}} \sum_{j = 1}^{p} \frac{z_{j} \sqrt{Q (a^{⊤} β - x_{i})}}{\sqrt{Q (a^{⊤} β - x_{j})}} \frac{s_{j i}}{\sqrt{s_{i i} s_{j j}}} + 1] = 0$ (10) ) we obtain $x_{1}^{*} = 2.1886$ and the design $ξ_{2}^{*}$ is of the form $ξ_{2}^{*} = {\begin{matrix} (4.3772, 5) & (0, 5) \\ 0.4569 & 0.5431 \end{matrix}} .$ It is easily verified that the condition $h_{2} (x_{2}, ξ_{2}^{*}) < 0$ is satisfied for $x_{2} \in [0, 5)$ and the design $ξ_{2}^{*}$ is then R-optimal (see also Figure (a)). If we choose $β = (1, 1)^{⊤}$ , however, it follows from Theorem 4.1 that the designs $ξ_{1}^{*}$ and $ξ_{2}^{*}$ are given by $ξ_{1}^{*} = {\begin{matrix} (2.4678, 5) & (5, 5) \\ 0.8234 & 0.1766 \end{matrix}}, ξ_{2}^{*} = {\begin{matrix} (5, 2.4678) & (0, 5) \\ 0.8234 & 0.1766 \end{matrix}},$ respectively. In this case, the conditions $h_{i} (x_{i}, ξ_{i}^{*}) < 0$ , i = 1, 2, are not satisfied, which means that both designs are not R-optimal. Accordingly, finding a saturated R-optimal design for this model that does not contain the support point $a$ may be an alternative scheme. This type of R-optimal design will be elaborated in Theorem 4.2.

Figure 2. Plots of the functions ϕ in (Equation3(3) $ϕ (x, β) = Q (f (x)^{⊤} β) f (x)^{⊤} M (ξ^{*}, β)^{- 1} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ^{*}, β)^{- 1} e_{j, p}}) M (ξ^{*}, β)^{- 1} f (x) ⩽ p$ (3) ) for the R-optimal designs on $X = [0, 5]^{2}$ for the Poisson regression models discussed in Examples 4.1 and 4.2: (a) for $β = (- 0.5, 0.5)^{⊤}$ and (b) for $β = (1, 1)^{⊤}$ .

Figure 2. Plots of the functions ϕ in (Equation3(3) ϕ(x,β)=Q(f(x)⊤β)f(x)⊤M(ξ∗,β)−1(∑j=1pej,pej,p⊤ej,p⊤M(ξ∗,β)−1ej,p)M(ξ∗,β)−1f(x)⩽p(3) ) for the R-optimal designs on X=[0,5]2 for the Poisson regression models discussed in Examples 4.1 and 4.2: (a) for β=(−0.5,0.5)⊤ and (b) for β=(1,1)⊤.

Theorem 4.2

Let the assumptions (A1)–(A4) be satisfied for models with information matrices of the form (Equation1(1) $M (ξ, β) = \int_{X} Q (f (x)^{⊤} β) f (x) f (x)^{⊤} d ξ (x) = \sum_{i = 1}^{m} ω_{i} Q (f (x_{i})^{⊤} β) f (x_{i}) f (x_{i})^{⊤},$ (1) ) and without intercept. Let $s_{i j}$ $(i, j = 1, \dots, p)$ be the elements of the matrix $S$ in (Equation5(5) $S = {({\tilde{X}}^{- 1})}^{⊤} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ, β)^{- 1} e_{j, p}}) {\tilde{X}}^{- 1} .$ (5) ) evaluated at a design of the form ${a - (x_{i} / β_{i}) e_{i, p}; ω_{i}}_{i = 1}^{p}$ . If a solution exists, let $x_{i}^{*} \in (0, \infty)$ and the weights $ω_{i}^{*}$ be the unique solutions of the common system of Equations (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ) and (Equation10(10) $x_{i} - 2 \frac{Q (a^{⊤} β - x_{i})}{Q^{'} (a^{⊤} β - x_{i})} [\frac{1}{1 - z^{⊤} 1_{p}} \sum_{j = 1}^{p} \frac{z_{j} \sqrt{Q (a^{⊤} β - x_{i})}}{\sqrt{Q (a^{⊤} β - x_{j})}} \frac{s_{j i}}{\sqrt{s_{i i} s_{j j}}} + 1] = 0$ (10) ) given by (10) $x_{i} - 2 \frac{Q (a^{⊤} β - x_{i})}{Q^{'} (a^{⊤} β - x_{i})} [\frac{1}{1 - z^{⊤} 1_{p}} \sum_{j = 1}^{p} \frac{z_{j} \sqrt{Q (a^{⊤} β - x_{i})}}{\sqrt{Q (a^{⊤} β - x_{j})}} \frac{s_{j i}}{\sqrt{s_{i i} s_{j j}}} + 1] = 0$ (10) for all i, where $z = (z_{1}, \dots, z_{p})^{⊤}$ with $z_{j} = a_{j} β_{j} / x_{j}$ , $1_{p} = (1, \dots, 1)^{⊤} \in R^{p}$ . If $0 < x_{i}^{*} ⩽ | β_{i} | (v_{i} - u_{i})$ holds, then the design (11) $ξ^{*} = {\begin{matrix} a - (x_{1}^{*} / β_{1}) e_{1, p} & \dots & a - (x_{p}^{*} / β_{p}) e_{p, p} \\ ω_{1}^{*} & \dots & ω_{p}^{*} \end{matrix}}$ (11) will be R-optimal on $X$ provided that the condition $h_{i} (a_{i}, ξ^{*}) < 0$ $(o r ϕ (a, β) < p)$ is satisfied.

Example 4.2

Consider the same model given in Example 4.1. Here we consider $β = (1, 1)^{⊤}$ . The common system of Equations (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ) and (Equation10(10) $x_{i} - 2 \frac{Q (a^{⊤} β - x_{i})}{Q^{'} (a^{⊤} β - x_{i})} [\frac{1}{1 - z^{⊤} 1_{p}} \sum_{j = 1}^{p} \frac{z_{j} \sqrt{Q (a^{⊤} β - x_{i})}}{\sqrt{Q (a^{⊤} β - x_{j})}} \frac{s_{j i}}{\sqrt{s_{i i} s_{j j}}} + 1] = 0$ (10) ) has the solutions $x_{1}^{*} = x_{2}^{*} = 1.8755$ and $ω_{1}^{*} = ω_{2}^{*} = 0.5$ . Then the design $ξ^{*} = {\begin{matrix} (3.1245, 5) & (5, 3.1245) \\ 0.5 & 0.5 \end{matrix}}$ is R-optimal due to $ϕ (a, β) < 2$ (see also Figure (b)).

Corollary 4.1

Let $X = [0, \infty)^{p}$ and let the assumptions (A1)–(A4) be satisfied for models with information matrices of the form (Equation1(1) $M (ξ, β) = \int_{X} Q (f (x)^{⊤} β) f (x) f (x)^{⊤} d ξ (x) = \sum_{i = 1}^{m} ω_{i} Q (f (x_{i})^{⊤} β) f (x_{i}) f (x_{i})^{⊤},$ (1) ) and without intercept. Then the following design $ξ^{*}$ is R-optimal for each $β_{i} < 0$ , $i = 1, \dots, p$ , (12) $ξ^{*} = {\begin{matrix} - (ψ^{- 1} (0) / β_{1}) e_{1, p} & \dots & - (ψ^{- 1} (0) / β_{p}) e_{p, p} \\ 1 / p & \dots & 1 / p \end{matrix}},$ (12) where the function $ψ (\cdot)$ is defined as $ψ (x) = x - 2 Q (- x) / Q^{'} (- x)$ for x>0.

Remark 4.1

It is worthwhile mentioning that the design (Equation12(12) $ξ^{*} = {\begin{matrix} - (ψ^{- 1} (0) / β_{1}) e_{1, p} & \dots & - (ψ^{- 1} (0) / β_{p}) e_{p, p} \\ 1 / p & \dots & 1 / p \end{matrix}},$ (12) ) shown in Corollary 4.1 is also D- and A-optimal.

Remark 4.2

For the case of p = 1, the one point design $ξ^{*} = {a; 1}$ that simultaneously satisfies the additional conditions in Theorem 4.1 is saturated R-optimal. However, if the design $ξ^{*} = {a; 1}$ is not R-optimal, the proposed method in Theorem 4.2 can be used to search for a saturated R-optimal design.

4.2. PSO-generated R-optimal designs

Only finding saturated designs may not be enough for determining the R-optimality of a design in the design class Ξ, since the optimal designs depend on the unknown parameters. For example, let $β = (0.5, 0.5)^{⊤}$ for the Poisson regression model discussed in Example 4.1. The R-optimal design (see Figure ) is then given by $ξ^{*} = {\begin{matrix} (1.4321, 5) & (5, 1.4321) & (5, 5) \\ 0.4666 & 0.4666 & 0.0668 \end{matrix}} .$ This means that the number of support points of R-optimal designs for models without intercept may exceed the number of regression parameters. Thereby, the aforementioned method by solving equations is unable to determine an R-optimal design in Ξ and we require an effective algorithm to generate an R-optimal design. Here we employ the PSO algorithm to find optimal designs for the models under consideration and a pseudo code of PSO is described as below.

Figure 3. Plot of the function ϕ in (Equation3(3) $ϕ (x, β) = Q (f (x)^{⊤} β) f (x)^{⊤} M (ξ^{*}, β)^{- 1} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ^{*}, β)^{- 1} e_{j, p}}) M (ξ^{*}, β)^{- 1} f (x) ⩽ p$ (3) ) for the R-optimal design on $X = [0, 5]^{2}$ for $β = (0.5, 0.5)^{⊤}$ in the Poisson regression model without intercept.

Figure 3. Plot of the function ϕ in (Equation3(3) ϕ(x,β)=Q(f(x)⊤β)f(x)⊤M(ξ∗,β)−1(∑j=1pej,pej,p⊤ej,p⊤M(ξ∗,β)−1ej,p)M(ξ∗,β)−1f(x)⩽p(3) ) for the R-optimal design on X=[0,5]2 for β=(0.5,0.5)⊤ in the Poisson regression model without intercept.

In Section 4.2, the support points of each position $ξ_{i}$ must be given by $a - (x_{1} / β_{1}) e_{1, p - 1}, \dots, a - (x_{p} / β_{p}) e_{p, p}$ and $a$ .

The index t is equal to $0, 1, \dots$ , and two criteria can be used to end iteration, achieving a maximum number of iterations or verifying whether the equivalence condition attains a pre-specified threshold.

The notations $v_{i}^{(t)}$ and $ξ_{i}^{(t)}$ are, respectively, the current velocity and position for the ith particle. $θ_{t}$ is the inertia weight that modulates the influence of the former velocity, which can be a constant or a decreasing function with values between 0 and 1. $α_{1}$ and $α_{2}$ are both random variables from $U (0, 1)$ . $γ_{1}$ and $γ_{2}$ are two constants reflecting the cognitive learning level and social learning level, respectively.

Example 4.3

Consider the proportional hazards regression models with three covariates, in which the linear predictor does not include an intercept term. Let $X = [0, 3]^{3}$ and $β = (- 2.5, 0.5, 0.5)^{⊤}$ . In order to assess the effect of the amount of censoring q on R-optimal designs, we adjust the censoring time c to achieve overall censoring probabilities of 20%, 40%, 60% and 80% for the balanced design $ξ_{b}$ . For instance, we choose c = 60 for type I censoring and c = 133 for random censoring when q = 0.4. In this example, the PSO algorithm with 150 particles and 100 iterations is able to find the R-optimal design with the required accuracy, which can be implemented by $R$ software in less than 20 seconds on a standard PC. Table summarizes the numerical results, including R-efficiencies of the balanced design $ξ_{b}$ and the amount of censoring $q^{*}$ under the corresponding R-optimal design. Some of the points are clear from the numerical results.

For the three-covariate case, the support points of R-optimal designs for type I censoring and random censoring exceed the number of parameters.
With increasing amounts of censoring, the R-optimal designs shift the support point on the edge $(x_{1}, 3, 3)^{⊤}$ towards the vertex $a = (0, 3, 3)^{⊤}$ , and the R-efficiency of the balanced design $ξ_{b}$ is reduced gradually.
The overall probability of censoring $q^{*}$ under the R-optimal design is less than for the balanced design $ξ_{b}$ .

Table 4. The locally R-optimal designs on $X = [0, 3]^{3}$ for $β = (- 2.5, 0.5, 0.5)^{⊤}$ in the proportional hazards regression models, the R-efficiencies of the balanced design $ξ_{b}$ and the overall probability of censoring under the R-optimal design.

Display Table

5. Discussion

The present paper investigates the construction of locally R-optimal designs for a large class of nonlinear multiple regression models. For the case of models with intercept, the R-optimal designs on a rectangular design region can be determined by utilizing the similar arguments in Schmidt (Citation2019) but finding its optimal weight is different. We notice that the structure of the R-optimal designs is similar to those criteria reported in Schmidt (Citation2019), especially in terms of the location of support points. For the case of models without intercept, however, with the same method we can determine the saturated R-optimal designs only from two design subclasses addressed in Section 4.1. Moreover, the PSO algorithm has been used to generate non-saturated R-optimal designs for both cases. Some conditions in Theorems 3.1, 4.1 and 4.2 are required, which ensure that the support points are located within the design region. If these conditions are not satisfied, optimal designs may then be having more support points. In addition, a nonlinear system of equations must be solved numerically in order to search for the saturated R-optimal designs. Although the existence of the solution to these equations is not proved theoretically, the numerical exploration shows that the solution in all considered examples always exists.

It is worthwhile mentioning that the locally optimal designs discussed so far are derived for a given value of the model parameter vector $β$ . One might choose such a value of $β$ from an initial guess or estimation when some historical observations can be obtained. It might be of interest to study how the design will be affected by wrongly specified parameters. For illustration, we consider the locally R-optimal designs for the first-order Poisson regression models with an intercept on the design region $X = [0, 5]^{p - 1}$ for p = 2, 3, 4, respectively. In specific, we assume that the true parameter vector is $β = (0, - 1)^{⊤}$ for p = 2, $β = (0, - 1, - 1)^{⊤}$ for p = 3, and $β = (0, - 1, - 1, - 1)^{⊤}$ for p = 4. We generate the locally R-optimal designs for various misspecified values of $β$ , and calculate their R-efficiencies with respect to the locally R-optimal design for the true parameter vector for each p = 2, 3, 4, which are shown in Table . It is observed from Table that the efficiency of the locally R-optimal design for the misspecified value of $β$ decreases as the value of $β$ diverges from its true value, and the loss of R-efficiency is disastrous when the difference between the true value and the misspecified value of $β$ is relatively serious. Numerical results with other examples yield similar conclusions, which are not reported here for the sake of saving space.

Table 5. The R-efficiencies of the locally R-optimal designs on $X = [0, 5]^{p - 1}$ for various misspecified $β$ for the first-order Poisson regression models with intercept.

Display Table

To overcome the parameter dependence of the locally optimal design, a commonly used approach to the computation of locally optimal designs is weighted designs (see, e.g., Atkinson et al., Citation2007, Chap. 18), where a prior distribution for $β$ , which may be either discrete or continuous, is assumed in advance. Another method is the computation of maximin efficient designs, i.e., maximizing the minimal efficiency with respect to the parameters (see Dette, Citation1997; Konstantinou et al., Citation2014).

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

Lei He's work is supported by the National Natural Science Foundation of China [Grant Number 12101013] and the Natural Science Foundation of Anhui Province [Grant Number 2008085QA15]. Rong-Xian Yue's work is supported by the National Natural Science Foundation of China [Grant Numbers 11971318, 11871143].

References

Atkinson, A. C., Donev, A. N., & Tobias, R. D. (2007). Optimum experimental designs, with SAS. Oxford University Press.
Google Scholar
Chernoff, H. (1953). Locally optimal designs for estimating parameters. The Annals of Mathematical Statistics, 24(4), 586–602. https://doi.org/10.1214/aoms/1177728915
Google Scholar
Dette, H. (1997). Designing experiments with respect to ‘standardized’ optimality criteria. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 59(1), 97–110. https://doi.org/10.1111/rssb.1997.59.issue-1
Google Scholar
Fedorov, V. V. (1972). Theory of optimal experiments. Academic Press.
Google Scholar
Freise, F., Gaffke, N., & Schwabe, R. (2021). The adaptive Wynn algorithm in generalized linear models with univariate response. The Annals of Statistics, 49(2), 702–722. https://doi.org/10.1214/20-AOS1974
Web of Science ®Google Scholar
Hao, H., Zhu, X., Zhang, X., & Zhang, C. (2021). R-optimal design of the second-order Scheffé mixture model. Statistics & Probability Letters, 173(C), 109069. https://doi.org/10.1016/j.spl.2021.109069
Google Scholar
Idais, O., & Schwabe, R. (2021). Analytic solutions for locally optimal designs for gamma models having linear predictors without intercept. Metrika, 84(1), 1–26. https://doi.org/10.1007/s00184-019-00760-3
Web of Science ®Google Scholar
Kalish, L. A., & Harrington, D. P. (1988). Efficiency of balanced treatment allocation for survival analysis. Biometrics, 44(3), 815–821. https://doi.org/10.2307/2531593
Web of Science ®Google Scholar
Konstantinou, M., Biedermann, S., & Kimber, A. (2014). Optimal designs for two-parameter nonlinear models with application to survival models. Statistica Sinica, 24(1), 415–428. https://doi.org/10.5705/ss.2011.271
Web of Science ®Google Scholar
Li, K. H., Lau, T. S., & Zhang, C. (2005). A note on D-optimal designs for models with and without an intercept. Statistical Papers, 46(3),451–458. https://doi.org/10.1007/BF02762844
Web of Science ®Google Scholar
Liu, P., Gao, L. L., & Zhou, J. (2022). R-optimal designs for multi-response regression models with multi-factors. Communications in Statistics – Theory and Methods, 51(2), 340–355. https://doi.org/10.1080/03610926.2020.1748655
Web of Science ®Google Scholar
Liu, X., & Yue, R.-X. (2020). Elfving's theorem for R-optimality of experimental designs. Metrika, 83(4), 485–498. https://doi.org/10.1007/s00184-019-00728-3
Web of Science ®Google Scholar
Pukelsheim, F., & Torsney, B. (1993). Optimal weights for experimental designs on linearly independent support points. The Annals of Statistics, 19(3), 1614–1625. https://doi.org/10.2307/2241966
Web of Science ®Google Scholar
Radloff, M., & Schwabe, R. (2019). Locally D-optimal designs for non-linear models on the k-dimensional ball. Journal of Statistical Planning and Inference, 203, 106–116. https://doi.org/10.1016/j.jspi.2019.03.004
Web of Science ®Google Scholar
Russell, K. G., Woods, D. C., Lewis, S. M., & Eccleston, E. C. (2009). D-optimal designs for Poisson regression models. Statistica Sinica, 19(2), 721–730. https://doi.org/10.2307/24308852
Web of Science ®Google Scholar
Schmidt, D. (2019). Characterization of c-, L- and ϕk-optimal designs for a class of non-linear multiple-regression models. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 81(1), 101–120. https://doi.org/10.1111/rssb.12292
Web of Science ®Google Scholar
Schmidt, D., & Schwabe, R. (2015). On optimal designs for censored data. Metrika, 78(3), 237–257. https://doi.org/10.1007/s00184-014-0500-1
Web of Science ®Google Scholar
Schmidt, D., & Schwabe, R. (2017). Optimal design for multiple regression with information driven by the linear predictor. Statistica Sinica, 27(3), 1371–1384. https://doi.org/10.5705/ss.202015.0385
Web of Science ®Google Scholar
Silvey, S. D. (1980). Optimal design. Chapman and Hall.
Google Scholar
Whittle, P. (1973). Some general points in the theory of optimal experimental designs. Journal of the Royal Statistical Society: Series B (Methodological), 35(1), 123–130. https://doi.org/10.1111/j.2517-6161.1973.tb00944.x
Web of Science ®Google Scholar
Yang, M., Biedermann, S., & Tang, E. (2013). On optimal designs for nonlinear models: A general and efficient algorithm. Journal of the American Statistical Association, 108(504), 1411–1420. https://doi.org/10.1080/01621459.2013.806268
Web of Science ®Google Scholar

Appendix

Proof

Proof of Theorem 2.1

Let

f_{β} (x) = \sqrt{Q (f (x)^{⊤} β)} f (x)

. The Fisher information matrix of approximate design ξ given by (Equation1) can then be written as

M (ξ, β) = \int_{X} f_{β} (x) f_{β} (x)^{⊤} d ξ (x) .

For any designs

ξ, \bar{ξ} \in Ξ

and

α \in (0, 1)

, define

ξ_{α} = (1 - α) ξ + α \bar{ξ}

. We have

M (ξ_{α}, β) = (1 - α) M (ξ, β) + α M (\bar{ξ}, β)

and

\begin{aligned} \frac{\partial ψ (ξ_{α}, β)}{\partial α} & = \frac{\partial}{\partial α} [\prod_{j = 1}^{p} e_{j, p}^{⊤} M^{- 1} (ξ_{α}, β) e_{j, p}] \\ = \sum_{j = 1}^{p} (\prod_{\begin{matrix} i = 1 \\ i \neq j \end{matrix}}^{p} e_{i, p}^{⊤} M^{- 1} (ξ_{α}, β) e_{i, p}) \frac{\partial}{\partial α} (e_{j, p}^{⊤} M^{- 1} (ξ_{α}, β) e_{j, p}), \end{aligned}

where

\frac{\partial}{\partial α} (e_{j, p}^{⊤} M^{- 1} (ξ_{α}, β) e_{j, p}) = e_{j, p}^{⊤} [M^{- 1} (ξ_{α}, β) (M (ξ, β) - M (\bar{ξ}, β)) M^{- 1} (ξ_{α}, β)] e_{j, p} .

The directional derivative of ψ at ξ in the direction of

\bar{ξ}

, denoted by

\nabla_{ψ} (ξ, \bar{ξ})

, is then given by

\begin{aligned} \nabla_{ψ} (ξ, \bar{ξ}) & = lim_{α \to 0^{+}} \frac{\partial ψ (ξ_{α}, β)}{\partial α} \\ = \sum_{j = 1}^{p} \frac{ψ (ξ)}{e_{j, p}^{⊤} M^{- 1} (ξ, β) e_{j, p}} e_{j, p}^{⊤} [M^{- 1} (ξ, β) (M (ξ, β) - M (\bar{ξ}, β)) M^{- 1} (ξ, β)] e_{j, p} \\ = ψ (ξ) \sum_{j = 1}^{p} (1 - \frac{e_{j, p}^{⊤} M^{- 1} (ξ, β) M (\bar{ξ}, β) M^{- 1} (ξ, β) e_{j, p}}{e_{j, p}^{⊤} M^{- 1} (ξ, β) e_{j, p}}) \\ = ψ (ξ) (p - t r {M^{- 1} (ξ, β) M (\bar{ξ}, β) M^{- 1} (ξ, β) \sum_{j = 1}^{q} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M^{- 1} (ξ, β) e_{j, p}}}) . \end{aligned}

Note that the directional derivative

\nabla_{ψ} (ξ, \bar{ξ})

is linear in

\bar{ξ}

for any fixed

ξ \in Ξ

, i.e.,

\nabla_{ψ} (ξ, \bar{ξ}) = \int_{X} \nabla_{ψ} (ξ, δ_{x}) d \bar{ξ} (x),

where

δ_{x}

is the Dirac measure at

x

. Following Whittle (Citation1973), the design

ξ^{*} \in Ξ

is R-optimal if and only if

inf_{x \in X} \nabla_{ψ} (ξ, δ_{x}) = 0

. As a consequence the assertion of Theorem 2.1 follows.

Proof

Proof of Theorem 3.2

From Theorem 3.1, R-optimal design on $X = [0, \infty)^{p - 1}$ for $β_{i} < 0$ , $i = 1, \dots, p - 1$ , has the following form (A1) $ξ = {\begin{matrix} - (x_{1} / β_{1}) e_{1, p - 1} & \dots & - (x_{p - 1} / β_{p - 1}) e_{p - 1, p - 1} & 0_{p - 1} \\ ω_{1} & \dots & ω_{p - 1} & ω_{p} \end{matrix}},$ (A1) where $0_{p - 1}$ is a vector of $(p - 1)$ 0's, $x_{i}$ and $ω_{i}$ , $i = 1, \dots, p - 1$ , can be determined by the common system of Equations (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ) and (Equation6(6) $x_{i} - 2 \frac{Q (f (a)^{⊤} β - x_{i})}{Q^{'} (f (a)^{⊤} β - x_{i})} [1 - \sqrt{\frac{Q (f (a)^{⊤} β - x_{i})}{Q (f (a)^{⊤} β)}} \frac{s_{1, i + 1}}{\sqrt{s_{11} s_{i + 1, i + 1}}}] = 0.$ (6) ) for convenience. We denote $Q_{i} = Q (β_{0} - x_{i})$ for $i = 1, \dots, p - 1$ and $Q_{p} = Q (β_{0})$ according to the support points of the design (EquationA1(A1) $ξ = {\begin{matrix} - (x_{1} / β_{1}) e_{1, p - 1} & \dots & - (x_{p - 1} / β_{p - 1}) e_{p - 1, p - 1} & 0_{p - 1} \\ ω_{1} & \dots & ω_{p - 1} & ω_{p} \end{matrix}},$ (A1) ). Employing the previously mentioned decomposition approach and letting $y = (β_{1} / x_{1}, \dots, β_{p - 1} / x_{p - 1})^{⊤}$ and $Λ = d i a g ((Q_{p} ω_{p})^{- 1}, (Q_{1} ω_{1})^{- 1}, \dots, (Q_{p - 1} ω_{p - 1})^{- 1})$ , we can obtain that the inverse of the information matrix of ξ is given by $\begin{aligned} M (ξ, β)^{- 1} & = X^{- 1} Q^{- 1 / 2} D_{ω}^{- 1} Q^{- 1 / 2} (X^{⊤})^{- 1} \\ = (\begin{matrix} 1 & 0_{p - 1}^{⊤} \\ y & - d i a g (y) \end{matrix}) Λ (\begin{matrix} 1 & y^{⊤} \\ 0_{p - 1} & - d i a g (y) \end{matrix}) = (\begin{matrix} \frac{1}{Q_{p} ω_{p}} & \frac{1}{Q_{p} ω_{p}} y^{⊤} \\ \frac{1}{Q_{p} ω_{p}} y & - d i a g (\tilde{y}) \end{matrix}), \end{aligned}$ where $\tilde{y} = (β_{1}^{2} {\tilde{λ}}_{1 p} / x_{1}^{2}, \dots, β_{p - 1}^{2} {\tilde{λ}}_{p - 1, p} / x_{p - 1}^{2})^{⊤}$ with ${\tilde{λ}}_{i p} = (Q_{i} ω_{i})^{- 1} + (Q_{p} ω_{p})^{- 1}$ , $i = 1, \dots, p - 1$ . Then the matrix $S$ defined in (Equation5(5) $S = {({\tilde{X}}^{- 1})}^{⊤} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ, β)^{- 1} e_{j, p}}) {\tilde{X}}^{- 1} .$ (5) ) is given by $S = (\begin{matrix} ω_{p} + \frac{1}{Q_{p}} \sum_{i = 1}^{p - 1} \frac{1}{{\tilde{λ}}_{i p}} & - \frac{1}{\sqrt{Q_{1} Q_{p}} {\tilde{λ}}_{1 p}} & \dots & - \frac{1}{\sqrt{Q_{p - 1} Q_{p}} {\tilde{λ}}_{p - 1, p}} \\ - \frac{1}{\sqrt{Q_{1} Q_{p}} {\tilde{λ}}_{1 p}} & \frac{1}{Q_{1} {\tilde{λ}}_{1 p}} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ - \frac{1}{\sqrt{Q_{p - 1} Q_{p}} {\tilde{λ}}_{p - 1, p}} & 0 & \dots & \frac{1}{Q_{p - 1} {\tilde{λ}}_{p - 1, p}} \end{matrix}),$ which is entirely unrelated to the parameters $(β_{1}, \dots, β_{p - 1})^{⊤}$ . Hence, the solutions of $x_{i}$ and $ω_{i}$ for $i = 1, \dots, p - 1$ do not depend on the parameters $(β_{1}, \dots, β_{p - 1})^{⊤}$ .

In order to prove the following Theorems 4.1 and 4.2, the extended design region with $x_{i} \in (- \infty, v_{i}]$ for $β_{i} > 0$ and $x_{i} \in [u_{i}, \infty)$ for $β_{i} < 0$ will be considered.

Proof

Proof of Theorem 4.1

We will only prove the case $τ = 1$ , and the others can be treated similarly. With the previous decomposition strategy the information matrix of $ξ_{1}^{*}$ can be written as $M (ξ_{1}^{*}, β) = {\tilde{X}}^{⊤} D_{ω} \tilde{X} = X^{⊤} Q^{1 / 2} D_{ω} Q^{1 / 2} X,$ where $X = (\begin{matrix} a_{1} & a_{- 1}^{⊤} \\ a_{1} 1_{p - 1} & 1_{p - 1} a_{- 1}^{⊤} - d i a g (x_{- 1}^{*}) \end{matrix}),$ with $a_{- 1} = (a_{2}, \dots, a_{p})^{⊤}$ , $1_{p - 1} = (1, \dots, 1)^{⊤}$ and $x_{- 1}^{*} = (x_{2}^{*} / β_{2}, \dots, x_{p}^{*} / β_{p})^{⊤}$ . Denote $y_{- 1}^{*} = (β_{2} / x_{2}^{*}, \dots, β_{p} / x_{p}^{*})^{⊤}$ and $z_{- 1}^{*} = (a_{2} β_{2} / x_{2}^{*}, \dots, a_{p} β_{p} / x_{p}^{*})^{⊤}$ , and then we have $X^{- 1} = (\begin{matrix} \frac{1 - z_{- 1}^{* ⊤} 1_{p - 1}}{a_{1}} & \frac{1}{a_{1}} z_{- 1}^{* ⊤} \\ y_{- 1}^{*} & - d i a g (y_{- 1}^{*}) \end{matrix}) .$ It follows that $(X^{⊤})^{- 1} a = e_{1, p}$ and $(X^{⊤})^{- 1} (a - (x_{i}^{*} / β_{i}) e_{i, p}) = e_{i, p}$ , $i = 2, \dots, p$ .

Now letting $ℓ_{i} (x_{i}) = ℓ_{i} (x_{i}, ξ^{*})$ and $h_{i} (x_{i}) = h_{i} (x_{i}, ξ^{*})$ which are defined in (Equation7(7) $h_{i} (x_{i}, ξ^{*}) = ℓ_{i} (x_{i}, ξ^{*}) - \frac{p}{Q (a^{⊤} β + β_{i} (x_{i} - a_{i}))} ⩽ 0$ (7) ) to simplify writing, and using the formulas (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ) and (Equation5(5) $S = {({\tilde{X}}^{- 1})}^{⊤} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ, β)^{- 1} e_{j, p}}) {\tilde{X}}^{- 1} .$ (5) ) we obtain $ℓ_{i} (a_{i}) = e_{1, p}^{⊤} Q^{- 1 / 2} D_{ω}^{- 1} S D_{ω}^{- 1} Q^{- 1 / 2} e_{1, p} = \frac{s_{11}}{(ω_{1}^{*})^{2} Q (a^{⊤} β)} = \frac{p}{Q (a^{⊤} β)} = p_{1} (a_{1}),$ and $ℓ_{i} (a_{i} - x_{i}^{*} / β_{i}) = p / Q (a^{⊤} β - x_{i}^{*})$ for $i = 2, \dots, p$ . Thus we have $h_{1} (a_{1}) = h_{i} (a_{i}) = h_{i} (a_{i} - x_{i}^{*} / β_{i}) = 0$ for $i = 2, \dots, p$ . As in the proof of Theorem 9 in Schmidt (Citation2019), it is easily shown that $h_{i}^{'} (x_{i})$ has at most two roots, $lim_{x_{i} \to \pm \infty} h_{i} (x_{i}) = \pm \infty$ for $β_{i} > 0$ and $lim_{x_{i} \to \pm \infty} h_{i} (x_{i}) = \mp \infty$ for $β_{i} < 0$ . To prove the R-optimality of $ξ^{*}$ , it suffices here to show that $h_{i}^{'} (a_{i} - x_{i}^{*} / β_{i}) = 0$ holds since $h_{1} (x_{1}) < 0$ for $x_{1} \in [u_{1}, v_{1}] ∖ {a_{1}}$ and $h_{i} (a_{i}) = 0$ for all $i = 2, \dots, p$ . We have $ℓ_{i}^{'} (x) = 2 e_{i, p}^{⊤} M (ξ^{*}, β)^{- 1} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ^{*}, β)^{- 1} e_{j, p}}) M (ξ^{*}, β)^{- 1} (a + (x - a_{i}) e_{i, p}) .$ For $i = 2, \dots, p$ , utilizing $e_{i, p}^{⊤} X^{- 1} = (β_{i} / x_{i}^{*}) (1, - e_{i, p - 1}^{⊤})$ we obtain $\begin{aligned} ℓ_{i}^{'} (a_{i} - \frac{x_{i}^{*}}{β_{i}}) & = \frac{2 β_{i}}{x_{i}^{*}} (1, - e_{i, p - 1}^{⊤}) Q^{- 1 / 2} D_{ω}^{- 1} S D_{ω}^{- 1} Q^{- 1 / 2} e_{i, p} \\ = \frac{2 (β_{i} / x_{i}^{*}) s_{1 i}}{ω_{1}^{*} ω_{i}^{*} \sqrt{Q (a^{⊤} β)} \sqrt{Q (a^{⊤} β - x_{i}^{*})}} - \frac{2 (β_{i} / x_{i}^{*}) s_{i i}}{ω_{i}^{* 2} Q (a^{⊤} β - x_{i}^{*})} . \end{aligned}$ Hence $h_{i}^{'} (a_{i} - \frac{x_{i}^{*}}{β_{i}}) = p [\frac{2 (β_{i} / x_{i}^{*})}{\sqrt{Q (a^{⊤} β)} \sqrt{Q (a^{⊤} β - x_{i}^{*})}} \frac{s_{1 i}}{\sqrt{s_{11} s_{i i}}} - \frac{2 (β_{i} / x_{i}^{*})}{Q (a^{⊤} β - x_{i}^{*})} + \frac{β_{i} Q^{'} (a^{⊤} β - x_{i}^{*})}{(Q (a^{⊤} β - x_{i}^{*}))^{2}}] .$ The equations $h_{i}^{'} (a_{i} - x_{i}^{*} / β_{i}) = 0$ are equivalent to the equations $x_{i}^{*} - 2 \frac{Q (a^{⊤} β - x_{i}^{*})}{Q^{'} (a^{⊤} β - x_{i}^{*})} [1 - \sqrt{\frac{Q (a^{⊤} β - x_{i}^{*})}{Q (a^{⊤} β)}} \frac{s_{1 i}}{\sqrt{s_{11} s_{i i}}}] = 0$ for $i = 2, \dots, p$ . If the $x_{i}^{*}$ and $ω_{i}^{*}$ are the solutions of this system of equations combined with Equation (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ), then the design $ξ_{1}^{*}$ is R-optimal.

Proof

Proof of Theorem 4.2

With the same decomposition method we have $M (ξ^{*}, β) = {\tilde{X}}^{⊤} D_{ω} \tilde{X} = X^{⊤} Q^{1 / 2} D_{ω} Q^{1 / 2} X,$ where $X = 1_{p} a^{⊤} - d i a g (x^{*})$ and $x^{*} = (x_{1}^{*} / β_{1}, \dots, x_{p}^{*} / β_{p})^{⊤}$ . Let $y^{*} = (β_{1} / x_{1}^{*}, \dots, β_{p} / x_{p}^{*})^{⊤}$ and $z^{*} = (a_{1} β_{1} / x_{1}^{*}, \dots, a_{p} β_{p} / x_{p}^{*})^{⊤}$ , and then we get $X^{- 1} = - d i a g (y^{*}) - \frac{y^{*} z^{* ⊤}}{1 - z^{* ⊤} 1_{p}}$ and $(X^{⊤})^{- 1} (a - (x_{i}^{*} / β_{i}) e_{i, p}) = e_{i, p}$ . We still let $ℓ_{i} (x_{i}) = ℓ_{i} (x_{i}, ξ^{*})$ and $h_{i} (x_{i}) = h_{i} (x_{i}, ξ^{*})$ to simplify writing. It follows from the formulas (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ) and (Equation5(5) $S = {({\tilde{X}}^{- 1})}^{⊤} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ, β)^{- 1} e_{j, p}}) {\tilde{X}}^{- 1} .$ (5) ) that $ℓ_{i} (a_{i} - x_{i}^{*} / β_{i}) = e_{i, p}^{⊤} Q^{- 1 / 2} D_{ω}^{- 1} S D_{ω}^{- 1} Q^{- 1 / 2} e_{i, p} = \frac{s_{i, i}}{(ω_{i}^{*})^{2} Q (a^{⊤} β - x_{i}^{*})} = \frac{p}{Q (a^{⊤} β - x_{i}^{*})} .$ Thus we have $h_{i} (a_{i} - x_{i}^{*} / β_{i}) = 0$ for $i = 1, \dots, p$ . Due to the fact that $h_{i}^{'} (x_{i})$ has at most two roots, $lim_{x_{i} \to \pm \infty} h_{i} (x_{i}) = \pm \infty$ for $β_{i} > 0$ and $lim_{x_{i} \to \pm \infty} h_{i} (x_{i}) = \mp \infty$ for $β_{i} < 0$ . To prove the R-optimality of $ξ^{*}$ , it suffices here to show that $h_{i}^{'} (a_{i} - x_{i}^{*} / β_{i}) = 0$ holds since $h_{i} (a_{i}) < 0$ for all $i = 1, \dots, p$ . We have $ℓ_{i}^{'} (x) = 2 e_{i, p} M (ξ^{*}, β)^{- 1} (\sum_{j = 1}^{p} \frac{e_{j, p} e_{j, p}^{⊤}}{e_{j, p}^{⊤} M (ξ^{*}, β)^{- 1} e_{j, p}}) M (ξ^{*}, β)^{- 1} (a + (x - a_{i}) e_{i, p}) .$ With $e_{i, p}^{⊤} X^{- 1} = - (β_{i} / x_{i}^{*}) (e_{i, p}^{⊤} + z^{* ⊤} / (1 - z^{* ⊤} 1_{p}))$ we have $\begin{aligned} ℓ_{i}^{'} (a_{i} - \frac{x_{i}^{*}}{β_{i}}) & = - \frac{2 β_{i}}{x_{i}^{*}} (e_{i, p}^{⊤} + \frac{z^{* ⊤}}{1 - z^{* ⊤} 1_{p}}) Q^{- 1 / 2} D_{ω}^{- 1} S D_{ω}^{- 1} Q^{- 1 / 2} e_{i, p} \\ = - \frac{2 (β_{i} / x_{i}^{*}) s_{i i}}{ω_{i}^{* 2} Q (a^{⊤} β - x_{i}^{*})} - \frac{2 (β_{i} / x_{i}^{*})}{(1 - z^{* T} 1_{p}) \sqrt{Q (a^{⊤} β - x_{i}^{*})}} \sum_{j = 1}^{p} \frac{z_{j} s_{i j}}{ω_{i}^{*} ω_{j}^{*} \sqrt{Q (a^{⊤} β - x_{j}^{*})}} . \end{aligned}$ Hence $\begin{aligned} h_{i}^{'} (a_{i} - \frac{x_{i}^{*}}{β_{i}}) & = p [- \frac{2 (β_{i} / x_{i}^{*})}{Q (a^{⊤} β - x_{i}^{*})} - \frac{2 (β_{i} / x_{i}^{*})}{(1 - z^{* T} 1_{p}) \sqrt{Q (a^{⊤} β - x_{i}^{*})}} \sum_{j = 1}^{p} \frac{z_{j} s_{i j} / \sqrt{s_{i i} s_{j j}}}{\sqrt{Q (a^{⊤} β - x_{j}^{*})}} \\ + \frac{β_{i} Q^{'} (a^{⊤} β - x_{i}^{*})}{(Q (a^{⊤} β - x_{i}^{*}))^{2}}] . \end{aligned}$ The equations $h_{i}^{'} (a_{i} - x_{i}^{*} / β_{i}) = 0$ ( $i = 1, \dots, p$ ) are equivalent to $x_{i}^{*} - 2 \frac{Q (a^{⊤} β - x_{i}^{*})}{Q^{'} (a^{⊤} β - x_{i}^{*})} [\frac{1}{1 - z^{* ⊤} 1_{p}} \sum_{j = 1}^{p} \frac{z_{j} \sqrt{Q (a^{⊤} β - x_{i}^{*})}}{\sqrt{Q (a^{⊤} β - x_{j}^{*})}} \frac{s_{i j}}{\sqrt{s_{i i} s_{j j}}} + 1] = 0, i = 1, \dots, p .$ If the $x_{i}^{*}$ and $ω_{i}^{*}$ are the solutions of this system of equations combined with Equation (Equation4(4) $ω_{j}^{*} = \sqrt{\frac{s_{j j}}{p}}, j = 1, \dots, p,$ (4) ), then the design $ξ^{*}$ is R-optimal.

Locally R-optimal designs for a class of nonlinear multiple regression models

Abstract

1. Introduction

2. Model specification and R-optimality criterion

Pukelsheim & Torsney, Citation1993

3. R-optimal designs for models with intercept

Table 1. The simulation results obtained from the locally R-, D-optimal designs and the balanced design on $X = [0, 5]$ for the first-order Poisson regression model with intercept.

Table 2. Comparison of R-optimal design with D- and A-optimal designs on $X = [0, 5]^{2}$ for the Poisson regression model discussed in Example 3.2.

Table 3. R-optimal designs on $X = [0, 3]^{2}$ for proportional hazards regression models for different $β$ and R-efficiencies of the balanced design $ξ_{b}$ .

4. R-optimal designs for models without intercept

4.1. Some theoretical results on saturated designs

4.2. PSO-generated R-optimal designs

Table 4. The locally R-optimal designs on $X = [0, 3]^{3}$ for $β = (- 2.5, 0.5, 0.5)^{⊤}$ in the proportional hazards regression models, the R-efficiencies of the balanced design $ξ_{b}$ and the overall probability of censoring under the R-optimal design.

5. Discussion

Table 5. The R-efficiencies of the locally R-optimal designs on $X = [0, 5]^{p - 1}$ for various misspecified $β$ for the first-order Poisson regression models with intercept.

Disclosure statement

References

Appendix

Proof of Theorem 2.1

Proof of Theorem 3.2

Proof of Theorem 4.1

Proof of Theorem 4.2

Information for

Open access

Opportunities

Help and information

Locally R-optimal designs for a class of nonlinear multiple regression models

Abstract

1. Introduction

2. Model specification and R-optimality criterion

Pukelsheim & Torsney, Citation1993

3. R-optimal designs for models with intercept

Table 1. The simulation results obtained from the locally R-, D-optimal designs and the balanced design on X=[0,5] for the first-order Poisson regression model with intercept.

Table 2. Comparison of R-optimal design with D- and A-optimal designs on X=[0,5]2 for the Poisson regression model discussed in Example 3.2.

Table 3. R-optimal designs on X=[0,3]2 for proportional hazards regression models for different β and R-efficiencies of the balanced design ξb.

4. R-optimal designs for models without intercept

4.1. Some theoretical results on saturated designs

4.2. PSO-generated R-optimal designs

Table 4. The locally R-optimal designs on X=[0,3]3 for β=(−2.5,0.5,0.5)⊤ in the proportional hazards regression models, the R-efficiencies of the balanced design ξb and the overall probability of censoring under the R-optimal design.

5. Discussion

Table 5. The R-efficiencies of the locally R-optimal designs on X=[0,5]p−1 for various misspecified β for the first-order Poisson regression models with intercept.

Disclosure statement

Additional information

Funding

References

Appendix

Proof of Theorem 2.1

Proof of Theorem 3.2

Proof of Theorem 4.1

Proof of Theorem 4.2

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

Table 1. The simulation results obtained from the locally R-, D-optimal designs and the balanced design on $X = [0, 5]$ for the first-order Poisson regression model with intercept.

Table 2. Comparison of R-optimal design with D- and A-optimal designs on $X = [0, 5]^{2}$ for the Poisson regression model discussed in Example 3.2.

Table 3. R-optimal designs on $X = [0, 3]^{2}$ for proportional hazards regression models for different $β$ and R-efficiencies of the balanced design $ξ_{b}$ .

Table 4. The locally R-optimal designs on $X = [0, 3]^{3}$ for $β = (- 2.5, 0.5, 0.5)^{⊤}$ in the proportional hazards regression models, the R-efficiencies of the balanced design $ξ_{b}$ and the overall probability of censoring under the R-optimal design.

Table 5. The R-efficiencies of the locally R-optimal designs on $X = [0, 5]^{p - 1}$ for various misspecified $β$ for the first-order Poisson regression models with intercept.