1,578
Views
0
CrossRef citations to date
0
Altmetric
Articles

A three-parameter logistic regression model

, & ORCID Icon
Pages 265-274 | Received 11 Feb 2020, Accepted 18 Jun 2020, Published online: 24 Jul 2020

Abstract

Dose–response experiments and data analyses are often carried out according to an optimal design under a model assumption. A two-parameter logistic model is often used because of its nice mathematical properties and plausible stochastic response mechanisms. There is an extensive literature on its optimal designs and data analysis strategies. However, a model is at best a good approximation in a real-world application, and researchers must be aware of the risk of model mis-specification. In this paper, we investigate the effectiveness of the sequential ED-design, the D-optimal design, and the up-and-down design under the three-parameter logistic regression model, and we develop a numerical method for the parameter estimation. Simulations show that the combination of the proposed model and the data analysis strategy performs well. When the logistic model is correct, this more complex model has hardly any efficiency loss. The three-parameter logistic model works better than the two-parameter logistic model in the presence of model mis-specification.

1. Introduction

Dose–response experiments are routinely conducted to study the relationship between the dose of a stimulus and the response of the experiment subjects. Simple and easy-to-interpret models are preferred for the dose–response relationship. The goal of the experiment is often to accurately determine the median dose level: the level of stimulant at which half the recipients respond. Other effective dose levels are also of interest.

There has been much discussion of optimal designs where the dose–response relationship is completely known (Ford et al., Citation1985; Sitter & Fainaru, Citation1997; Sitter & Forbes, Citation1997; Sitter & Wu, Citation1993; Wu, Citation1985). In applications, one may first run a pilot study to obtain an estimate of the dose–response relationship. The optimal design based on the fitted model is then used to guide further experiments (P. Li & Wiens, Citation2011; Wang et al., Citation2013; Wu & Tian, Citation2014). A full sequential approach can also be used: the parameter estimates are updated after each run, and the result is used to suggest dose levels for the subsequent runs.

Naturally, if the model is mis-specified, the optimal design is misguided and the effective dose levels can be poorly estimated. One way to reduce the risk of model mis-specification is to apply a more flexible and hence more complex dose–response model. We seek a trade-off between model flexibility and inference efficacy. A nonparametric model is the most flexible and free from the risk of model mis-specification. However, it likely needs more experiment runs to achieve the same estimation precision as an analysis under approximately valid parametric assumptions. The commonly used logistic or probit models are simple and have good mathematical and statistical properties. They are satisfactory in many applications, but their model assumptions impose restrictions on the dose–response relationship. A slightly more complex model can lower the risk of mis-specification without complicating the issues related to optimal design and data analysis (El-Saidi, Citation1993; G. Li & Majumdar, Citation2008; O'Brien et al., Citation2009).

In this paper, we study the design and analysis problems for a three-parameter logistic regression model. We develop an easy-to-implement iterative numerical algorithm with guaranteed convergence for the maximum likelihood estimate (MLE). We investigate the effectiveness of the sequential ED-design (Yu et al., Citation2016) and the D-optimal design. The ED-design optimises the estimation of targeted effective dose-levels while the D-optimal design targets model parameters. See Sections 3.3 and 3.2. We use the vertex direction method (VDM) to find the D-optimal design.

Simulation studies show that the combination of the proposed model, design, and data analysis strategy performs well. When the logistic model is correct, the three-parameter model has little efficiency loss. When the three-parameter model holds but the logistic model is violated, the new approach is more efficient.

The paper is organised as follows. In Section 2, we introduce the three-parameter logistic model and a numerical algorithm for the MLE. Design problems under this model are discussed in Section 3. In Section 4, we present some simulation results and an example. Section 5 provides concluding remarks.

2. Three-parameter logistic model and the MLE

Consider a dose–response experiment in which a stimulant at level X is applied to a subject. The subject either responds (Y = 1) or does not respond (Y = 0) to the stimulus. The dose–response relationship is defined to be π(x)=P(Y=1|X=x). In most applications, we envisage π(x) as a smooth increasing function of x. Based on data collected on the response values, y1,,yn, for selected dose levels, x1,,xn, scientists may be interested in the accurate estimation of a specific dose level x=ed50 at which π(x)=0.5. We use edγ to denote the effective dose level at which the probability of response is γ%. One may also focus on the shape of π(x) for x between, for example, ed25 and ed75.

If one is interested in a single ED level, effective statistical inference is possible without a parametric assumption on π(x). If a detailed dose–response relationship is desired over a dose range, then a suggested parametric form π(x) is likely necessary and helpful. A commonly employed model for π(x) is the logistic dose–response relationship: (1) logit{π(x)}=logπ(x)1π(x)=α+βx(1) for parameters α and β. In applications, the dose level may be log-transformed before the logistic or other model is applied (G. Li & Majumdar, Citation2008; O'Brien et al., Citation2009). In addition to its mathematical convenience, the logistic regression model permits the interpretation of the size of β. For example, in epidemiology, when Y stands for catching a disease and x for the exposure level, the value of β is the log-odds ratio of catching the disease when the exposure level is increased by a unit.

Statisticians and scientists are keenly aware that the logistic and other two-parameter models can be poor approximations of the true dose–response relationship. A more flexible model can be advantageous if it is not too complex. One such choice is a three-parameter logistic dose–response model (El-Saidi, Citation1993): (2) logit{πλ(x)}=logπλ(x)1πλ(x)=α+βx.(2) We require λ>0 to ensure that πλ(x) is between 0 and 1, and we do not place restrictions on α and β.

We note that when λ=1, the three-parameter model becomes the commonly used logistic model. In this case, for any γ(0,100), the model satisfies edγ+ed(100γ)=2αβ assuming β0. Such a restriction is hard to justify in applications. The introduction of the parameter λ helps to soften this restriction without overburdening the system. Under this model, the effective dose level at γ is given by edγ=logit((γ/100)λ)αβ. An explicit expression for the dose–response relationship is π(x)=P{Y=1|X=x}=exp(α+βx)1+exp(α+βx)1/λ. As discussed in Yu et al. (Citation2016), many sequential designs, including the ED-design, contain a step to update the estimate of the model parameters. The MLE is a common choice.

2.1. Numerical method for MLE

Let (xi,yi):i=1,,n be observations from a dose–response experiment and assume model (Equation2). The log-likelihood based on this data set is given by n(θ)=i=1n{yilog(π(xi))+(1yi)log(1π(xi))} where θ=(α,β,λ)T.

When λ=1 is fixed, the model becomes the usual logistic model, and n(θ) is known to be concave in α and β. The concavity permits a simple numerical solution to the MLE of α and β. We remark that when the xi's corresponding to y = 1 are completely separated from those corresponding to y = 0, the maximum point βˆ=±. This problem is easily avoided by adding informative pseudo-observations, as in Yu et al. (Citation2016), and this is done implicitly in our calculations.

Given any value of λ, the log-likelihood is concave in α and β. Given any α and β, the log-likelihood is concave in λ. Because of these properties, the following two-loop iterative numerical algorithm works nicely. We start with the initial value λ(0)=1 and set k = 0. Let ε be a small positive value such as 105.

  1. Let n(k)(α,β)=n(α,β,λ(k)). Use an iterative algorithm to solve (α(k+1),β(k+1))=argmaxα,βn(k)(α,β).

  2. Let ai=exp(α(k+1)+β(k+1)xi)1+exp(α(k+1)+β(k+1)xi) and define n(k)(λ)=i=1n{(1yi)log(1aiλ)+yilog(aiλ)}. Use an iterative algorithm to solve λ(k+1)=argmaxλn(k)(λ). Write θ(k)=(α(k),β(k),λ(k))τ. If n(θ(k+1))n(θ(k))ϵ, stop and report θ(k+1) and n(θ(k+1)). Otherwise, set k = k + 1 and return to Step (1).

In the above presentation, we have used n(k)(α,β) and n(k)(λ) as two different functions. The objective functions in both loops are concave, and they thus guarantee the convergence of any sensible iterative procedures that we may use in these two steps, and hence of the entire algorithm. We state the concave conclusions in two lemmas.

Lemma

Function n(k)(λ) in Step (2) is concave in λ given any data set (xi,yi) for i=1,2,,n with n1.

Proof.

To prove the concavity, it suffices to show that the second derivative of this function is always nonnegative. Straightforward algebra shows that n(k)(λ)λ=i=1n(yiaiλ)(logai)1aiλ and subsequently 2n(k)(λ)λ2=i=1naiλ(yi1)(logai)2(1aiλ)20 since yi1 for all i. Therefore, the function is concave as claimed.

Lemma

Given any data set (xi,yi) for i=1,2,,n with n1, the objective function n(k)(α,β) in Step (1) is concave in α,β given any λ>0.

Proof.

For notational simplicity, we will drop the superscript k and subscript n from n(k)(α,β) and denote it simply as (α,β) in this proof. We start with the case where n = 1, so we also drop the summation and the subindex i. To prove this result, it suffices to show that the Hessian matrix H=2α22αβ2αβ2β2 is positive definite. We have α=yπ(x)(1y)1π(x)π(x)α=(yπ(x))(1πλ)λ(1π(x)) and 2α2=(y1)(1πλ(x))λ(1π(x))2λπλ1(yπ(x))1π(x)π(x)α=1λ2π(x)(1πλ(x))(y1)(1πλ(x))(1π(x))2λπλ1(yπ(x))1π(x). We now show that the above second derivative is less than or equal to 0. Note that the first factor in 2/α2 is nonnegative, so we need to determine only the sign of the second factor. We consider the cases y = 1 and y = 0 separately.

  1. When y = 1, the first term in the second factor vanishes, and the second term is clearly less than or equal to 0.

  2. When y = 0, the second factor becomes λπλ(1π(x))(1πλ(x))(1πλ(x))2.

Denote its numerator as f(λ), with derivative f(λ)=πλ(1π(x))+λπλ(1π(x))logπ(x)+πλ(x)logπ(x)=πλ(x){1π(x)+(1+λλπ(x))logπ(x)}πλ(x){logπ(x)+(1+λλπ(x))logπ(x)}=λπλ(x)(1π(x))logπ(x)0, where we have used the inequality 1π(x)logπ(x). Combining this with the fact that f(0)=0, we find f(λ)0 for all λ0. This further implies 2/α20 when y = 0.

Combining (a) and (b), since y is either 0 or 1, we conclude that for all λ>0, 2α20. To complete the proof, we note that 2αβ=x2α2;2β2=x22α2. Therefore, in the sense of nonnegative definiteness, we find H=2α22αβ2αβ2β2=2α21xxx20. When the design contains n dose levels, the Hessian matrix is the sum of n nonnegative definite matrices. Hence, it remains nonnegative definite. This completes the proof.

By these two lemmas, n(θ(k)) is an increasing sequence in k with upper bound 0. Hence, n(θ(k)) has a finite limit as k. The corresponding θ(k) likely converges to at least a local maximum point. A rigorous discussion of the global maximum can be tedious and distracting; we do not pursue the issue in this paper.

3. Designs for the three-parameter logistic model

The choice of a new model leads to routine derivations rather than new design issues. All the optimality criteria may be applied to the three-parameter logistic model (Equation2), and we discuss some specific issues below.

3.1. Up-and-down design

The up-and-down design and its variations do not require a parametric model on the dose–response relationship π(x). This design is used for the purpose of accurately estimating a specific effective dose level edγ such as ed50. It requires the user to choose beforehand a grid of dose levels Ω={x1,,xK} for some K based on prior information on π(x) such that x1<edγ<xK.

The experiment starts by assigning a stimulus at level xj in Ω to the subject. If the subject responds, the level is decreased to xj1, and otherwise it is increased to xj+1. Special rules apply if xj is on the boundary of Ω. Variations may include staying at xj with a specific positive probability related to the target edγ, and a nonparametric estimate of edγ may be used. Our experience shows that such estimators are not efficient. In the simulations, we use the MLE under the assumed model and estimate edγ even though the data are generated according to an up-and-down design.

3.2. D-optimal design and the vertex direction method

The variance–covariance matrix of the MLE of the parameter θ is well approximated by In1(θ) when the number of runs n is large, where In(θ) is the Fisher information. A D-optimal design Ψ maximises the determinant of In(θ). Under the (two-parameter) logistic response model, Ψ is a uniform distribution on ed17.6 and ed82.4; and under the probit model, Ψ is a uniform distribution on ed12.8 and ed87.2 (Sitter & Wu, Citation1993).

As far as we are aware, there have been no direct results on the D-optimal design for the three-parameter logistic model. We can apply a numerical method called VDM (Fedorov, Citation1972; Wu, Citation1978; Wynn, Citation1972). We implemented this method via an R function to obtain approximate D-optimal designs for the simulation studies. Some particulars are as follows.

A nonsequential design is composed of a set of dose levels x1,,xk and the numbers of units applied at these levels m1,,mk. Mathematically, a design is a distribution Ψ on X with probability mass function ψ(xj)=mj/n where n is the total number of units in the experiment. Let the Fisher information of the observations obtained under Ψ be I(Ψ), and let δx be the design assigning dose level x to all the units. We use I(x) for I(δx). It can be seen that I(Ψ)=XI(x)dΨ(x). The popular D-optimal design is defined to be Ψ=argmaxΨ{log[det(I(Ψ))]}. Define the directional derivative D(Ψ;x)=limϵ0+{log[det(I((1ϵ)Ψ)+ϵδx)]log[detI(Ψ)]}ϵ1. For the three-parameter logistic regression model (Equation2), D(Ψ;x)=tr(I1(Ψ)I(x))3, where the constant 3 is the dimension of I. It is known that Ψ is the D-optimal design if and only if D(Ψ;x)0 for any x. Starting from an initial design Ψ=Ψ(0), VDM searches for x=argmaxD(Ψ;x) and ϵ=argmaxlog[det(I((1ϵ)Ψ+ϵδx)}. It updates Ψ(k) via Ψ(k+1)=(1ϵ)Ψ(k)+ϵδx,k=0,1,2, until the determinant of the Fisher information stops increasing.

Under the three-parameter logistic model, the optimal design depends on the λ values but not the (α,β) values. We implement VDM with an R function for the three-parameter logistic model (Equation2). The D-optimal designs for a number of λ values are given in Table . Up to potential round-off error, all the D-optimal designs are found to be uniform distributions on three dose levels. In the future, we hope to show that this is always true.

Table 1. D-optimal design for three-parameter logistic models.

3.3. ED-design

The D-optimal design and many other optimal designs focus on the precision of the parameter estimation under the assumed model. The form of the parameter under consideration is generally that permitting the most convenient analytical presentation of the dose–response model. Under the three-parameter logistic regression model, for instance, one naturally takes θ=(α,β,λ) as the target parameter. In applications, we are more interested in precisely estimating ed levels. Hence, the ED-design to be introduced is more relevant.

Let ξ=g(θ) for some smooth function g with gradient function g(θ). The variance of its MLE is approximately (3) {gT(θ)}{I1(θ)}{g(θ)}.(3) The ED-design aims to minimise (4) j=1m{gjT(θ)}{I1(θ)}{gj(θ)}(4) given m selected ed levels ξj=gj(θ) among all possible designs Ψ. Clearly, the solution depends on the value of the unknown parameter θ. One may avoid this difficulty with a sequential design.

Suppose the experiment has been carried out at dose levels x1,,xi with response values y1,,yi. Let θˆi be the intermediate MLE of θ based on the data obtained from these i trials. Let Ii(θˆi;+x) be the Fisher information based on the first i trials and the potential (i+1)th trial to be run at dose level x. We choose the next dose level x that minimises (5) v(i;x)=j=1m{gjT(θˆi)}{Ii1(θˆi;+x)}{gj(θˆi)}.(5) This rule is applied until a sufficient number of trials is obtained. The outcome is a sequential ED-design.

The sequential ED-design needs an initial set of trials and the corresponding θˆ. We recommend and use a uniform initial design on a set of doses Ω={x1,,xK} for K = 7, with x1 and xK being equally spaced grids between the perceived ed01 and ed99.

4. Simulation studies

We conduct simulation studies to explore several issues related to the use of the three-parameter logistic regression model (Equation2) for the dose–response experiment. We use N = 1000 repetitions for all model/design combinations. The sample sizes are chosen to be n = 30, 60, and 120. We choose three effective dose levels each time as the estimation targets and obtain their MLEs. For each model/design setting, we compute the RMSE of a single ed level as RMSE(ξˆj)=N1r=1N(ξˆrjξj)2, where ξˆrj is the estimate of the ED-level ξj in the rth repetition. The total RMSE is computed as RMSE=j=13RMSE2(ξˆj). Three designs are included in the simulation. One is the up-and-down design whose implementation does not depend on the model, but a specific target ed level will be indicated in the summary of the results. We choose a set of doses Ω={x1,,x7}, with x1 and x7 being equally spaced grids between the anticipated ed01 and ed99. We simulate on the D-optimal design and the sequential ED-design, assuming the relevant knowledge of the dose–response model as discussed in the last section. For the sequential ED-design, we use a uniform initial design on Ω as specified for the up-and-down design.

The simulations answer several questions related to the combination of the ED-design and the three-parameter model for the dose–response experiment. The first question concerns the performance of the ED-design. Does it have any advantages over other designs under a three-parameter model? The results indicate that the ED-design works well. The second question concerns whether or not the three-parameter model is necessary. If the true relationship is the three-parameter rather than the two-parameter logistic model, then using the correct model is expected to be helpful. The results show that for ed levels over a local region, it is important to apply the correct three-parameter model. Finally, if the true dose–response relationship is a two-parameter logistic model, how much efficiency do we lose by using a more complex three-parameter model? The results show that the loss is limited.

4.1. Three-parameter model both true and assumed

We generate data according to the three-parameter logistic regression model (Equation2) with α=6.265, β=0.055, λ=0.5, and α=14.148, β=0.1, λ=2 in two separate simulations. We target three sets of dose levels: (a) ed25, ed50, ed75; (b) ed10, ed25, ed40; and (c) ed60, ed75, ed90. The corresponding dose–response curves are depicted in Figure .

Figure 1. Dose–response curves in the simulation.

Figure 1. Dose–response curves in the simulation.

The up-and-down design requires a single target ed level. In the simulation, we always take the middle level as its target. The ed01 and ed99 values under λ=0.5 are (74, 211); the ed01 and ed99 values under λ=2 are (49, 180). These values are used to determine the first-stage design Ω. The results are reported in Table .

Table 2. RMSEs when fitting data from three-parameter logistic models under a number of designs.

We observe that the RMSEs under each design decrease as n increases. Their sizes are not dramatically different, but those of the D-optimal design are higher. The sequential ED-design has the best overall performance. The up-and-down design gives the lowest RMSEs for a single ed level. This is expected because D-optimality aims for precise estimation of θ, not the ed levels, and the up-and-down design was not intended for the estimation of ed levels under a parametric model. Nevertheless, it is nice to find that the sequential ED-design works well.

4.2. Applying a three-parameter logistic model when two parameters suffice

When the two-parameter logistic model is appropriate but a three-parameter model is assumed, the results are likely suboptimal. In this section, we examine the degree of efficiency loss. We generate dose–response data from the two-parameter logistic regression model and analyse the data under both two-parameter and three-parameter models. We consider only the D-optimal design and the sequential ED-design. The up-and-down design is not included because it does not depend on the model assumption, although the data analysis could be performed under some model assumption.

In this simulation, we generate data from the two-parameter logistic model: logit(π(x))=6.265+0.055x. The results are presented in Table . The first two columns are obtained under the correct two-parameter model assumption. The D-optimal design in this case is a uniform distribution on ed17.6 and ed82.4, which is well known but not used in applications. The remaining columns are obtained under the three-parameter model, which is also correct but more complex than necessary.

Table 3. RMSEs when fitting three-parameter model to data from two-parameter model.

These results show that the ED-design has advantages over the D-optimal design: the simulated RMSEs under the former are always lower than those under the latter. The efficiency gain can be as much as 40%.

In addition, the use of the more complex three-parameter model does not significantly reduce the efficiency. When we target ed10, ed25, and ed40, the total RMSE increases from 17.93 to 18.36 when the sample size n = 30. This loss is below 2.5%. The worst case is when n = 120: the efficiency loss is 5.6%.

In comparison, the efficiency of the D-optimal design can be strongly affected. When we target ed60, ed75, and ed90 and n = 120, the efficiency loss is as high as 18%. When n = 30, the use of the more complex model makes the D-optimal design more efficient. This may be because that the initial design takes a large proportion of the number of trials.

Overall, if the ED-design is used, the use of a three-parameter model does not greatly affect the efficiency in the estimation of the ed levels.

4.3. Effects under model mis-specification

In this section, we investigate the effect of two kinds of model mis-specification: (a) the dose–response relationship satisfies the three-parameter logistic regression model with λ1, and (b) the relationship is not a three-parameter logistic model.

In both situations, we compute the RMSEs of the ed estimates in two settings: the design and analysis are done under (i) the three-parameter model assumption, and (ii) the usual two-parameter model assumption. We examine the accuracy of the design and analysis in both cases.

For (a), we generate data from two three-parameter models: (6) logit(πλ(x))=6.265+0.055x(6) with λ=0.5, and (7) logit(πλ(x))=14.148+0.1x(7) with λ=2. Under model (Equation6), ed25 = 114, ed50 = 130, and ed75 = 148; and under model (Equation7), ed25 = 114, ed50 = 130, and ed75 = 144. The results are presented in Table . The first two columns are obtained under the two-parameter assumption. The D-optimal design in this case is a uniform distribution on ed17.6 and ed82.4. The remaining columns are obtained under the correct three-parameter assumption.

Table 4. RMSEs when fitting three-parameter model to data from three-parameter model.

We observe that the ED-design is noticeably superior to the D-optimal design in both situations: the simulated RMSEs under the former are always lower than those under the latter. The efficiency gain can be as much as 30%.

In addition, the use of the three-parameter model when the data are generated under that model significantly increases efficiency. When λ=0.5 and we target ed25, ed50, and ed75, the total RMSE decreases from 15.57 to 14.02 when n = 30, a gain of 11%. Overall, the use of the three-parameter model makes the ED-design more efficient.

For (b), we generate data from the three-parameter probit model: (8) π(x)=Φ1/λ(6.265+0.055x)(8) with λ=0.5, and (9) π(x)=Φ1/λ(14.148+0.1x)(9) with λ=2. Under this model, ed25 = 114, ed50 = 124, and ed75 = 134 when λ=0.5; and ed25 = 86, ed50 = 102, and ed75 = 117 when λ=2. The results are presented in Table . The first two columns are obtained under the two-parameter assumption. The D-optimal design in this case is a uniform distribution on ed12.8 and ed87.2. The remaining columns are obtained under the three-parameter assumption. Both model assumptions are incorrect, but is it better to use the three-parameter logistic model with the ED-design?

Table 5. RMSEs when fitting three-parameter model to data from probit model.

Clearly, the ED-design is noticeably superior to the D-optimal design in both situations: the simulated RMSEs under the former are always lower than those under the latter. When λ=2 and we target ed10, ed25, and ed40, the total RMSE decreases from 12.89 to 8.39 when n = 30, a gain of 54%.

In addition, the use of the more complex model when the data are generated under the three-parameter probit model noticeably increases the efficiency. When we target ed60, ed75, and ed90 with λ=2 the total RMSE decreases from 8.29 to 7.12 when n = 30, a gain of 16%. Overall, combining the more complex model with the ED-design leads to efficiency gains when the model is mis-specified.

4.4. Example

Brown (Citation1982) assumed a three-parameter logistic response model for the relationship between the wheezing symptom and the age of British coal miners. The number of subjects examined and the number with the symptom can be found in their paper.

We first fit the observed data using model (Equation2). The MLEs of the model parameters are αˆ=1.798, βˆ=0.044, and λˆ=0.400. We then fit the observed data using a simple logistic model. The MLEs of the model parameters are αˆ=4.225 and βˆ=0.065. For comparison, Figure  shows the observed data and the two fitted curves.

Figure 2. Observed data and fitted curves for British coal miners.

Figure 2. Observed data and fitted curves for British coal miners.

Based on fitted two-parameter and three-parameter models, the age at which 25% coal miners will develop the symptoms is 47.7 and 48.1, respectively. Both match the real data closely. The three-parameter model predicts the ages at which 50% and 75% of miners develop the symptoms are 66.8 and 88.7, respectively. In comparison, these figures are 65.0 and 81.9 based on the two-parameter model. While two models give very different numbers, predicting the proportion based on either model is not reliable at age 80 due to extreme extrapolation. At each observed ED level, we compute the corresponding age based on fitted models and the difference to the observed age. The resulting sum of squares (weighted) is 6906 for the three-parameter model and 15,420 for the two-parameter model. The use of the three-parameter model provides a much-improved fit.

5. Concluding remarks

We have explored the use of a three-parameter logistic regression model for dose–response experiments. The sequential ED-design can easily be applied to this model, and the resulting data analysis is effective.

Simulation results show that the three-parameter logistic regression model is an effective extension of the commonly used two-parameter model that does not lead to more complex data analysis issues. The combination of the ED-design and the data analysis strategy works well. When the logistic model is correct, the more complex model has hardly any efficiency loss. When the three-parameter model holds but the logistic model is violated, the new approach gains substantial ground. It will be a useful addition to the toolbox for dose–response experiments.

Acknowledgments

The authors gratefully acknowledge the fundings from the National Natural Science foundation of China, 11871419 and the Natural Science and Engineering Research Council of Canada.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

The authors gratefully acknowledge the fundings from the National Natural Science foundation of China [Grant Number 11871419] and the Natural Science and Engineering Research Council of Canada.

Notes on contributors

Xiaoli Yu

Dr. Xiaoli Yu is currently self-employed.

Shaoting Li

Dr. Shaoting Li is an associate professor at the School of Statistics, Dongbei University of Finance and Economics.

Jiahua Chen

Jiahua Chen is Canada Research Chair, tier I at the Department of Statistics, University of British Columbia.

References

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.