1,489
Views
0
CrossRef citations to date
0
Altmetric
Articles in the special topic of Bayesian analysis

A system for determining maximum tolerated dose in clinical trial

ORCID Icon, , & ORCID Icon
Pages 288-302 | Received 21 Jan 2020, Accepted 01 Jan 2021, Published online: 20 Jan 2021

Abstract

Toxicity study, especially in determining the maximum tolerated dose (MTD) in phase I clinical trial, is an important step in developing new life-saving drugs. In practice, toxicity levels may be categorised as binary grades, multiple grades, or in a more generalised case, continuous grades. In this study, we propose an overall MTD framework that includes all the aforementioned cases for a single toxicity outcome (response). The mechanism of determining MTD involves a function that is predetermined by user. Analytic properties of such a system are investigated and simulation studies are performed for various scenarios. The concept of the continual reassessment method (CRM) is also implied in the framework and Bayesian analysis, including Markov chain Monte Carlo (MCMC) methods are used in estimating the model parameters.

1. Introduction

The Continual Reassessment Method (CRM), first introduced by O'Quigley et al. (Citation1990), draws much attention from the biostatistical community. The fundamental idea behind the CRM was that a dose-toxicity curve would be fit to the data and that each patient or patient cohort would be assigned to the dose most likely to be associated with the target toxicity level, designated as maximum tolerated dose (MTD). In phase I cancer clinical trials, the primary goal is to find the maximum tolerated dose (MTD). In practice, the MTD is often defined as the dose of the drug that will produce a defined dose-limiting toxicity (DLT) in a pre-specified percentage of patients.

The toxicity measurement in the original CRM study by O'Quigley et al. (Citation1990) is for binary data, i.e., the response is either toxic, or non-toxic. CRM has several attractive properties. One is its quantitative explanation for the probability of toxicity for the MTD. The second is its utilisation of prior information about the possible toxicity at each dose level since the Bayesian inference is employed. Lastly, it often has smaller number of patients assigned to lower, ineffective doses. Most of the recent literature report properties of CRM using simulations, e.g., see Chevret (Citation1993), Faries (Citation1994), Goodman et al. (Citation1995), Korn et al. (Citation1994), O'Quigley (Citation1992), and O'Quigley and Chevret (Citation1991).

Variations of the CRM, its comparisons to other methods, and its applications can be found in Cheng and Lee (Citation2015), Heyd and Carlin (Citation1999), Iasonos et al. (Citation2008), Liu et al. (Citation2015), Morita (Citation2011), Onar et al. (Citation2009), Onimaru et al. (Citation2015), Piantadosi and Liu (Citation1996), Salter et al. (Citation2015), Storer (Citation2001), Yang et al. (Citation2010), Zohar and Chevret (Citation2003), and Zohar et al. (Citation2011). In addition, Clertant and O'Quigley (Citation2017,Citation2019) propose semiparametric methods in dose finding, which may be reduced to the family of CRM under certain parametric conditions. On the other hand, some proposals made to include both toxicity and efficacy as endpoints to guide escalation or to consider competing endpoints can be found in Braun (Citation2002), Gooley et al. (Citation1994), Ji et al. (Citation2019), North et al. (Citation2019), and Thall and Lee (Citation2003). Stopping rules in designs under the CRM framework are considered in Ishizuka and Ohashi (Citation2001), O'Quigley (Citation2002), O'Quigley and Reiner (Citation1998), and Zohar and Chevret (Citation2003). The last one considers two-stage CRM designs.

On the other hand, toxicity levels in phase I cancer trials are commonly categorised to multiple grades in the Common Toxicity Criteria (CTC) by the National Cancer Institute (Citation1999). The general guidelines of the NCI are grade 0 for no toxicity; grade 1 for minimal toxicity; grade 2 for moderate toxicity; grade 3 for severe toxicity; grade 4 for life threatening; and grade 5 for death. In most dose allocation procedures, these grades are dichotomised based on the DLT. For example, if grade 4 is considered as DLT, then grades 0-3 will be non-DLT and treated identically from the point of view of experimental design. Such polychotomous structure works for relatively mild toxicities such as neutropenia (a usually reversible deficiency in white blood cells). Although both grade 3 and 4 neutropenia may be considered as DLT, their difference in severity is usually not important in dose escalation. This is not the case for severe and possibly irreversible effects such as renal, liver, and neurological toxicities. For example, grade 4 renal toxicity requires dialysis and is much more dangerous than grade 3. Also, for renal and many other toxicities even grade 2 severity may be of concern to physicians, both for its implications on patient health and comfort, and because of the possibility of progression to grade 3 and grade 4 toxicity (i.e., irreversibility). Such concerns should be addressed in the dose escalation process. Wang et al. (Citation2000) extended the CRM incorporating the idea of unequal weights on the assessments of grade 3 and grade 4 toxicity in the dose escalation. Bekele and Thall (Citation2004) proposed a Bayesian method for dose-finding in a sarcoma trial based on a vector of correlated, ordinal-valued toxicities with severity levels varying with dose and they also developed a method for jointly eliciting the prior, a vector of weights quantifying the clinical importance of each level of each type of toxicity, and a target total toxicity burden (TTB) acceptable to the physicians. Later, Yuan et al. (Citation2007) propose another extension of the continual reassessment method (CRM), called the Quasi-likelihood approach, to incorporate grade information. They convert the toxicity grades to numeric scores that reflect their impacts on the dose allocation procedure, and then incorporated into the CRM using the quasi-Bernoulli likelihood. A simulation study demonstrates that the Quasi-CRM is superior to the standard CRM and comparable to a univariate version of the Bekele and Thall (Citation2004) method. Zhong et al. (Citation2012) proposed a trivariate continual reassessment method for clinical trials of toxicity, efficacy, and surrogate efficacy.

In Yang and Ye (Citation2012), the CRM is extended to the multi-toxicity grade case by introducing latent random variable, along with the idea of the overall MTD, which will be investigated in length in this research. In the current study, a more general setup of the toxicity grade system for any toxicity measure will be introduced.

The article is organised as follows. In Section 2, a unified system of the MTD for a single toxicity response is introduced, along with some examples and discussions of its analytic properties. In Section 3, the framework of the MTD finding for the continuous toxicity response in location-scale family with linear mean in dosage is established and investigated. It is followed by simulation studies for the normal response with linear mean in dosage by incorporating Bayesian posterior analysis in Section 4. Discussions of the target toxicity probability curve defined in Section 2 are also given in Section 4. Section 5 contains conclusions and discussions. All the necessary proofs are given in the Appendix.

2. A unified system of the maximum tolerated dose

2.1. The model and the MTD definition

Let X=[Dosemin,Dosemax] denote the range of all possible doses for the drug under investigation. In practice, a set of K doses, D={d1,d2,,dK}, from X (i.e. DX), is pre-selected. Denote by Y the toxicity response at dose xX, which is assumed to be a random variable on Y. The support of Y could be either discrete or continuous sets. Assume that Y has a cumulative distribution function as (1) Y|xF(y|x),<y<.(1) In general, the higher the dose, the more severe the response. Hence, we assume that F(y|x) satisfies the following regularity condition.

Regularity condition: For all yYinf{Y}, the probability P(Yy|x) is continuous and strictly increasing in dose x.

In the above regularity condition, y takes value in Yinf{Y} because P(Yinf{Y}|x)1, for all xX.

For any value yY, we define a level-y severe toxicity region, denoted by Ty={Yy}, as the following: there is a severe toxicity response if Yy, and no severe toxicity response if Y <y. Given yY and a level-y severe toxicity region Ty, there is a pre-specified target toxicity tolerance probability function, θy. Such a function defines the belief of the tolerated toxicity level (probability) by the users at the measure level y. To illustrate the concept of the systematic MTD, we need to define certain terminologies.

Definition 2.1

A dose x is said to be level-θy tolerable if the probability of the severe toxicity region Ty at dose x is no more than θy, i.e. P(Yy|x)θy, and the dose x is said not level-θy tolerable if the probability of Ty at dose x is more than θy, i.e. P(Yy|x)>θy.

For a given y, Definition 2.1 gives the dose level x at which its probability of toxicity region is smaller than the predetermined level θy. In addition, for a given a critical value y, we define the level-y maximum tolerated dose as follows.

Definition 2.2

For a value yY, associated with its target toxicity tolerance probability θ=θy, the level-y maximum tolerated dose (or briefly level-y MTD), denoted by yMTD(θ), is defined by (2) yMTD(θ)=sup{x|P(Yy|x)θ}=sup{x|P(Y<y|x)1θ}.(2)

Using the Definition 2.2, the following propositions are straightforward due to the increasing in x function P(Yy|x) and hence their proofs are omitted.

Proposition 2.1

For any critical value y, yY, the level-y MTD, yMTD(θ), is increasing in θ, i.e., yMTD(θ)yMTD(θ),for any θ<θ.

Referring to any level-y maximum tolerated dose, yMTD(θ), Proposition 2.1 shows that, in general, the higher the toxicity tolerance probability θ is, the larger the dose level can be applied to achieve the maximum efficacy of the drug. On the other hand, the higher the toxicity grade that is treated as the DLT, the larger the amount of the drug dose level can be tolerated. So, we have the following proposition.

Proposition 2.2

Given the target toxicity tolerance probability θy, the level-y MTD, yMTD(θ), is increasing in the critical value y, i.e., yMTD(θ)yMTD(θ),for any y<y,where y,yY.

As in the polychotomous case, see Yang and Ye (Citation2012), we take the supremum here because we believe that, given a target toxicity probability, the higher the dosage, the more efficient the chemical compound, i.e. dose-response curves for both toxicity and efficacy are increasing in the dosage, or, simply expressed, ‘the more pain, the more gain’.

Using Definition 2.2 above, the overall MTD may be defined as the largest dose x such that it is level-y tolerable for all yY. Based on Definition 2.2, dose x is level-y tolerable if and only if dose xyMTD(θy). Hence, the overall MTD can be defined as the infimum of all the level-y MTD as follows.

Definition 2.3

Given yY and the pre-specified target toxicity tolerance probability θ=θ(y), the overall MTD, denoted by YMTD(θ), is defined as YMTD(θ)=inf{yMTD(θ(y))|yY},where inf{S} is the infimum of set S and yMTD(θ(y)) is the level-y MTD associated with its target toxicity tolerance probability θ(y), for yY.

Before we discuss more on this overall MTD, let's take a look at some examples.

2.2. Illustrative examples

Example 2.1

Dichotomous Responses

For the dichotomous situation (see O'Quigley et al., Citation1990, Yang & Ye, Citation2012, and many others), the toxicity grade Y defined on Y={0,1} is Bernoulli distributed, with P(Y=y|x)=pxy(1px)1y, y=0 or 1 at any dose xX, where px is the probability of toxicity at dose x. Under the dichotomous situation, the only possible critical value is y = 1 such that the DLT is experienced if Y = 1 or DLT is not experienced if Y = 0. Here, given the target toxicity tolerance probability θ, the overall MTD is YMTD=sup{x|P(Y=1|x,λ)θ},where Λ is a vector of parameters related to the stated probability. As a special case, in the original CRM procedure of O'Quigley et al. (Citation1990), P(Y=1|x,λ)=([1+tanh(x)]/2)λ, for λ>0, is utilised, which is continuous increasing in x, the overall MTD is YMTD(θ)=supx1+tanh(x)2λθ=12log(θ1/λ1).Since there are only two y responses, we can define θy=1,if y=y0,θ0,if y=y1,where θ0 is the target toxicity tolerance probability at y=y1. In the dichotomous case, we can just let y0=0 and y1=1. In this special case, the cumulative distribution function F in (Equation1) is F(y|x)=1,if y>1,1+tanh(x)2λ,if 0<y1,0,if y0,at dose x.

Example 2.2

Polychotomous Responses

Polycho-tomous toxicity response in phase I clinic trials has drawn attention recently in research and practice. Wang et al. (Citation2000) extended the CRM by incorporating the idea of unequal weights on the assessments of grade 3 and grade 4 toxicity in the dose escalation. Bekele and Thall (Citation2004) proposed a Bayesian solution for dose finding in a sarcoma trial based on a vector of correlated, ordinal-valued toxicities with severity levels varying with dose. Yuan et al. (Citation2007) proposed another extension of the continual reassessment method (CRM), called the Quasi-CRM, to incorporate the grade information. In 2012, Yang and Ye (Citation2012) first proposed the MTD system for polychotomous responses.

Suppose that a 5-point ordinal scale is used to describe the severity of each type of toxicity, with grade 0 representing no toxicity, grade 1 minor toxicity, grade 2 moderate toxicity, grade 3 severe toxicity and grade 4 very severe toxicity. Hence, Y={0,1,,4}. For an individual subject treated at dose x, suppose P(Y=y|x)=py(x), for y = 0, 1, 2, 3, 4 and xX. For the critical value y=1,2,3,4, set the target toxicity tolerance probabilities θ=(θ1,θ2,θ3,θ4), the level-y MTD is yMTD(θy)=sup{x|P(Yy|x,β)θy}=supxi=y4pi(x)θy.Hence, the overall MTD is YMTD(θ)=min{1MTD(θ1),2MTD(θ2),3MTD(θ3),4MTD(θ4)}.In Example 2.2, the cumulative distribution function F in (Equation1) is F(y|x)=1,if y>4,p0(x)+p1(x)+p2(x)+p3(x),if 3<y4,p0(x)+p1(x)+p2(x),if 2<y3,p0(x)+p1(x),if 1<y2,p0(x),if 0<y1,0,if y0,at dose x.

Examples 2.1 and 2.2 show that the dichotomous and polychotomous response models are special cases of the unified system defined in Section 2.1.

2.3. The target toxicity probability function

In Definition 2.3, we propose a criterion in determining the overall MTD as the smallest level-y MTD. In the CRM procedures with binary toxicity response, the practitioner often would control a level-y MTD at a pre-specified value. For instance, the target level of θ=0.2 was used in O'Quigley et al. (Citation1990), where θ is the probability of response corresponding to the aimed-for target level. Since we are dealing with a response that might be continuous, the target toxicity tolerance probability curve θ(y) represents the ‘aimed-for target level curve’ that we would like to control. Hence, the target level in the dichotomous case, as well as in the polychotomous case, described in Yang and Ye (Citation2012), are special cases of the θ(y) curve defined here.

2.4. Properties of the system

In order to investigate the properties of the overall MTD defined in Section 2.1, we introduce the following notations. Denoted by Y the set of all possible values of the toxicity response. Y can be either an ordered categorical or a numerical set. Suppose F={F(y|x),<y<|xX} such that F(|x) is a cumulative distribution function, for all xX, and θ=θ(y),yY, where θ(y) is the target toxicity tolerance probability associated with the level-y severe toxicity Ty, yY. The triplet (Y,F,θ) is called the toxicity system defined on X. The following lemma will be shown in the Appendix.

Lemma 2.1

Suppose that g(y) and g0(y) are two functions defined on Y, such that |g0(y)|<. Then infyY{g(y)g0(y)}infyY{g(y)}infyY{g0(y)}supyY{g(y)g0(y)}.

Using Lemma 2.1, the proof of the following Convergence Theorem is also given in the Appendix.

Theorem 2.1

Convergence Theorem

Suppose that {(Y,F,θn)|n=0,1,2,} is a sequence of toxicity systems, such that |yMTD(θ0(y))|<, where yMTD(θ0(y)) is the level-y MTD of the toxicity system (Y,F,θ0). If θn converges uniformly to a limit θ0, i.e., for any ϵ>0, there exists an N>0, such that for all n>N and yY, |θn(y)θ0(y)|<ϵ, then (3) limnYMTD(θn)=YMTD(θ0),(3) where YMTD(θn) is the overall MTD with respect to the toxicity system (Y,F,θn), n=0,1,2,.

Theorem 2.1 shows that ‘θn converges uniformly to θ0’ is a sufficient condition of (Equation3), which means that if {θn} gets closer to θ0, as n, the MTDs resulted from (Y,F,θn) converges to the MTD resulted from (Y,F,θ0). Similar to the proof of Theorem 2.1, we can straightforwardly show the following Robustness Theorem.

Theorem 2.2

Robustness Theorem

Suppose (Y,F,θ1) and (Y,F,θ2) are two toxicity systems, such that all the level-y MTDs are finite. Define Δ=supyY{|θ1(y)θ2(y)|}. For any ϵ>0, there exists a δ>0, such that if Δ<δ then YMTD(θ1)YMTD(θ2)<ϵ.

Theorem 2.2 states that if θ1 and θ2 are close, so are the overall MTDs resulted from them. Based on Propositions 2.1 and 2.2, we have the following Reduction Theorem whose proof is given in the Appendix.

Theorem 2.3

Reduction Theorem

Suppose (Y,F,θ) is a toxicity system. For the target toxicity tolerance probability θ=θ(y), if there exist y1,y2Y such that y1<y2 and θ(y)θ(y1) for all y1y<y2, then the toxicity system (Y,F,θ) is equivalent to the toxicity system (Y,F,θ), where Y=Y{yY|y1y<y2} and F={F(y|x),<y<|xX} such that, for all xX, (4) F(y|x)=F(y|x),if y<y1 or yy2,F(y2|x),if y1y<y2,(4) where F(|x)F, for all xX. Furthermore, restricted on Y, the target toxicity tolerance probability θ=θ(y),yY, i.e. all the toxicity grade levels between and including y1 and y2 should be combined into one single toxicity grade level y1. Here, ‘equivalence’ is in the sense of finding the overall MTD.

Figure (a,b) graphically illustrate the target toxicity tolerance probabilities in systems (Y,F,θ) and (Y,F,θ), respectively.

Figure 1. Graph illustration for target toxicity tolerance probability in Theorem 2.3. (a) The target toxicity tolerance probability θ in system (Y,F,θ); (b) The target toxicity tolerance probability θ in system (Y,F,θ).

Figure 1. Graph illustration for target toxicity tolerance probability in Theorem 2.3. (a) The target toxicity tolerance probability θ in system (Y,F,θ); (b) The target toxicity tolerance probability θ∗ in system (Y∗,F∗,θ∗).

Figure indicates that the set of toxicity levels, Y, reduces to the the set of toxicity levels, Y=Y[y1,y2) and, restricted on Y=Y[y1,y2), the target toxicity tolerance probability θ (the curve in Figure (b)) serves exactly the same purpose in finding the overall MTD as the target toxicity tolerance probability θ (the curve in Figure (a)). In addition, Figure (a,b) graphically illustrate the F and F in systems (Y,F,θ) and (Y,F,θ), respectively. It indicates that, at all dose levels xX, the toxicity response YF(y|x) (the curve in Figure (a)) becomes a mixture toxicity response YF(y|x) (the curve in Figure (b)), where F(y|x) is given in (Equation4).

Figure 2. Graph illustration for F in Theorem 2.3, where x1<x2. (a) F in system (Y,F,θ); (b) F in system (Y,F,θ).

Figure 2. Graph illustration for F in Theorem 2.3, where x1<x2. (a) F in system (Y,F,θ); (b) F∗ in system (Y∗,F∗,θ∗).

Theorem 2.3 shows that the redundant portions in a toxicity system can be removed so that a simpler system can be considered. The following corollary is a direct result from Theorem 2.3.

Corollary 2.1

Suppose Y is a continuous toxicity response which takes value on interval Y=[a,b], where <a<b<+ and (Y,F,θ) is a toxicity system. If the target toxicity probability θ=θ(y) is a stepwise increasing function, i.e., there exists θ1>>θM such that θ=θ(y)=θ1,if ay<y1,θ2,if y1y<y2,θM,if {yM1yb},then the toxicity system (Y,F,θ) reduces to a polychotomous toxicity system (Y,F,θ), where Y={1,,M} which is a polychotomization of Y corresponding to the bin boundaries a=y0<y1<<yM1<yM=b, and θ={θ1,θM}. Furthermore, F={F(y|x),<y<|xX} such that, F(y|x)=0,if y<a,F(yi|x),if yi1y<yi, for i=1,,M,1,if yb,where F(|x)F, for all xX.

Corollary 2.1 shows that once the target toxicity tolerance probability curve is reduced to a step function, we reduce the continuous toxicity response problem to a polychotomous toxicity response problem.

3. Continuous toxicity response case: location and scale family response with linear mean in dose x

Suppose that a continuous toxicity response Y belongs to a location-scale family with the cumulative distribution function (5) Y|xFy(β0+β1x)σ,<y<,(5) at dose level x, where β1>0. In order to obtain an analytic solution of the overall MTD, a common scale parameter, σ, is assumed and the location parameter, μ(x)=β0+β1x, is assumed to be linearly increasing in the dose level x.

In practice, it is common to give a lower critical value, y0, and a upper critical value, y1, for the toxicity grade Y. Only those toxicity grades which fall in interval (y0,y1) are seriously considered. Hence, based on Theorem 2.3, one can define the target toxicity tolerance probability as follows. (6) θ=θ(y)=1,if yy0,h(y,y0,y1),if y0<y<y1,θ0,if yy1,,(6) where 0<θ0h(y,y0,y1)1 for all y(y0,y1) and h(y,y0,y1) is strictly decreasing in y. Usually, θ0=0 means the toxicity is 100% fatal and hence we can restrict to θ0>0, since extremely large toxic grades should be avoided as much as possible.

Since β1>0 due to the fact the probability of the toxicity increases in dosage, according to (Equation2), the level-y MTD is (7) yMTD(h(y,y0,y1))=,yσF1[1for yy0,h(y,y0,y1)]β0β1,for y(y0,y1),yσF1(1θ0)β0β1,for yy1.(7) Consequently, since β1>0 the overall MTD is (8) YMTD(θ)=infyY{yMTD(θ(y))}=mininfyy0{yMTD(θ(y))},infy(y0,y1){yMTD(θ(y))},infyy1{yMTD(θ(y))}=min{x1,x2},(8) where, x1=infy(y0,y1)yσF1[1h(y,y0,y1)]β0β1, and x2=y1σF1(1θ0)β0β1.Since (β0,β1) are constants, we have arginfyσF1[1h(y,y0,y1)]β0β1=arginf{yσF1[1h(y,y0,y1)]}.Hence, for a fixed distribution F(), the optimal point, y, is only determined by the target toxicity tolerance probability h(y,y0,y1). Therefore, the choice of the function h(y,y0,y1) in (Equation6) is crucial. In Example 3.1 with the normal response, we will discuss the choice of h(y,y0,y1). According to (Equation7), the following propositions are straightforward.

Proposition 3.1

Suppose F() is the standard cumulative distribution function of the location-scale family in (Equation5) and the target toxicity tolerance probability is in the form of (Equation6). Then, for a fixed β1>0, the overall MTD is decreasing in β0.

Proposition 3.1 indicates that, for a fixed positive slope β1, the higher the mean toxicity response of a toxicity system, the lower is the overall MTD.

Proposition 3.2

Suppose F() is the standard cumulative distribution function of the location-scale family in (Equation5) and the target toxicity tolerance probability is in the form of (Equation6). Then, for fixed β0,

(a)

if yσF1[1h(y,y0,y1)]<β0, the overall MTD is increasing in β1;

(b)

if yσF1[1h(y,y0,y1)]>β0, the overall MTD is decreasing in β1,

where y=arginf{yσF1[1h(y,y0,y1)]}.

Next, we use normal toxicity response with linear predictor (dose) to illustrate the location-scale family responses.

Example 3.1

Normal Response

Suppose that the continuous toxicity response Y has a cumulative distribution function (9) Y|xΦy(β0+β1x)σ,<y<,(9) at dose level x, where β1>0, and Φ() is the cumulative distribution function of the standard normal random variable.

In order to illustrate the proposed dose-finding strategy for the continuous toxicity response, we choose the following target toxicity tolerance probability function (10) θα=θα(y)=1,if yy0,θ0+(1θ0)y1yy1y0α,if y0<y<y1,θ0,if yy1,(10) where α>0. It is easy to verify that (Equation10) is a special case of (Equation6).

Figure (a) shows the target toxicity tolerance probability curve of θ(y) in (Equation10). On interval (y0,y1), θ(y) is convex if the power α>1, concave if the power α<1 and straightly decreasing if the power α=1. Figure (b) shows the negative derivative curve of the θ(y). It is clear that if α>1, the toxicity tolerance probability decreases quickly as the toxicity response level increases near the lower toxicity response level which means that the drug in study could quickly become toxic in the range. On the other hand, if α<1, the toxicity tolerance probability decreases slowly as the toxicity response level increases near the lower toxicity response level and then moves quickly near the upper toxicity response level. If α=1, the toxicity tolerance probability from y0 to y1 decreases at the same rate. In Section 4, simulation studies are preformed to check the influence of the power α on the overall MTD.

Figure 3. Graph illustration for (a) the target toxicity tolerance probability with form (Equation10) and (b) the negative derivative curve of the target toxicity tolerance probability.

Figure 3. Graph illustration for (a) the target toxicity tolerance probability with form (Equation10(10) θα=θα(y)=1,if y≤y0,θ0+(1−θ0)y1−yy1−y0α,if y0<y<y1,θ0,if y≥y1,(10) ) and (b) the negative derivative curve of the target toxicity tolerance probability.

Under this setting, both Φ() and θ() are continuous functions. Hence, using (Equation8), the overall MTD under this setting is (11) YMTD(θα)=infy(y0,y1)yσΦ1{(1θ0)×[1(y1yy1y0)α]}β0β11.(11) Based on (Equation11), the following proposition is proven in the Appendix.

Proposition 3.3

Suppose that the toxicity response is normally distributed as in (Equation9), and the target toxicity tolerance probability is set to be of the form (Equation10), then the overall MTD in (Equation11) is decreasing in α and (12) limαYMTD(θα)=y0σΦ1(1θ0)β0β1=y0+σΦ1(θ0)β0β1,(12) where YMTD(θα) is defined in (Equation11).

Proposition 3.3 shows that the larger the power α is or the more expected weights the lower toxicity response (close to left critical point y0) is assigned, the lower the overall MTD is. As the power α, the minimum overall MTD is obtained at the dose level x at which the probability of level-y0 severe toxicity Ty0={Yy0} achieves θ0, i.e., P(Yy0|x)=θ0.

Notice that as α, the target toxicity probability in (Equation10) reduces to θα=θα(y)=1,if yy0,θ0,if y>y0.According to Corollary 2.1, the continuous toxicity system reduces to the dichotomous toxicity system shown in Section 2.2. The MTD can thus be determined as (13) MTD=Φ1(θ0)β0β1.(13) In (Equation12), denote β0=β0y0 and set σ=1, then (Equation13) and (Equation12) are exactly same. In the dichotomous case discussed in Yang et al. (Citation2010), the underlying distribution of the toxicity response is unknown, hence we introduce the normal latent variable. In this study, the toxicity response is assumed to be normally distributed. Hence, (Equation12) is obtained as the power α in (Equation10) goes to infinity. Table  shows illustrative results of MTD in Proposition 3.3 for various α.

Table 1. An Illustration of Proposition 3.3. The parameters for the true model are β0=1, β1=1 and σ=1. The parameters for the target toxicity tolerance probability are y0=1, y1=1 and θ0=0.05.

4. Simulation studies for the normal toxicity response with Bayesian analysis

4.1. Bayesian posterior analysis

Suppose d1,,dK are K ordered dose levels under investigation. Let Fj={(x1,y1),,(xj,yj)}denote the history of the first j assignments and responses, where xl{d1,,dK} is the dose level assigned to the lth patient and yl is the corresponding observed response, l=1,,j. According to (Equation9) in Example 3.1, the likelihood function of β0, β1 and σ2 is L(β0,β1,σ2|Fj)=l=1j12πσ2×exp[yl(β0+β1xl)]22σ2(σ2)j/2exp12σ2l=1j[ylβ0β1xl]2.Let π(β0,β1,σ2) be the prior on (β0,β1,σ2), the joint posterior density function of (β0,β1,σ2) given data Fj is (14) π(β0,β1,σ2|Fj)π(β0,β1,σ2)(σ2)j/2×exp12σ2l=1j[ylβ0β1xl]2.(14) Since it is assumed that the average toxicity response is increasing in dose level x, i.e., the parameter β1 is assumed to be greater than 0, the prior π(β0,β1,σ2) should be defined on {(β0,β1,σ2)|β1>0,σ2>0}. Furthermore, we assume that (β0,β1,σ2) are independent in the prior. According to (Equation14), the full conditional posterior distributions are obtained as follows.

π(β0|β1,σ2,Fj): The posterior densities of β0, given β1, σ2 and data, is given as follows.

  • If a flat prior π(β0)1 is assigned on β0, then, β0|β1,σ2,FjNl=1j(ylβ1xl)j,σ2j.

  • If a proper conjugate prior N(β¯0,σ¯02) is assigned, then, β0|β1,σ2,FjNσ¯02l=1j(zlβ1xl)+σ2β¯0σ2+σ¯02j,σ2σ¯02σ2+σ¯02j.

π(β1|β0,σ2,Fj): The posterior densities of β1, given β0, σ2 and data, is given as follows.
  • If a flat prior π(β1)I(β1>0) is assigned on β1, then, β1|β0,σ2,FjNl=1j(ylβ0)xll=1jxl2,σ2l=1jxl2I(β1>0).

  • If a proper conjugate truncated normal prior N(β¯1,σ¯12)I(β1>0) is assigned, then, β1|β0,σ2,FjNσ¯12l=1j(ylβ0)xl+σ2β¯1σ2+σ¯12l=1jxl2,σ2σ¯12σ2+σ¯12l=1jxl2I(β1>0).

  • If a proper, but non-conjugate exponential prior exp{β1}I(β1>0) is assigned, then, β1|β0,σ2,FjNl=1j(ylβ0)xlσ2l=1jxl2,σ2l=1jxl2I(β1>0).

π(σ2|β0,β1,Fj): The posterior densities of σ2, given β0, β2 and data, is given as follows.
  • If σ2 has an inverse gamma prior InvGamma(a,b) with the probability density function. Then the posterior distribution of σ2, given β0 and β1, follows σ2|β0,β1,FjInvGammal=1ja+j/2,b+l=1j(ylβ0β1xl)2/2.

  • If π(σ2)1/σ2, then σ2|β0,β1,FjInvGammaj/2,l=1j(ylβ0β1xl)2/2.

4.2. MTD determination procedure

Based on the Gibbs samplers, β0,β1, and σ2 could be generated from those full conditional posterior distributions given in Section 4.1. After drawing from the joint posterior distribution π(β0,β1,σ2|Fj), one can estimate the overall MTD by using the following formula. (15) YMTDˆ(θα)=infy(y0,y1)yσˆΦ1{(1θ0)×[1(y1yy1y0)α]}β0ˆβˆ11.(15) The estimates of the parameters in (Equation15) can be obtained based on the MCMC simulation. Suppose one has obtained N generations of (β0,β1,σ2), which are (β0(i),β1(i),(σ2)(i)), i=1,2,,N, then, β0ˆ=Eˆπ(β0|Fj)(β0)=1Ni=1Nβ0(i),β1ˆ=Eˆπ(β1|Fj)(β1)=1Ni=1Nβ1(i), and σˆ=Eˆπ(σ2|Fj)(σ2)=1Ni=1N(σ2)(i).With regarding to the dose allocation, one can find the next dose level xj+1{d1,,dK}, at which the (j+1)th patient is to be treated, such that it is the closest dose to the estimated overall MTD, i.e., (16) xj+1=argminx{d1,,dK}{|xYMTDˆ(θα)|},(16) where YMTDˆ(θα) is obtained from (Equation15). After collecting the (j+1)th response yj+1, given Fj+1={(x1,y1),,(xj,yj),(xj+1,yj+1)}, the updated overall MTD can be obtained from (Equation15). Continue in this way until the results of the last patient, say nth, are available. Finally, the recommended dose level will be xn+1, which is the recommended MTD.

4.3. Simulation study

In order to check the operating characteristics of the normal response model, a simulation study is performed. We suppose that there are six predetermined and ordered dose levels, d1,d2,,d6. The data are simulated according to the following distributions, Y|diN(β0+β1di,σ2),for i=1,,6,where β1>0, since we assume that the mean toxicity response is increasing in dose level, and σ2 is the common variance for the toxicity responses at all dose levels. In this simulation study, we set β0=1 and β1=2. For σ2, we consider three settings, 0.25, 1 and 2.

For the target toxicity tolerance probability in (Equation10), we set y0=2 and y1=4, which yields a range of 6 for the toxicity response. It is about 12, 6 and 4 times the standard deviations of the three settings of σ2, respectively. For θ1, we consider 0.70, 0.90 and 0.99. For the power α, we take 0.2, 1 and 5. The parameters used in the simulation are shown in Table .

Table 2. Setting of simulation study for the normal toxicity response.

Table  summarises the simulation results of 200 duplications (trials) for various θ0 levels: (0.30,0.10,0.01); various σ2 values: (2,1,0.25); and various power α values: (0.2,1,5). Each entry consists of two values (s,t), where s stands for the frequency of overall MTD recommendation on each dose level and t the frequency of exposure for each dose. The cell with the underline indicates the highest recommendation percentage, i.e., the MTD. In addition, the 6 dose-level values are also given Table , along with the true MTD value obtained using (Equation11) for each case.

Table 3. Simulation results by generating 200 data sets for each scenario: each cell consists of two values s(t), where s is the percentage that the dosage di is recommended as an MTD, and t a percentage that the dosage is used in the experiment to search for the MTD. For each row, the dosage with largest percentage in MTD selection is boldfaced.

The first observation we have in Table  is that the recommended overall MTD's are all very close to the corresponding true MTD values. In addition, Table  shows that for fixed θ0 and σ2 values, as the power α decreases, both the percentage of the recommended MTD dose level and the percentage of patient allocation are drawn toward the lower dose levels. For example, in the case of θ0=0.30 and σ2=2, the most frequently recommended dose levels are d6, d5 and d2 for the power α=0.2, 1 and 5, respectively. This result is consistent with the first claim in Proposition 3.3, which is the overall MTD is decreasing in α.

Table  also shows that, for the fixed θ0 and the power α, as σ2 decreases, both the percentage of the recommended MTD dose level and the percentage of patient allocation are drawn toward the lower dose levels. For instance, in the case that θ0=0.30 and the power α=1, the most frequently recommended dose levels are d5, d4 and d2 for σ2=2, 1 and 0.25, respectively. This result indicates that, for the normal toxicity response, the more divergence the toxicity response is, the higher is the MTD dose level recommended.

Furthermore, as the lower limit of the target toxicity tolerance probability θ0 decreases, Table  shows that both the percentage of the recommended MTD dose levels and the percentage of patient allocations are drawn toward the lower dose levels. This results is consistent with the simulation results shown in the polychotomous toxicity responses in Yang and Ye (Citation2012). In practice, it is reasonable to allocate the patient at the lower dose level if the higher toxicity level is considered more severe, which is indicated here by larger θ1 in the target toxicity tolerance probability.

4.4. Choice of the target toxicity tolerance probability function

One of the important component in the system defined in Section 2 is the target toxicity tolerance probability function θ=θ(y) in (Equation6). This function can be viewed as the desired toxicity tolerance probability curve that we would like to control. In the concept of the level-y MTD in Definition 2.2, we want to find the largest dosage x, such that P(Yy|x)θ(y). Consider a situation that, other than pre-fixed y0 and y1, we define an y such that the target toxicity tolerance probability at y reaches to a pre-fixed level θ such as a controlled target level probability for the dichotomous case. Table  lists some values for those scenarios that we consider in the following simulation by using the function θα in (Equation10). Here we consider a commonly used target level θ=0.3. In addition, Figure  shows these target toxicity probability curves.

Figure 4. Graph illustrations for the θ(y) functions discussed in Table .

Figure 4. Graph illustrations for the θ(y) functions discussed in Table 4.

Table 4. Choices for the target toxicity tolerance probability functions by (Equation10).

Notice that, in Figure , the trend of θ(y) changes with different values of θ=θ(y) and θ0. When y increases from 1 to 3, θ(y) reaches to θ0 in a slower pace. This means that the target tolerance probability curve decreases from 1 to θ0 in a slower rate.

Using the target toxicity tolerance probability curves, simulation results in obtaining the MTD's for the same normal-response model investigated in Section 4.3 are shown in Table . In Table , the recommended MTD dosage level increases as y increases. When the variance of the toxicity response is larger, the recommended MTD levels spread out more than those with smaller variances.

Table 5. Simulation results of the recommended MTD levels, as well as dosage allocation percentages for different choices of θ and y.

Finally in this section, we consider a situation that is similar to a dichotomous case, although the toxicity response is still set as normal distribution in Section 4.3. However, the target toxicity probability curve is set as h(y,y0,y1)=1,if 2y<1,0.3,if 1y4.In this situation, the level-y MTD can be expressed as yMTD(h(y,y0,y1)=yσΦ1(0)β0β1=,if 2y<1,yσΦ1(0.7)β0β1=y0.5244σβ0β1,if 2581y4,The overall MTD thus can be obtained accordingly. A simulation study for this model is given in Table .

Table 6. Simulation results of the recommended MTD levels, as well as dosage allocation percentages for ‘dichotomous’ cases. Two decision criteria are employed: criterion I is to choose next dosage level at a nearest dose, and criterion II is to choose next dosage level at the lower bound.

In Table , two MTD dosage selection criteria are used. Criterion I chooses the next level MTD by the form in (Equation16), where a dose level that has nearest distance to YMTDˆ(θ) is chosen, while Criterion II chooses the next level MTD by choosing the largest dose level that is no larger than YMTDˆ(θ). The justification of Criterion II is that we choose next dose level in a more conservative way. In Table , we observe that most of the time, level d4 is chosen, unless when σ=0.5, Criterion I chooses d5. Since we know the true model in this case, we can calculate the toxicity probability at d4 and d5 as P(Y>1|d4)=(0.28,0.20,0.05), for σ2=(2,1,0.25), respectively, and P(Y>1|d5)=0.5 for all σ2. Hence, it is more aggressive, i.e., with more risk, in using Criterion I for the case of a smaller variance.

5. Conclusions and discussions

In this study, we propose a new framework of determining the maximum tolerated dose for a single toxicity response in clinical trial study. We have a new definition as overall MTD, YMTD(θ), which yields a unified model for the dose finding problem in Phase I clinical trials. The analytic properties of the overall MTD are examined. We prove the Convergence, Robustness and Reduction theorems of the overall MTD. It is shown that the traditional definition of MTD in the case of the dichotomous toxicity responses (see O'Quigley et al., Citation1990; Yang et al., Citation2010) as well as the overall MTD, MMTD(θ), introduced in the polychotomous toxicity responses (Yang & Ye, Citation2012), are special cases of this more generally defined overall MTD. In other words, this unified model makes it possible to consider the dichotomous, polychotomous and continuous toxicity responses under the same framework.

In order to find the overall MTD, the target toxicity tolerance probability θ(y), that corresponds to the level-y severe toxicity Ty, for yY, needs to be pre-specified. In the cases of dichotomous and polychotomous toxicity responses, the determination of θ(y), yY, is relatively straightforward (see for instance, Yang & Ye, Citation2012). In the case of continuous toxicity response, one needs to interact with the physicians to obtain an acceptable target toxicity tolerance probability curve θ(y). In general, the higher the toxicity tolerance level, the lower the chance that the patient exposes to the toxicity is allowed, i.e., the target toxicity tolerance probability θ(y) is decreasing in y, where yY. For all yY, θ0θ(y)1. A lower bound θ0>0 is necessary for the continuous toxicity response. Otherwise, if infyY{θ(y)}=0, YMTD(θ)= according to (Equation7). This will be impractical.

As an example for the continuous case, the normal toxicity response with linear mean in dosage is studied in this research. According to Corollary 2.1 and Proposition 3.3, the normal toxicity system could reduce to the dichotomous probit system described in Yang et al. (Citation2010), as the power α in target toxicity tolerance probability (Equation10) goes to infinity. This indicates that the proposed framework is closed under the normal distribution. In addition, we may also use a different target toxicity tolerance probability curve for the normal-response model to reach similar dichotomous response result, such as shown in Section 4.4. Furthermore, we perform several simulation studies and it is shown that the unified model works well for the normal-toxicity responses.

In practice, other continuous toxicity models, such as logistic, beta if the toxicity is represented in percentage or other suitable models, can be utilised under the same framework. Prior elicitation is also an important issue. In this study, we provide full conditional distributions for the parameters for various priors. When more complex models or hard-to-deal-with priors are used, complex full conditional distributions may be encountered, and the difficulties of simulation may arise. However, many simulation methods can be applied to handle those difficulties, such as acceptance-rejection algorithms or Metropolis-Hastings algorithm.

Acknowledgements

Research of Keying Ye and Min Wang were partially supported by grants from the College of Business at the University of Texas at San Antonio.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Notes on contributors

Keying Ye

Keying Ye is a professor of Statistics in the Department of Management Science and Statistics at the University of Texas at San Antonio. His research interests include Bayesian analysis, statistical inference and decision theory, and statistical applications.

Xiaobin Yang

Xiaobin Yang is a senior statistician in the HouseCanary Inc., San Antonio, Texas. His research interests include statistical analysis and applications.

Ying Ji

Ying Ji is a statistical consultant and her research interests include statistical analysis and applications.

Min Wang

Min Wang is an associate professor of Statistics in the Department of Management Science and Statistics at the University of Texas at San Antonio. His research interests include Bayesian statistics, high dimensional inference and statistical applications and machine learning.

References

Appendix

Proof

Proof of Lemma 2.1.

Since, for all yY, infyY{g(y)g0(y)}g(y)g0(y)supyY{g(y)g0(y)}, we have infyY{g(y)g0(y)}+g0(y)g(y)g0(y)+g0(y)supyY{g(y)g0(y)}+g0(y). Therefore, infyY{g(y)g0(y)}+infyY{g0(y)}infyY{g(y)}supyY{g(y)g0(y)}+infyY{g0(y)}.Consequently, the fact is proved by subtracting infyY{g0(y)} on each side of both inequalities because infyY{g0(y)}< since |g0(y)|<.

Proof

Proof of Theorem 2.1.

For all yY, denote hy(x)P(Y<y|x). According to the regularity condition, hy(x) is continuous and strictly decreasing in x. Hence hy1(x) is also continuous and strictly decreasing in x, where hy1 is the inverse function of hy. Therefore, according to (2.2), the level-y MTD for toxicity system (Y,F,θn) is yMTD(θn(y))=sup{x|P(Y<y|x)1θn(y)}=sup{x|hy(x)1θn(y)}=sup{x|xhy1(1θn(y))}=hy1(1θn(y)),where n=0,1,2,. Since hy1 is continuous, one has, for any ϵ>0, there exists a δ>0, such that if |θn(y)θ0(y)|<δ, then |yMTD(θn(y))yMTD(θ0(y))|=|hy1(θn(y))hy1(θ0(y))|<ϵ.Since θn uniformly converges to θ0, for the δ>0, there exists an N>0, such that for all n>N and yY, |θn(y)θ0(y)|<δ.

Consequently, for any ϵ>0, there exists a N>0, such that for all n>N and yY, ϵ<yMTD(θn(y))yMTD(θ0(y))<ϵ.Furthermore, according to Lemma 2.1, we have ϵ<infyY{yMTD(θn(y))}infyY{yMTD(θ0(y))}<ϵ,or, ϵ<YMTD(θn)YMTD(θ0)<ϵ,which implies that limnYMTD(θn)=YMTD(θ0).

Proof

Proof of Theorem 2.3.

Let YMTD(θ) and YMTD(θ) denote the overall MTD of toxicity systems (Y,F,θ) and (Y,F,θ), respectively. Since θ(y)θ(y1) for all y1y<y2, using Propositions 2.1 and 2.2, we have, y1MTD(θ(y1))yMTD(θ(y1))yMTD(θ(y)),for all y1y<y2,where yMTD(θ(y)) is the level-y MTD associated with system (Y,F,θ). Hence, (A1) YMTD(θ)=inf{yMTD(θ(y))|yY}=inf{yMTD(θ(y))|yY}.(A1) Using (Equation4) and θ=θ(y),yY, we have {xX|F(y|x)1θ(y)}={xX|F(y|x)1θ(y)},for all yY.Hence, according to (2.2), yMTD(θ(y))=yMTD(θ(y)), for all yY, where yMTD(θ(y)) is the level-y MTD associated with system (Y,F,θ). Finally, using (EquationA1), YMTD(θ)=inf{yMTD(θ(y))|yY}=inf{yMTD(θ(y))|yY}=YMTD(θ)which implies that system (Y,F,θ) is equivalent to system (Y,F,θ) in the sense of finding the overall MTD.

Proof

Proof of Proposition 3.3.

Since [(y1y)/(y1y0)]α is decreasing in α, for any y(y0,y1), it is easy to show that the overall MTD in (Equation11) is decreasing in α.

Since Φ1[(1θ0)(1[y1yy1y0]α)] is continuously increasing in α, we have limαΦ1(1θ0)1y1yy1y0α=Φ1(1θ0), for any y(y0,y1), which implies that for any ϵ>0, there exists a real number A, such that for any α>A, β1ϵσ<Φ1(1θ0)1y1yy1y0αΦ1(θ1)<0,for any y(y0,y1). Hence, for any ϵ>0, there exists a real number A, such that for any α>A, 0<infy(y0,y1)yσΦ1(1θ0)1y1yy1y0αβ0β1y0σΦ1(1θ0)β0β1=infy(y0,y1)(yy0)σΦ1(1θ0)1y1yy1y0α1y1yy1y0αΦ1(1θ0))β11<infy(y0,y1)yy0β1+ϵ=ϵ,

which implies (Equation12).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.