
Power analysis, sample size calculation for testing the largest binomial probability

Thuan Nguyen & Jiming Jiang
Pages 78-83 | Received 31 Oct 2018, Accepted 20 Feb 2019, Published online: 15 Mar 2019

Abstract

A procedure is developed for power analysis and sample size calculation for a class of complex testing problems regarding the largest binomial probability under a combination of treatments. It is shown that the asymptotic null distribution of the likelihood-ratio statistic is not parameter-free, but χ₁² is a conservative asymptotic null distribution. A nonlinear Gauss-Seidel algorithm is proposed to uniquely determine the alternative for the power and sample size calculation given the baseline binomial probability. An example from an animal clinical trial is discussed.

1. Introduction

In biological and medical research, it is often necessary to perform power analysis, or to determine the sample size required, for binomial trials involving multiple treatments. For example, such an analysis/calculation is required by the National Institutes of Health (NIH) for research grant applications. One particular type of question that needs to be answered concerns the largest binomial probability under these treatments. Suppose that there are r treatments, denoted by 1, …, r. Here a ‘treatment’ can be an actual treatment (e.g., a drug), or a factor that has two levels (e.g., male/female). Let xq be the indicator for the qth treatment (0 or 1), 1 ≤ q ≤ r. Suppose that n independent trials are run under each combination of the treatments, x1, …, xr, resulting in a binary outcome for each trial (1 – success, 0 – failure). Let N(x1, …, xr) be the total number of successes under x1, …, xr. It is assumed that N(x1, …, xr) has a Binomial{n, p(x1, …, xr)} distribution, where p(x1, …, xr) is the probability of success in a single trial under x1, …, xr. It is believed that all the treatments have at least nonnegative effects, so p(0, …, 0) is the smallest probability and p(1, …, 1) is the largest. The question is to determine the sample size, n, so that one has power at least 1 − γ to prove that p(1, …, 1) is higher than the rest of the probabilities, if it is indeed higher by at least δ. In some cases, the researcher has a target sample size (e.g., the maximum under the budget constraint); the question then is to perform the power analysis for the given sample size. We illustrate with an example.

Example 1.1

Researchers at the Oregon Health & Science University (OHSU) were preparing to meet the deadline of a grant submission. One of the research aims of the grant proposal had to do with comparative transplantations via animal clinical trials (i.e., in mice) to determine the best overall protocol, which can then be further optimised. Successful liver repopulation is defined as greater than 70% cell replacement, and successful blood reconstitution is defined as greater than 50% human cells in the bone marrow. From previous studies, it was known that these levels of liver repopulation could be achieved in 20–80% of transplanted mice with a good hepatocyte donor, the average being 50–60%. For cord blood transplants into neonates, about 75% of the mice reach the desired human repopulation levels, again with a range of 50–90% between experiments. Using these numbers from the single cell type transplants, we made the assumption that a successful protocol would yield about 30% success in double-repopulation. Here the protocol involved a treatment factor with high and low dose levels, and a control factor of young and grown-up mice. In other words, there are two treatments: x1 = 1 for high dose and x1 = 0 for low dose; x2 = 1 for young mice and x2 = 0 for grown-up ones. The trials were to be carried out independently on n different mice under each protocol, resulting in a binomial proportion. It was determined that n should be no more than 30 due to the budget constraint. An initial request was made to perform a power analysis in order to detect a 10% difference that separates the success rates of the ‘optimal’ protocol (i.e., with high dose and young mice) and the rest.

A naive approach to the power analysis/sample size calculation would be to compare p(1, …, 1) with each of the other probabilities (actually, only those with exactly one of the treatment indicators equal to zero), and perform a two-sample test for the difference in proportions. However, this approach has low power, and often results in a sample size that the researcher cannot afford. The inefficiency of the naive approach is not surprising, because only a (small) portion of the data is involved in the two-sample test (for each comparison). One can do better by utilising the entire data set in the analysis. To do so we need to assume a model for the binomial probability. Suppose that
p(x1, …, xr) = h(β0 + β1x1 + ⋯ + βrxr), (1)
where h(·) satisfies 0 < h(x) < 1 and is strictly increasing. A well-known example of h is the logistic function, h(x) = eˣ/(1 + eˣ). Under the assumed model, the problem of interest can be expressed more precisely as testing the hypothesis
H0: at least one of the βj, 1 ≤ j ≤ r, is 0 (2)
versus H1: βj > 0, 1 ≤ j ≤ r. Naturally, the likelihood-ratio test is considered. The latter is based on the test statistic
L = 2(l̂ − l̂0), (3)
where l̂ is the maximised log-likelihood and l̂0 the maximised log-likelihood under H0. A first step for the likelihood-ratio test (LRT) would be to determine the critical value, cα, corresponding to the given level of significance α, such that
sup_{β∈H0} Pβ(L > cα) ≤ α, (4)
where β = (β0, …, βr)′ and Pβ is the probability distribution given that β is the true vector of parameters. If the log-likelihood function is concave, which is the case, for example, for logistic regression, the event inside the probability in (4) implies that the maximum of the log-likelihood under H0 must take place on the boundary of H0, provided that cα > 0. Therefore, the critical value is computed by considering the boundary of H0, which is a subset of
H̃0: βj = 0 for some 1 ≤ j ≤ r. (5)
Unfortunately, even with this simplification, the asymptotic null distribution of the LRT is not parameter-free.
This is shown in the next section. On the other hand, the arguments also show that a conservative asymptotic null distribution (CAND) for the LRT is χ₁², which is parameter-free. Here the CAND is in the sense that
sup_{β∈H̃0} lim sup_{n→∞} Pβ(L > χ₁,α²) = α. (6)

The main objective of the current paper is to determine the sample size, n, for the LRT so that the test will have the designated power, or to obtain the power of the LRT under a given sample size. Although standard power and sample size problems in logistic regression are well studied (e.g., Alam, Rao, & Cheng, 2010; Borenstein, Rothstein, & Cohen, 2001; Demidenko, 2007; Hsieh, Bloch, & Larsen, 1998; Novikov, Fund, & Freedman, 2010; Whittemore, 1981), to the best of our knowledge, the kind of problems that we are dealing with has not been addressed. Clearly, the power of the test depends on the alternative, and there are infinitely many possible alternatives to (2), that is, in H1. On the other hand, a practitioner would prefer a ‘short answer’, as opposed to something that is case-by-case. This issue is addressed in Section 3, where a unique alternative, based on a reasonable argument, is determined. A simple Gauss-Seidel type algorithm is proposed to compute the alternative. A Monte Carlo procedure is then proposed to compute the power or sample size. A real-life application is considered in Section 4. Technical details are deferred to the Appendix.

2. Asymptotic null distribution

In the standard situation, the LRT is known to have an asymptotic χ² distribution, under the null hypothesis, with a certain number of degrees of freedom that does not depend on the parameter under the null. For example, if, instead of (2), one were to test
H0j: βj = 0 (7)
versus H1j: βj ≠ 0 for a fixed 1 ≤ j ≤ r, then the asymptotic null distribution of the LRT is χ₁², regardless of the value of the true βk, k ≠ j, as long as the true βj is zero. In other words, the asymptotic null distribution is parameter-free. However, this is not the case for testing (2). We show this for the case of logistic regression with r = 2.

Write the log-likelihood function as l(β0, β1, β2). Let (β̂0, β̂1, β̂2), (β̂0[2], β̂1[2], 0), and (β̂0[1], 0, β̂2[1]) be the maximisers of l without constraint, over H02 = {(β0, β1, 0): β0, β1 ∈ R}, and over H01 = {(β0, 0, β2): β0, β2 ∈ R}, respectively. Suppose that the true parameter vector is (β0, β1, 0) ∈ H02, where β1 ≠ 0. By the standard asymptotic theory, we have β̂j[2] →P βj, j = 0, 1, as n → ∞. On the other hand, by White (1982), we have β̂j[1] →P βj[1], j = 0, 2, as n → ∞ for some β0[1] and β2[1]. Thus, by the Taylor expansion, we have
l(β̂0[2], β̂1[2], 0)/n = l(β0, β1, 0)/n + (1/n) Σ_{j=0,1} (∂l/∂βj)|_{β̃(2)} (β̂j[2] − βj), (8)
l(β̂0[1], 0, β̂2[1])/n = l(β0[1], 0, β2[1])/n + (1/n) Σ_{j=0,2} (∂l/∂βj)|_{β̃(1)} (β̂j[1] − βj[1]), (9)
for some β̃(j), j = 1, 2. It is easy to show that the partial derivatives are uniformly bounded when divided by n, so the second terms on the right sides of (8) and (9) are oP(1). As for the first terms, it is easy to show that, for any β,
l(β) = c + Σ_{x1,x2=0,1} [N(x1, x2)(β0 + β1x1 + β2x2) − n log(1 + e^{β0+β1x1+β2x2})],
where c does not depend on the parameter. Thus, we have
{l(β0, β1, 0) − l(β0[1], 0, β2[1])}/n = Σ_{x1,x2=0,1} [{N(x1, x2)/n}{(β0 + β1x1) − (β0[1] + β2[1]x2)} − log{(1 + e^{β0+β1x1})/(1 + e^{β0[1]+β2[1]x2})}].
By the weak law of large numbers, we have n⁻¹N(x1, x2) →P p(x1, x2) = h(β0 + β1x1), with h(x) = eˣ/(1 + eˣ), x1, x2 = 0, 1. Thus, we have
{l(β0, β1, 0) − l(β0[1], 0, β2[1])}/n →P Σ_{x1,x2=0,1} [{e^{β0+β1x1}/(1 + e^{β0+β1x1})}{(β0 + β1x1) − (β0[1] + β2[1]x2)} − log{(1 + e^{β0+β1x1})/(1 + e^{β0[1]+β2[1]x2})}]. (10)
For fixed x1, x2 ∈ {0, 1}, write p0 = h(β0 + β1x1) and p1 = h(β0[1] + β2[1]x2). Then, by the inequality in Appendix A.1, we have
{e^{β0+β1x1}/(1 + e^{β0+β1x1})}{(β0 + β1x1) − (β0[1] + β2[1]x2)} − log{(1 + e^{β0+β1x1})/(1 + e^{β0[1]+β2[1]x2})} = p0 log(p0/p1) + (1 − p0) log{(1 − p0)/(1 − p1)} ≥ 0,
with the equality holding if and only if p0 = p1. It follows that the right side of (10) is positive unless p0 = p1 for all x1, x2 = 0, 1. Because the latter implies β1 = 0, a contradiction, the right side of (10) must be positive.
Therefore, in conclusion, we have, with probability tending to one, l(β̂0[2], β̂1[2], 0) > l(β̂0[1], 0, β̂2[1]), hence L = 2{l(β̂0, β̂1, β̂2) − l(β̂0[2], β̂1[2], 0)} →d χ₁², by the standard asymptotic result [see the note below (7)].

Now suppose that the true parameter vector is (β0, 0, 0). Then, it can be shown (see Appendix A.2) that L →d η − η1∨η2 as n → ∞, where η = ξ′A⁻¹ξ, ξ = (ξ0, ξ1, ξ2)′ ~ N(0, A), A is the 3×3 matrix with rows (4, 2, 2), (2, 2, 1) and (2, 1, 2), and ηj = ξ[0,j]′A[0,j]⁻¹ξ[0,j], j = 1, 2, with ξ[0,j] = (ξ0, ξj)′ and A[0,j] the 2×2 submatrix of A corresponding to components 0 and j. Note that η − η1∨η2 is not distributed as χ₁². To see this, note that η1 < η2 if and only if (ξ1 − ξ0/2)² < (ξ2 − ξ0/2)². Because the pdf of ξ is positive everywhere, there is a positive probability that η1 < η2, hence that η1∨η2 > η1. On the other hand, one always has η1∨η2 ≥ η1. It follows that E(η1∨η2) > E(η1), hence E(η − η1∨η2) < E(η) − E(η1) = 3 − 2 = 1.
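The mean deficit can be checked by simulating the limiting random variable η − η1∨η2 directly from its definition. The sketch below (our own Python/numpy code, not from the paper) also illustrates that the limit is stochastically smaller than χ₁², which is what makes χ₁² a conservative reference distribution:

```python
import numpy as np

rng = np.random.default_rng(0)

# Covariance matrix A from Appendix A.2: the sum of x x' over the four
# design cells x = (1, x1, x2), x1, x2 in {0, 1}.
A = np.array([[4.0, 2.0, 2.0],
              [2.0, 2.0, 1.0],
              [2.0, 1.0, 2.0]])
A_inv = np.linalg.inv(A)

B = 200_000
xi = rng.multivariate_normal(np.zeros(3), A, size=B)  # xi ~ N(0, A)

# eta = xi' A^{-1} xi  (full quadratic form, chi^2_3 in distribution)
eta = np.einsum("bi,ij,bj->b", xi, A_inv, xi)

# eta_j = xi[0,j]' A[0,j]^{-1} xi[0,j], j = 1, 2
eta_j = []
for j in (1, 2):
    sub = xi[:, [0, j]]
    A_sub_inv = np.linalg.inv(A[np.ix_([0, j], [0, j])])
    eta_j.append(np.einsum("bi,ij,bj->b", sub, A_sub_inv, sub))

limit_L = eta - np.maximum(eta_j[0], eta_j[1])        # eta - eta1 v eta2

print("mean of eta - eta1 v eta2:", limit_L.mean())   # below 1 = E(chi^2_1)
crit = 3.841458820694124                              # chi^2_{1, 0.05}
print("P(limit > crit):", (limit_L > crit).mean())    # below 0.05
```

Since η − ηj is χ₁² in distribution for each j, the limit is the minimum of two (dependent) χ₁² variables, so both its mean and its tail probability fall below the χ₁² benchmarks.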

The results so far in this section have shown that the asymptotic null distribution of L depends on the values of the true parameters: if β = (β0, β1, 0)′, where β1 ≠ 0 [or β = (β0, 0, β2)′, where β2 ≠ 0, by a similar argument], the asymptotic null distribution is χ₁²; if β = (β0, 0, 0)′, the asymptotic null distribution is not χ₁². Thus, the asymptotic null distribution is not parameter-free.

Nevertheless, χ₁² is, in general (not restricted to the logistic model and r = 2), a CAND in the sense of (6), provided that the log-likelihood is concave. This can be shown with a simple argument. For any β ∈ H̃0, there is a 1 ≤ j ≤ r such that βj = 0. Due to the concavity, the event L > χ₁,α² is the same as the event 2(l̂ − l̃0) > χ₁,α², where l̃0 is the maximised log-likelihood over H̃0 [see the note above (5)]. On the other hand, we have 2(l̂ − l̃0) ≤ 2(l̂ − l̂0j), where l̂0j is the maximised log-likelihood over H̃0j = {β: βj = 0}. Therefore, we have Pβ(L > χ₁,α²) = Pβ{2(l̂ − l̃0) > χ₁,α²} ≤ Pβ{2(l̂ − l̂0j) > χ₁,α²} → α, by the standard asymptotic result. Therefore, we have lim sup_{n→∞} Pβ(L > χ₁,α²) ≤ α.

On the other hand, if β ∈ H̃0 is such that βj = 0 while βk ≠ 0, k ≠ j, then, by an argument similar to the one above for the special case of logistic regression with r = 2, it can be shown that, with Pβ-probability tending to one, L = 2(l̂ − l̂0j) →d χ₁², hence lim_{n→∞} Pβ(L > χ₁,α²) = α. Because lim_{n→∞} Pβ(L > χ₁,α²) achieves its supremum at βj = 0 while βk ≠ 0, k ≠ j, (6) must hold. Note that, by the definition of H̃0, which has j ≥ 1, all the indexes j, k mentioned in this paragraph are assumed to be ≥ 1; in other words, β0 is not involved.

3. Power and sample size calculation

By the results of the previous section, we can use χ₁,α² as the critical value of the LRT. As for the power calculation, although there are infinitely many alternatives that influence the power, it is often reasonable to assume, in practice, that the baseline probability is known. In other words, h(β0) is known according to (1); therefore, β0 is known.

In addition, the minimum amount, δ, by which the largest probability must exceed each of the other probabilities is given. In other words, we consider all the alternatives such that
p(1, …, 1) ≥ p(x1, …, xr) + δ (11)
for all (x1, …, xr) ≠ (1, …, 1). It follows, under (1), that all of the βj, 1 ≤ j ≤ r, must be positive, and that (11) is equivalent to
h(β0 + Σ_{j=1}^{r} βj) ≥ h(β0 + Σ_{1≤j≤r, j≠k} βj) + δ, 1 ≤ k ≤ r. (12)
The minimum amount of increase of the left sides of (12) over the right sides takes place when equality holds in all of the inequalities, that is, when
h(β0 + Σ_{j=1}^{r} βj) = h(β0 + Σ_{1≤j≤r, j≠k} βj) + δ, 1 ≤ k ≤ r. (13)
This results in r equations, from which we can uniquely determine the alternative. Note that (13) is a nonlinear equation system; however, it can be solved conveniently by utilising a Gauss-Seidel type algorithm (e.g., Jiang, 2000). Namely, from (13) we have
βk = h⁻¹{h(β0 + Σ_{1≤j≤r, j≠k} βj) + δ} − β0 − Σ_{1≤j≤r, j≠k} βj = g(β0 + Σ_{1≤j≤r, j≠k} βj), (14)
where g(x) = h⁻¹{h(x) + δ} − x (the inverse of h exists because h is assumed to be strictly increasing). Thus, given the initial values βj(0), 1 ≤ j ≤ r − 1 (e.g., all zero), we have
βk(l) = g(β0 + Σ_{j=1}^{k−1} βj(l−1) + Σ_{j=k+1}^{r} βj(l)), k = r, …, 1, (15)
for l = 1, 2, …. The convergence is guaranteed, and fast. We illustrate with an example.

Example 1.1

continued

In the animal clinical trial example, the researchers suggested a baseline probability of 0.3. Using the logistic regression, we have β0 = logit(0.3) = −0.8472979. Furthermore, the minimum probability increase was set (again by the researchers) as δ = 0.1. Recall that r = 2 in this case. The Gauss-Seidel algorithm converged within three iterations. Table 1 shows the R outputs of the first five iterations.

Table 1. Convergence of Gauss-Seidel algorithm.
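The fixed-point iteration (15) takes only a few lines to implement. The sketch below uses Python rather than the R used for Table 1, and the function names are our own; with the inputs of Example 1.1 it reproduces the converged values β1 = β2 = 0.4069759 reported in Section 4:

```python
import math

def logit(p):
    return math.log(p / (1.0 - p))

def expit(x):  # the logistic function h
    return 1.0 / (1.0 + math.exp(-x))

def gauss_seidel_alternative(beta0, delta, r, tol=1e-10, max_iter=100):
    """Solve (13) by the Gauss-Seidel iteration (15) with h = expit,
    so that g(x) = logit(expit(x) + delta) - x as in (14)."""
    beta = [0.0] * r                     # initial values beta_j(0) = 0
    for _ in range(max_iter):
        old = beta[:]
        for k in reversed(range(r)):     # k = r, ..., 1 in the paper
            s = beta0 + sum(beta[j] for j in range(r) if j != k)
            beta[k] = logit(expit(s) + delta) - s
        if max(abs(b - o) for b, o in zip(beta, old)) < tol:
            break
    return beta

# Example 1.1: baseline probability 0.3, delta = 0.1, r = 2
beta0 = logit(0.3)                       # -0.8472979...
print(gauss_seidel_alternative(beta0, 0.1, 2))  # both approx 0.4069759
```

Each sweep updates βk using the most recent values of the other components, which is what makes the scheme Gauss-Seidel rather than Jacobi.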

Given the target sample size, the power of the LRT at the alternative can be computed by a Monte Carlo method. Consider, for example, the case of logistic regression. Under the alternative β = (β0, β1, …, βr)′, one can simulate data, N(x1, …, xr), x1, …, xr ∈ {0, 1}, under the logistic regression. For each simulated data set, r + 1 logistic regressions are fit. The first one is under the full model, the next one under the model without x1, …, and the last one under the model without xr. Let l̂, l̂01, …, l̂0r denote the maximised log-likelihoods resulting from these logistic regressions. We compute L = 2(l̂ − max_{1≤j≤r} l̂0j) for the simulated data set. This is repeated B times, resulting in L(b), b = 1, …, B. The power of the LRT is then approximated by B⁻¹#{1 ≤ b ≤ B: L(b) > χ₁,α²}.

If, instead, the goal is to determine the sample size so that the LRT has a designated power, say, γ, we can use the following bisection procedure to speed up the search for the minimum sample size. First pick a couple of initial sample sizes, n0 and n1, and compute the power under n0 and n1 using the above procedure. Suppose that the power under n0 is less than γ, and the power under n1 is greater than γ. We then let n2=(n0+n1)/2 (take the integer part, if necessary), and compute the power under n2 using the above procedure. If the power under n2 is greater than γ, let n3=(n0+n2)/2; otherwise, let n3=(n2+n1)/2, and so on. The procedure should converge quickly to a single integer, n, so that either n or n+1 is the minimum sample size to have the power greater than or equal to γ.
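The bisection search can be sketched as follows; here power_fn stands in for the Monte Carlo power estimate above, and, to keep the sketch deterministic and self-contained, a hypothetical smooth power curve (not from the paper) is plugged in:

```python
def min_sample_size(power_fn, gamma, n_lo, n_hi):
    """Bisection for the smallest n with power_fn(n) >= gamma.
    Assumes power_fn is (approximately) increasing in n, with
    power_fn(n_lo) < gamma <= power_fn(n_hi)."""
    while n_hi - n_lo > 1:
        n_mid = (n_lo + n_hi) // 2
        if power_fn(n_mid) >= gamma:
            n_hi = n_mid          # power reached: search the lower half
        else:
            n_lo = n_mid          # power not reached: search the upper half
    return n_hi

# Hypothetical monotone power curve, standing in for the Monte Carlo
# estimate; purely illustrative.
toy_power = lambda n: 1.0 - 0.97 ** n

n_star = min_sample_size(toy_power, 0.8, 1, 200)
print(n_star, toy_power(n_star))  # smallest n with toy power >= 0.8
```

With a Monte Carlo power_fn the curve is only approximately monotone, so in practice one may re-estimate the power at the returned n and at n + 1, as the paper suggests.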

4. Animal clinical trial revisited

Let us go back to Example 1.1 of Section 1. Recall that the initial request was to perform a power analysis based on the sample size n = 30 for detecting a 10% difference between p(1,1) and the rest of the binomial probabilities, with the baseline probability set at 30%. We considered a logistic regression with r = 2 for this case. Here p0 = 0.3 and δ = 0.1. The alternative was computed by the Gauss-Seidel algorithm [see Example 1.1 (continued) in Section 3] as β0 = −0.8472979 and β1 = β2 = 0.4069759. The corresponding probabilities are p(0,0) = 0.3, p(0,1) = p(1,0) = 0.3916643, and p(1,1) = 0.4916643. As we can see, the minimum difference between p(1,1) and the rest of the p's is 0.1. For this alternative, the power of the LRT at the 5% level of significance was computed, using the Monte Carlo method described in Section 3 with B = 1000, as approximately 88%.

As 80% power was considered satisfactory by the researchers, it appeared that the sample size might be reduced a little. However, when the same procedure was applied to n = 20, the power was computed as approximately 78%. In subsequent communication, the lead researcher suggested that he would rather sacrifice a little on the minimum probabilistic difference in exchange for a reduced sample size (i.e., n = 20). Thus, the new δ was set as 0.15. The new alternative was computed by the Gauss-Seidel algorithm (which, again, converged in three iterations) as β0 = −0.8472979 and β1 = β2 = 0.6051085. The corresponding probabilities are p(0,0) = 0.3, p(0,1) = p(1,0) = 0.4397469, and p(1,1) = 0.5897469. As we can see, the minimum probabilistic increase of p(1,1) over the rest of the p's is 0.15. For the new alternative, the power of the LRT at the 5% level of significance was computed, again using the Monte Carlo method with B = 1000, as approximately 86%. The researcher was satisfied with the result.
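The reported probabilities for both alternatives can be checked directly from model (1) with the logistic h (a quick arithmetic check in Python, not part of the authors' analysis):

```python
import math

expit = lambda x: 1.0 / (1.0 + math.exp(-x))
beta0 = math.log(0.3 / 0.7)              # logit(0.3) = -0.8472979...

for delta, b in [(0.10, 0.4069759), (0.15, 0.6051085)]:
    p00 = expit(beta0)                   # baseline cell (0, 0)
    p01 = expit(beta0 + b)               # cells (0, 1) and (1, 0)
    p11 = expit(beta0 + 2 * b)           # cell (1, 1)
    print(delta, round(p00, 7), round(p01, 7), round(p11, 7),
          round(p11 - p01, 7))           # last column equals delta
```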

Acknowledgments

The authors are grateful to Dr. Markus Grompe of the Doernbecher Children's Hospital of the Oregon Health & Science University for presenting the problem from their research, and for information and helpful discussions. The authors also wish to thank a reviewer for helpful comments.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

The authors' research is partially supported by the National Institutes of Health (NIH) grant R01-GM085205A1. In addition, Thuan Nguyen's research is partially supported by the National Science Foundation (NSF) grant SES-1118469; Jiming Jiang's research is partially supported by the National Science Foundation (NSF) grant SES-1121794.

Notes on contributors

Thuan Nguyen

Thuan Nguyen is Associate Professor, Department of Public Health and Preventive Medicine, Oregon Health and Science University, USA.

Jiming Jiang

Jiming Jiang is Professor, Department of Statistics, University of California, Davis, USA.

References

  • Alam, M. K., Rao, M. B., & Cheng, F.-C. (2010). Sample size determination in logistic regression. Sankhyā B, 72, 58–75. doi: 10.1007/s13571-010-0004-6
  • Borenstein, M., Rothstein, H., & Cohen, J. (2001). Power and precision. Englewood, US: Biostat Inc.
  • Demidenko, E. (2007). Sample size determination for logistic regression revisited. Statistics in Medicine, 26, 3385–3397. doi: 10.1002/sim.2771
  • Hsieh, F. Y., Bloch, D. A., & Larsen, M. D. (1998). A simple method of sample size calculation for linear and logistic regression. Statistics in Medicine, 17, 1623–1634. doi: 10.1002/(SICI)1097-0258(19980730)17:14<1623::AID-SIM871>3.0.CO;2-S
  • Jiang, J. (2000). A nonlinear Gauss-Seidel algorithm for inference about GLMM. Computational Statistics, 15, 229–241. doi: 10.1007/s001800000030
  • Jiang, J. (2010). Large sample techniques for statistics. New York: Springer.
  • Novikov, I., Fund, N., & Freedman, L. S. (2010). A modified approach to estimating sample size for simple logistic regression with one continuous covariate. Statistics in Medicine, 29, 97–105.
  • White, H. (1982). Maximum likelihood estimation of misspecified models. Econometrica, 50, 1–25. doi: 10.2307/1912526
  • Whittemore, A. (1981). Sample size for logistic regression with small response probability. Journal of the American Statistical Association, 76, 27–32. doi: 10.1080/01621459.1981.10477597

Appendix

A.1. An inequality

Consider the functions g(p) = p^{p0}(1 − p)^{1−p0} and h(p) = log{g(p)}, 0 < p < 1. We have h′(p) = {p(1 − p)}⁻¹(p0 − p). Thus, h′(p) > 0, = 0, or < 0 according to whether p < p0, p = p0, or p > p0. It follows that h(·), hence g(·), has a unique maximum at p = p0, and g(p) < g(p0) for any p ≠ p0. Thus, for any 0 < p1 < 1, p1 ≠ p0, we have (p0/p1)^{p0}{(1 − p0)/(1 − p1)}^{1−p0} = g(p0)/g(p1) > 1.
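Taking logarithms, the inequality states that the Kullback-Leibler divergence between Bernoulli(p0) and Bernoulli(p1) is nonnegative, vanishing only at p1 = p0. This can be spot-checked numerically over a grid (an illustration of A.1, not part of the proof):

```python
import math

def bernoulli_kl(p0, p1):
    """p0 log(p0/p1) + (1-p0) log((1-p0)/(1-p1)), the quantity
    bounded below by 0 in Appendix A.1."""
    return (p0 * math.log(p0 / p1)
            + (1 - p0) * math.log((1 - p0) / (1 - p1)))

grid = [i / 20 for i in range(1, 20)]      # 0.05, 0.10, ..., 0.95
worst = min(bernoulli_kl(p0, p1) for p0 in grid for p1 in grid)
print(worst)   # 0.0, attained only on the diagonal p0 == p1
```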

A.2. Some derivation in Section 2

We show that L →d η − η1∨η2 as n → ∞, where the η's are defined in Section 2, if (β0, 0, 0)′ is the true parameter vector. With the notation introduced in Section 2, write ∇2l and H2 for the gradient and Hessian of l with respect to (β0, β1)′, ∇1l and H1 for those with respect to (β0, β2)′, and ∇l and H for those with respect to β = (β0, β1, β2)′. By the Taylor expansion, we have
l(β0, 0, 0) = l(β̂0[2], β̂1[2], 0) + (1/2)(β0 − β̂0[2], −β̂1[2]) H2(β̂0[2], β̂1[2], 0) (β0 − β̂0[2], −β̂1[2])′ + oP(1),
implying
l(β̂0[2], β̂1[2], 0) = l(β0, 0, 0) − (1/2)(β̂0[2] − β0, β̂1[2]) E{H2(β0, 0, 0)} (β̂0[2] − β0, β̂1[2])′ + oP(1).
Also, by the standard asymptotic expansion (e.g., Jiang, 2010, Ch. 4), we have
(β̂0[2] − β0, β̂1[2])′ = −[E{H2(β0, 0, 0)}]⁻¹ ∇2l(β0, 0, 0) + OP(n⁻¹).
It follows that
l(β̂0[2], β̂1[2], 0) = l(β0, 0, 0) − (1/2) ∇2l(β0, 0, 0)′ [E{H2(β0, 0, 0)}]⁻¹ ∇2l(β0, 0, 0) + oP(1).
Similarly, we have
l(β̂0[1], 0, β̂2[1]) = l(β0, 0, 0) − (1/2) ∇1l(β0, 0, 0)′ [E{H1(β0, 0, 0)}]⁻¹ ∇1l(β0, 0, 0) + oP(1),
and
l(β̂0, β̂1, β̂2) = l(β0, 0, 0) − (1/2) ∇l(β0, 0, 0)′ [E{H(β0, 0, 0)}]⁻¹ ∇l(β0, 0, 0) + oP(1).
Furthermore, we have
E{−H(β0, 0, 0)} = n h′(β0) A, (A1)
where A is the 3×3 matrix with rows (4, 2, 2), (2, 2, 1) and (2, 1, 2), and, by the standard asymptotic result,
n^{−1/2} ∇l(β0, 0, 0) →d N{0, h′(β0) A}. (A2)
Let ξn = (ξn,0, ξn,1, ξn,2)′ denote the left side of (A2) divided by {h′(β0)}^{1/2}. For a = (a0, a1, a2)′ and A = (ast)s,t=0,1,2, let a[0,j] and A[0,j] denote the subvector (a0, aj)′ and submatrix (ast)s,t=0,j, respectively, j = 1, 2. Then, by the above expressions, it is easy to show that
L = ξn′A⁻¹ξn − max_{j=1,2} {ξn[0,j]′ A[0,j]⁻¹ ξn[0,j]} + oP(1).
Because ξn →d ξ = (ξ0, ξ1, ξ2)′ ~ N(0, A) as n → ∞, by the continuous mapping theorem (e.g., Jiang, 2010, p. 30), we have L →d η − η1∨η2, where η = ξ′A⁻¹ξ and ηj = ξ[0,j]′ A[0,j]⁻¹ ξ[0,j], j = 1, 2.
