Publication Cover
Sequential Analysis
Design Methods and Applications
Volume 27, 2008 - Issue 1
265
Views
0
CrossRef citations to date
0
Altmetric
Original Articles

Discussion on “Second-Guessing Clinical Trial Designs” by Jonathan J. Shuster and Myron N. Chang

Pages 30-32 | Received 20 Sep 2006, Accepted 10 Apr 2007, Published online: 04 Feb 2008

Abstract

An argument is put forward to include fully sequential tests as useful methods in clinical trials. Generalizations of the Pocock (Citation1977) and O'Brien and Fleming (Citation1979) group sequential tests to fully sequential strategies are available now for several data structures. Looking at the whole statistical process gives a lot of additional information, and this makes flexible decision making possible.

Subject Classifications:

1. INTRODUCTION

This contribution raises the important issue of fixed sample size versus different group sequential trial designs. We might go one step further and add fully sequential methods to the pool of available arsenal.

When data accrues over a lengthy period of time, it is an important consideration how often the data should be looked at to be able to come to a timely and correct decision. If in a study only the end summary is published (fixed-sample-size design), then Shuster and Chang suggest that one can estimate by interpolation what knowledge would have been available at some earlier times. This allows readers to decide if a more frequent evaluation scheme would have been more efficient in saving time and/or money, which is naturally of great importance for ethical and financial reasons.

Currently, fixed-sample-size and group sequential methods are in use. Fully sequential trials are not considered as an option simply because they are not in the pool of the approved methodologies. As the nature and purpose of medical trials is widely varied, in many cases the full statistical sequence could add a lot of insight into the phenomenon under study. Its use and presentation could circumvent later criticisms for failing to be open or to adopt a strategy for early decision making. Second-guessing as discussed in this paper would then be unnecessary.

Before putting forth arguments for adding fully sequential monitoring to the accepted arsenal of sequential tests, let us have a brief look at why group sequential trials are the methods of choice when interim analyses are called for.

The need for interim analyses has already been well established, and there is no longer any pressing debate about it now. The vast literature on group sequential methods and their popularity are also well known. It is their technical feasibility that made group sequential methods so widely used since the 1980s. Yet the first sequential methods in statistics were fully sequential in nature: for example, control charts (dated to the period 1920–1930) and the sequential likelihood ratio tests of Wald (around 1940).

Clinical trials have more complicated data structures, so monitoring the mean of independent, identically distributed observations or choosing between two simple hypotheses are not sufficient for their need. Attempts to extend the first sequential testing methods above to more complex situations, such as when nuisance parameters are present, were not successful (see, e.g., Remark 1 of Gombay, Citation2002a, and Gombay, Citation2002b, and references therein). In many cases the distribution of the full sequence, Stat(k), k = 1, 2,…, n, that is, the values of the statistic calculated after each new observation was deemed intractable (e.g., Basseville and Nikiforov, Citation1993).

On the other hand, fixed-sample-size statistics were well understood and accepted for many such situations. The work of Pocock (Citation1977) and O'Brien and Fleming (Citation1979) pioneered the adjustment of critical values for the fixed-sample-size tests when interim analyses are called for. Their implementation called for numerical calculations involving multivariate distributions.

2. NEW TESTS

With the development of strong approximations in statistics, however, often the full statistics sequence can now be used for testing hypotheses and theoretical critical values are available that have been shown to work well. For example, the generalized likelihood ratio (Gombay, Citation1996), the score vector, and the Wald statistics sequence (Gombay, Citation2002a) have now strong approximations that allow sequential tests. There is evidence in published (Gombay and Hussein, Citation2006) and unpublished simulations that the sequential versions of the Pocock and O'Brien-Fleming tests are not losing power or stop later than the group sequential version, allowing only a limited number of looks at the data. Furthermore, the full statistics sequence can reveal important characteristics of the treatments. See Example 1 of Gombay (Citation2005), where the statistical process is significant at the early stages, but later no difference in treatment effect was present. This revealed that Treatment A had a fatal effect on many people with a heart condition soon after the treatment began, but for those who survived it was not effective.

From this example it is obvious that reconstructing earlier situations from end results, as advocated by the paper under discussion, is only useful if the process is homogeneous along the time variable. It cannot reveal hidden variable effects that may present themselves only in the analysis of the full sequence.

So, if the clinical trial's results have to be open to scrutiny, fully accessible to all (or to a select group), then only the whole statistics process can answer the “what if” questions. Otherwise, we have a case of need for second-guessing, which may or may not be correct. There is a trend now to call for more transparency (e.g., Lilford et al., Citation2001; Reidpath, Citation2001; and in discussions in meetings), and if this will be a successful effort for the protection of all involved, then fully sequential methods will have to be an integral part of the available accepted methods.

Having said that, it is obvious that not all trials need fully sequential methodology. However, one argument used against it, though valid 20 years ago, is no longer so. This is the argument of the technical difficulty arising from the meeting times of the data and safety monitoring committees. We now have the technology of vast databases with data entry from all over the world and instantaneous updating. Using this, or indeed a much smaller-scale, technology for multicenter medical trials makes this problem obsolete. Also, a statistics process can be presented effectively by a graph, together with the critical values, and this would not violate any privacy requirements.

In conclusion, I welcome this contribution for raising many important, though somewhat dormant issues, and for arguing that medical trial designs may have to be more flexible and open than they are allowed to be by the current methodologies of choice.

Notes

Recommended by N. Mukhopadhyay

REFERENCES

  • Basseville , M. and Nikiforov , I. V. ( 1993 ). Detection of Abrupt Changes: Theory and Applications , Englewood Cliffs : Prentice Hall .
  • Gombay , E. ( 1996 ). The Weighted Sequential Likelihood Ratio , Canadian Journal of Statistics 24 : 2209 – 239 .
  • Gombay , E. ( 2002a ). Parametric Sequential Tests in the Presence of Nuisance Parameters , Theory of Stochastic Processes 8 : 106 – 118 .
  • Gombay , E. ( 2002b ). Sequential Tests of Composite Hypotheses, in Limit Theorems in Probability and Statistics II, Proceedings of the Fourth Hungarian Colloquium on Limit Theorems in Probability and Statistics, Balatonlelle, Hungary, June 28–July 2, 1999, I. Berkes , E. Csáki , and M. Csörgo , eds., pp. 107 – 125 , Budapest: Bolyai János Matematikai Társulat .
  • Gombay , E. ( 2005 ). Weighted Logrank Statistics in Sequential Trials, Technical Report Series 05.09, Statistics Center, University of Alberta, Calgary .
  • Gombay , E. and Hussein , A. A. ( 2006 ). Sequential Comparison of Two Populations by Parametric Tests , Canadian Journal Statistics , 34 : 217 – 232 .
  • Lilford , R. J. , Braunholtz , Z. , Edwards , S. , and Stevens , A. ( 2001 ). Monitoring Clinical Trials—Interim Results Should Be Publicly Available , British Medical Journal 323 : 441 – 442 .
  • O'Brien , P. C. and Fleming , T. R. ( 1979 ). A Multiple Testing Procedure for Clinical Trials , Biometrics 35 : 549 – 556 .
  • Pocock , S. J. ( 1977 ). Interim Analyses for Randomized Clinical Trials: The Group Sequential Approach , Biometrics 38 : 153 – 162 .
  • Reidpath , D. ( 2001 ). Interim Data Are at Least as Important as Interim Analyses , British Medical Journal 323 : 1425 – 1425 .
  • Recommended by N. Mukhopadhyay

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.