Sequential analysis

In statistics, sequential analysis or sequential hypothesis testing is statistical analysis where the sample size is not fixed in advance. Instead data is evaluated as it is collected, and further sampling is stopped in accordance with a pre-defined stopping rule as soon as significant results are observed. Thus a conclusion may sometimes be reached at a much earlier stage than would be possible with more classical hypothesis testing or estimation, at consequently lower financial and/or human cost.

History

Sequential analysis was first developed by Abraham Wald[1] with Jacob Wolfowitz, W. Allen Wallis, and Milton Friedman[2] while at Columbia University's Statistical Research Group as a tool for more efficient industrial quality control during World War II. Its value to the war effort was immediately recognised, and led to its receiving a "restricted" classification.[3] Another early contribution to the method was made by K.J. Arrow with D. Blackwell and M.A. Girshick.[4]

A similar approach was independently developed at the same time by Alan Turing, as part of the Banburismus technique used at Bletchley Park, to test hypotheses about whether different messages coded by German Enigma machines should be connected and analysed together. This work remained secret until the early 1980s.[5]

Applications of sequential analysis

Clinical trials

In a randomized trial with two treatment groups, group sequential testing may for example be conducted in the following manner: After n subjects in each group, i.e., a total of 2n subjects, are available, an interim analysis is conducted. That means, a statistical test is performed to compare the two groups, if the null hypothesis is rejected, the trial is terminated. Otherwise, the trial continues. Another n subjects per group are recruited. The statistical test is performed again, now including all 4n subjects. If the null is rejected, the trial is terminated. Otherwise, it continues with periodic evaluations until a maximum number of interim analyses have been performed. At this point, the last statistical test is conducted, and the trial is discontinued.[6]

Other applications

Sequential analysis also has a connection to the problem of gambler's ruin that has been studied by, among others, Huyghens in 1657.[7]

Step detection is the process of finding abrupt changes in the mean level of a time series or signal. It is usually considered as a special kind of statistical method known as change point detection. Often, the step is small and the time series is corrupted by some kind of noise, and this makes the problem challenging because the step may be hidden by the noise. Therefore, statistical and/or signal processing algorithms are often required. When the algorithms are run online as the data is coming in, especially with the aim of producing an alert, this is an application of sequential analysis.

Bias

The statistics of a trial that is stopped early at only n samples are different than a similar trial that is run for a predetermined number of trials, even if they end up collecting the same number of samples. If this is not accounted for when interpreting the sequential trial, the results will be biased. Therefore, it is important that proper methodology is followed in order to avoid false conclusions.

See also

Notes

  1. Wald, Abraham (June 1945). "Sequential Tests of Statistical Hypotheses". The Annals of Mathematical Statistics. 16 (2): 117186. doi:10.1214/aoms/1177731118. JSTOR 2235829.
  2. Berger, James (2008). "Sequential Analysis". The New Palgrave Dictionary of Economics, 2nd Ed. doi:10.1057/9780230226203.1513.
  3. Kenneth J. Arrow, David Blackwell and M.A. Girshick (1949). "Bayes and minimax solutions of sequential decision problems". Econometrica. 17 (3/4): 213244. doi:10.2307/1905525. JSTOR 1905525.
  4. Randell, Brian (1980), "The Colossus", A History of Computing in the Twentieth Century (PDF), p. 30, retrieved 22 March 2011
  5. Korosteleva, Olga (2008). Clinical Statistics: Introducing Clinical Trials, Survival Analysis, and Longitudinal Data Analysis (First ed.). Jones and Bartlett Publishers. ISBN 0-7637-5850-7.
  6. Gosh, B. K.; Sen, P. K. (1991). Handbook of Sequential Analysis. New York: Marcel Dekker. ISBN 9780824784089.

References

Commercial
This article is issued from Wikipedia - version of the 11/22/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.