Search in:

Journal of Statistics Education Volume 11, 2003 - Issue 1

Submit an article Journal homepage

Free access

408

Views

CrossRef citations to date

Altmetric

Listen

Original Articles

Logical Extremes, Beta, and the Power of the Test

James V. PintoNorthern Arizona UniversityCorrespondence[email protected]

Pin NgNorthern Arizona UniversityCorrespondence[email protected]

David S. AllenNorthern Arizona UniversityCorrespondence[email protected]

| Published online: 01 Dec 2017

Cite this article
https://doi.org/10.1080/10691898.2003.11910698

In this article

1. Introduction
2. Calculation Process
3. The Logical Extremes
4. Conclusions
Acknowledgements
References

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF

Abstract

There is a potential misuse of the power function under the logical extreme when the null hypothesis is true. The power function is defined to measure the probability of rejecting the null given any value of the parameter being tested. It can be used to obtain the power and the β values only under the alternative hypothesis. When the null is true, the power function can be used to obtain the size of the test. The power and the probability of committing a Type II error are, however, undefined and, hence, the power function should not be used to obtain these values.

Keywords

Power function
Simulations
Type II error

1. Introduction

It is not an easy or smooth task to calculate the power of a test for beginning statistics students. Until computer algorithms and simulations were designed to help in this process (see, for example, CitationDoane, Mathieson, and Tracy 2001), the best one could do was to calculate a single point on the power function representing 1 – β (as in CitationKvanli, Ravur, and Guynes 2000). CitationYu and Behrens (1994) illustrate the usefulness of simulations in determining the power of a test. It is possible to calculate every point on the power curve, but in-class demonstrations are usually limited to single points due to computational constraints and the repetitive nature of the process. Logical extremes, as detailed below, are usually ignored. The purpose of this note is to demonstrate that the logical extremes are nontrivial and that one of the extremes results in calculations of β and the power of the test that are inconsistent with the common pedagogy. A detailed history of the nature of hypothesis testing, including a section of the power of the test in light of the Fisher approach versus the Neyman-Pearson approach, can be found in CitationLehmann (1993). He concluded that combining the best features of both could unify the two approaches. This historical debate will not be addressed in this paper.

2. Calculation Process

We will use the familiar test on the population mean of a normal distribution with a known variance to set up the stage for logical extremes. We assume that the data generating process is where φ is the probability density function of a normal distribution with an unknown mean μ and a known variance σ².

We first consider testing the simple null hypothesis against a two-sided composite alternative assuming σ = 0.1 and n = 36. In our example, under the null hypothesis, the sampling distribution of the sample mean is normal, centered at with a standard error of . This is illustrated by the normal distribution on the left in , which was generated using Visual Statistics (CitationDoane, Mathieson, and Tracy 2001). The size of the z-test (the probability of committing a Type I error), α, is chosen to be 10%. also presents a second normal distribution on the right, that is the sampling distribution of the sample mean at the specific value of μ = 1.51 when the null hypothesis is false. The shaded area under this sampling distribution in the middle acceptance region measures the probability of committing a Type II error while the shaded areas in the rejection regions provide the power of the test, 1 – β.

Figure 1 Beta and the Power of the Test for Two-Tailed Test.

The power function, defined as the probability of rejecting the null given a specific value of μ, for the z test is given by , where Φ is the cumulative distribution function of the standard normal distribution. This yields 1 – β = 0.16 on the power function at this single point of μ = 1.51. The entire power function is shown in and resembles an upside down normal distribution. It should be noted that the power function in this example approaches one asymptotically as we move away from 1.5 in either direction.

Figure 2 The Power Function for a Two-Tailed Test.

For the one-tailed test, we consider the composite null hypothesis versus the composite alternative . The sampling distributions of the sample mean under the null of and the specific alternative of are illustrated, respectively, by the left and right distributions in . The power function is given by . The power of the test at . is 0.25 and the whole power function is illustrated in . The power function in this example approaches one and zero asymptotically as we move away from 1.5 in the positive and negative directions, respectively.

Figure 3 Beta and the Power of the Test for One-Tail Test.

Figure 4 The Power Function for a One-Tail Test.

3. The Logical Extremes

The first logical extreme occurs when the two sampling distributions in both and pull apart to a point where there is very little overlap between them. In this case the power of the test approaches 1 (as the two distributions in and separate), and the β approaches zero. This can easily be seen in when μ moves away from in both directions and also in when μ moves away from to the right.

The second logical extreme occurs when μ approaches . One subtle point that has not usually been made explicitly in the definition of the power function is the fact that the power function yields the power of a test only when the null hypothesis is false. Under the null hypothesis, the power function provides the probability of committing a Type I error instead of the power of the test. As presented in , the power function yields the 10% size at according to the accepted pedagogy (CitationFreund and Walpole 1987).

This is where this logical extreme breaks down. In this case, it is incorrect to say that “the power of the test is 10% or the probability of committing a Type II error is 90%” since the power is defined as “the probability of rejecting a false null hypothesis” and a Type II error is defined to be “the failure to reject the false null hypothesis.” But there is no false null hypothesis when the null hypothesis is true at . β should be undefined rather than having a maximum value of 90%. Thus, the contention that the power of the test is at a minimum of 10% under the null cannot be supported because there is no false null hypothesis when the null hypothesis is true at . Both the power and the probability of committing a Type II error are undefined under this logical extreme. Unfortunately, this is usually overlooked. For example is generated for using Visual Statistics which labels the area under the “accept” region as “Type II” error and the area under the “Reject” regions as “Power.” This misconception is also apparent in the few current textbooks that attempt to address the power function. In CitationBerenson, Levine and Krehbiel (2002, pp. CD9-6) they state “Interestingly, if the population mean is actually 368 grams, the power of the test is equal to α, the level of significance (which is 0.05 in this example), because the null hypothesis is actually true.” In CitationNewbold, Carlson and Thorne (2003, p. 357), the authors make a similar statement that includes “The power of the test at μ₀ equals α, …”

Figure 5 Logical Extreme: Beta and the Power of the Test for Two-Tailed Test.

Similarly, the sampling distribution and the power function for the one-tailed test are presented in and .

Figure 6 Logical Extreme: Beta and the Power of the Test for One-Tail Test (Right).

Figure 7 Logical Extreme: The Power Function for a One-Tail Test (Right).

The power function in provides the probability of rejecting the null hypothesis over the whole range of values of μ. It can be used to obtain the power of the test only when . The size of the test is defined as the supremum of the power function over . This yields a size of 10% according to accepted pedagogy (CitationMood, Graybill and Boes 1974; CitationTamhane and Dunlop 2000). The power and β remain undefined under the null when .

4. Conclusions

Acknowledgments

The authors wish to thank two anonymous referees for their detailed and useful comments. All remaining errors are the responsibility of the authors.

References

Berenson, M. L., Levine, D. M., and Krehbiel, T. C. (2002), Basic Business Statistics, (8th ed.), Upper Saddle River, NJ: Prentice Hall.
Google Scholar
Doane, D. P., Mathieson, K., and Tracy, R. L. (2001), Visual Statistics 2.0, New York: McGraw-Hill/Irwin.
Google Scholar
Freund, J. E., and Walpole, R. E. (1987), Mathematical Statistics, Englewoods Cliffs, NJ: Prentice Hall.
Google Scholar
Kvanli, A. H., Pavur, R. J., and Guynes, C. S. (2000), Introduction to Business Statistics: A Computer Integrated, Data Analysis Approach, (5th ed.), Cincinnati, OH: South-Western College Publishing.
Google Scholar
Lehmann, E. L., “The Fisher, Neyman-Pearson Theories of Testing Hypothesis: One Theory or Two?” Journal of the American Statistical Association, 88, 1242–1249.
Web of Science ®Google Scholar
Mood, A. M., Graybill, F. A., and Boes, D. C. (1974), Introduction to the Theory of Statistics, (3rd ed.), New York: McGraw-Hill.
Google Scholar
Newbold, P., Carlson, W. L., and Thorne, B. (2003), Statistics for Business and Economics, (5th ed.), Upper Saddle River, NJ: Prentice Hall.
Google Scholar
Tamhane, A. C., and Dunlop D. D. (2000), Statistics and Data Analysis from Elementary to Intermediate, Upper Saddle River, NJ: Prentice Hall.
Google Scholar
Yu, C. H., and Behrens, J. T., (1994), “Identification of Misconceptions in Learning Statistical Power With Dynamic Graphics As a Remedial Tool,” American Statistical Association Proceedings of the Section on Statistical Education, 242–246.
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Logical Extremes, Beta, and the Power of the Test

Abstract

1. Introduction

2. Calculation Process

3. The Logical Extremes

4. Conclusions

Acknowledgments

References

Information for

Open access

Opportunities

Help and information

Logical Extremes, Beta, and the Power of the Test

Abstract

1. Introduction

2. Calculation Process

3. The Logical Extremes

4. Conclusions

Acknowledgments

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date