Publication Cover
Statistics
A Journal of Theoretical and Applied Statistics
Volume 54, 2020 - Issue 6
261
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Evidence for goodness of fit in Karl Pearson chi-squared statistics

Pages 1287-1310 | Received 03 Jan 2020, Accepted 21 Nov 2020, Published online: 07 Jan 2021
 

ABSTRACT

Chi-squared tests for lack of fit are traditionally employed to find evidence against a hypothesized model, with the model accepted if the Karl Pearson statistic comparing observed and expected numbers of observations falling within cells is not significantly large. However, if one really wants evidence for goodness of fit, it is better to adopt an equivalence testing approach in which small values of the chi-squared statistic indicate evidence for the desired model. This method requires one to define what is meant by equivalence to the desired model, and guidelines are proposed. It is shown that the evidence for equivalence can distinguish between normal and nearby models, as well between the Poisson and over-dispersed models. Applications to the evaluation of random number generators and to uniformity of the digits of pi are included. Sample sizes required to obtain a desired expected evidence for goodness of fit are also provided.

Acknowledgments

The author thanks Natalie Karavarsamis for helpful comments on an early version of this manuscript and to Yuri Nikolayevsky, Robin Hill and Luke Prendergast for discussions and suggestions on a later draft. In addition, the author is indebted to the referee and Editors for their challenging and probing questions which improved the content and presentation of the text.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

Notes: The sup metric distance M and symmetrized Kullback–Leibler divergence J of p from u is tabled for each model. Also listed is the estimated power Π0.05(pj)=Ppj{Sc}, where c=χ52(0.95) of the level 0.05 χ2 test for non-uniformity.

Notes: The sample sizes are for the maximum relative error k = 1; for smaller k multiply these entries by 1/k2.

Notes: For each sample the evidence for normality was computed using the method of Section 4.1 with r given by (Equation13) and the mean and standard deviation T¯(sT) were recorded and listed in the last three columns above. The maximum expected evidence when the data are normal is m0=λ0+ν/2ν/2, where ν=r3 and λ0=n/{4(r1)}.

Note: The means(standard deviations) of the replicated values are listed.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 844.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.