Measurement and Evaluation in Counseling and Development Volume 35, 2002 - Issue 2: Validity and Reliability Issues

METHODS, PLAINLY SPEAKING

A Framework for Reporting and Interpreting Internal Consistency Reliability Estimates

Anthony J. Onwuegbuzie, Department of Human Development and Psychoeducational Studies, Howard University.

Larry G. Daniel, College of Education and Human Services, and Division of Educational Services and Research, University of North Florida.

Pages 89-103 | Published online: 29 Aug 2019

https://doi.org/10.1080/07481756.2002.12069052

REFERENCES

Allen, M. J., & Yen, W. M. (1979). Introduction to measurement theory. Monterey, CA: Brooks/Cole.
Alreck, P. L., & Settle, R. B. (1985). The survey research handbook. Homewood, IL: Irwin.
American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (1999). Standards for educational and psychological testing (Rev. ed.). Washington, DC: American Educational Research Association.
Barnette, J. J. (2000). Effects of stem and Likert response option reversals on survey internal consistency: If you feel the need, there is a better alternative to using those negatively worded stems. Educational and Psychological Measurement, 60, 361–370.
Bird, K. D. (2002). Confidence intervals for effect sizes in analysis of variance. Educational and Psychological Measurement, 62, 197–226.
Caruso, J. C. (2000). Reliability generalization of the NEO personality scales. Educational and Psychological Measurement, 60, 236–254.
Chandler, R. E. (1957). The statistical concepts of confidence and significance. Psychological Bulletin, 54, 429–430.
Crocker, L., & Algina, J. (1986). Introduction to classical and modern test theory. Orlando, FL: Holt, Rinehart & Winston.
Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika, 16, 297–334.
Cronbach, L. J., Gleser, G. C., & Rajaratnam, N. (1963). Theory of generalizability: A liberalization of reliability theory. British Journal of Mathematical and Statistical Psychology, 16, 137–173.
Cumming, G., & Finch, S. (2001). A primer on the understanding, use and calculation of confidence intervals that are based on central and noncentral distributions. Educational and Psychological Measurement, 61, 532–575.
Fan, X., & Thompson, B. (2001). Confidence intervals about score reliability coefficients, please: An EPM guidelines editorial. Educational and Psychological Measurement, 61, 517–531.
Feldt, L. S., & Brennan, R. L. (1989). Reliability. In R. L. Linn (Ed.), Educational measurement (3rd ed., pp. 105–146). New York: American Council on Education.
Fleishman, A. I. (1980). Confidence intervals for correlation ratios. Educational and Psychological Measurement, 40, 659–670.
Gay, L. R., & Airasian, P. W. (2000). Educational research: Competencies for analysis and application (6th ed.). Englewood Cliffs, NJ: Prentice Hall.
Guilford, J. P. (1954). Psychometric methods (2nd ed.). New York: McGraw-Hill.
Gulliksen, H. (1950). Theory of mental tests. New York: Wiley.
Henson, R. K. (2001). Understanding internal consistency reliability estimates: A conceptual primer on coefficient alpha. Measurement and Evaluation in Counseling and Development, 34, 177–189.
Henson, R. K., Kogan, L. R., & Vacha-Haase, T. (2001). A reliability generalization study of the Teacher Efficacy Scale and related instruments. Educational and Psychological Measurement, 61, 404–420.
Hogan, T. P., Benjamin, A., & Brezinski, K. L. (2000). Reliability methods: A note on the frequency of use of various types. Educational and Psychological Measurement, 60, 523–531.
Hoyt, C. (1941). Test reliability obtained by analysis of variance. Psychometrika, 6, 153–160.
Huck, S. W. (2000). Reading statistics and research (3rd ed.). New York: Addison Wesley Longman.
Kelley, T. L. (1921). The reliability of test scores. Journal of Educational Research, 3, 370–379.
Kuder, G. F., & Richardson, M. W. (1937). The theory of the estimation of test reliability. Psychometrika, 2, 151–160.
Johnson, H. G. (1950). Test reliability and correction for attenuation. Psychometrika, 15, 115–119.
Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
Magnusson, D. (1967). Test theory. Boston: Addison-Wesley.
Muchinsky, P. M. (1996). The correction for attenuation. Educational and Psychological Measurement, 56, 63–75.
Nunnally, J. C., & Bernstein, I. H. (1994). Psychometric theory (3rd ed.). New York: McGraw-Hill.
Onwuegbuzie, A. J., Bailey, P., & Daley, C. E. (2002). The role of foreign language anxiety and students' expectations in foreign language learning. Research in the Schools, 9(1), 33–50.
Onwuegbuzie, A. J., & Daniel, L. G. (2000, November). Reliability generalization: The importance of considering sample specificity, confidence intervals, and subgroup differences. Paper presented at the annual meeting of the Mid-South Educational Research Association, Bowling Green, KY.
Onwuegbuzie, A. J., & Daniel, L. G. (2002). Uses and misuses of the correlation coefficient. Research in the Schools, 9(1), 73–90.
Onwuegbuzie, A. J., Daniel, L. G., & Roberts, J. K. (2001, November). A proposed new “what if” reliability analysis for assessing the statistical significance of bivariate relationships. Paper presented at the annual meeting of the Mid-South Educational Research Association, Little Rock, AR.
Pedhazur, E. J., & Schmelkin, L. P. (1991). Measurement, design, and analysis: An integrated approach. Hillsdale, NJ: Erlbaum.
Roberts, J. K., & Onwuegbuzie, A. J. (in press). Alternative approaches for interpreting alpha with homogeneous subsamples. Research in the Schools.
Rulon, P. J. (1939). A simplified procedure for determining the reliability of a test by split-halves. Harvard Educational Review, 9, 99–103.
Sawilowsky, S. S. (2000a). Psychometrics vs. datametrics: Comments on Vacha-Haase's “reliability generalization” method and some EPM editorial policies. Educational and Psychological Measurement, 60, 157–173.
Sawilowsky, S. S. (2000b). Reliability: Rejoinder to Thompson and Vacha-Haase. Educational and Psychological Measurement, 60, 196–200.
Schmidt, F. L., & Hunter, J. E. (1977). Development of a general solution to the problem of validity generalization. Journal of Applied Psychology, 62, 529–540.
Simmelink, S., & Vacha-Haase, T. (1999). Reliability generalization with the Rosenberg Self-Esteem Instrument. Paper presented at the annual meeting of the Rocky Mountain Psychological Association, Fort Collins, CO.
Spearman, C. (1910). Correlation calculated from faulty data. British Journal of Psychology, 3, 271–295.
Steiger, J. H., & Fouladi, R. T. (1992). R2: A computer program for interval estimation, power calculation, and hypothesis testing for the squared multiple correlation. Behavior Research Methods, Instruments, and Computers, 24, 581–582.
Steiger, J. H., & Fouladi, R. T. (1997). Non-centrality interval estimation and the evaluation of statistical models. In L. L. Harlow, S. A. Mulaik, & J. H. Steiger (Eds.), What if there were no significance tests? (pp. 221–257). Mahwah, NJ: Erlbaum.
Swearingen, D. L. (1999, April). Consequences of the midpoint response choice for survey researchers. Paper presented at the annual meeting of the American Educational Research Association, Montreal, Quebec, Canada.
Thompson, B. (2002). What future quantitative social science research could look like: Confidence intervals for effect sizes. Educational Researcher, 31(3), 25–32.
Thompson, B., & Vacha-Haase, T. (2000). Psychometrics is datametrics: The test is not reliable. Educational and Psychological Measurement, 60, 174–195.
Tremblay, P. F., & Gardner, R. C. (1995). Expanding the motivation construct in language learning. The Modern Language Journal, 79, 505–518.
Vacha-Haase, T. (1998). Reliability generalization: Exploring variance in measurement error affecting score reliability across studies. Educational and Psychological Measurement, 58, 6–20.
Vacha-Haase, T., Kogan, L. R., & Thompson, B. (2000). Sample compositions and variabilities in published studies versus those in test manuals: Validity of score reliability inductions. Educational and Psychological Measurement, 60, 509–522.
Vacha-Haase, T., Ness, C., Nilsson, J., & Reetz, D. (1999). Practices regarding reporting of reliability coefficients: A review of three journals. The Journal of Experimental Education, 67, 335–341.
Viswesvaran, C., & Ones, D. S. (2000). Measurement error in “Big Five Factors” personality assessment: Reliability generalization across studies and measures. Educational and Psychological Measurement, 60, 224–235.
Weems, G. H., & Onwuegbuzie, A. J. (2000, November). Characteristics of item respondents who frequently utilize midpoint response categories on rating scales. Paper presented at the annual meeting of the Mid-South Educational Research Association, Bowling Green, KY.
Weems, G. H., & Onwuegbuzie, A. J. (2001). The impact of midpoint responses and reverse coding on survey data. Measurement and Evaluation in Counseling and Development, 34, 166–176.
Wilkinson, L., & American Psychological Association Task Force on Statistical Inference. (1999). Statistical methods in psychology journals: Guidelines and explanations. American Psychologist, 54, 594–604. (Reprint available through the APA Home Page: http://www.apa.org/journals/amp/amp548594.html)
Yin, P., & Fan, X. (2000). Assessing the reliability of Beck Depression Inventory scores: Reliability generalization across studies. Educational and Psychological Measurement, 60, 201–223.