593
Views
15
CrossRef citations to date
0
Altmetric
METHODS, PLAINLY SPEAKING

A Framework for Reporting and Interpreting Internal Consistency Reliability Estimates

&
Pages 89-103 | Published online: 29 Aug 2019

REFERENCES

  • Allen, M. J., & Yen, W. M. (1979). Introduction to measurement theory. Monterey, CA: Brooks/Cole.
  • Alreck, P. L., & Settle, R. B. (1985). The survey research handbook. Homewood, IL: Irwin.
  • American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (1999). Standards for educational and psychological testing (Rev. ed.). Washington, DC: American Educational Research Association.
  • Barnette, J. J. (2000). Effects of stem and Likert response option reversals on survey internal consistency: If you feel the need, there is a better alternative to using those negatively worded stems. Educational and Psychological Measurement, 60, 361–370.
  • Bird, K. D. (2002). Confidence intervals for effect sizes in analysis of variance. Educational and Psychological Measurement, 62, 197–226.
  • Caruso, J. C. (2000). Reliability generalization of the NEO personality scales. Educational and Psychological Measurement, 60, 236–254.
  • Chandler, R. E. (1957). The statistical concepts of confidence and significance. Psychological Bulletin, 54, 429–430.
  • Crocker, L., & Algina, J. (1986). Introduction to classical and modern test theory. Orlando, FL: Holt, Rinehart & Winston.
  • Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika, 16, 297–334.
  • Cronbach, L. J., Gleser, G. C., & Rajaratnam, N. (1963). Theory of generalizability: A liberalization of reliability theory. British Journal of Mathematical and Statistical Psychology, 16, 137–173.
  • Cumming, G., & Finch, S. (2001). A primer on the understanding, use and calculation of confidence intervals that are based on central and noncentral distributions. Educational and Psychological Measurement, 61, 532–575.
  • Fan, X., & Thompson, B. (2001). Confidence intervals about score reliability coefficients, please: An EPM guidelines editorial. Educational and Psychological Measurement, 61, 517–531.
  • Feldt, L. S., & Brennan, R. L. (1989). Reliability. In R. L. Linn (Ed.), Educational measurement (3rd ed., pp. 105–146). New York: American Council on Education.
  • Fleishman, A. I. (1980). Confidence intervals for correlation ratios. Educational and Psychological Measurement, 40, 659–670.
  • Gay, L. R., & Airasian, P. W. (2000). Educational research: Competencies for analysis and application (6th ed.). Englewood Cliffs, NJ: Prentice Hall.
  • Guilford, J. P. (1954). Psychometric methods (2nd ed.). New York: McGraw-Hill.
  • Gulliksen, H. (1950). Theory of mental tests. New York: Wiley.
  • Henson, R. K. (2001). Understanding internal consistency reliability estimates: A conceptual primer on coefficient alpha. Measurement and Evaluation in Counseling and Development, 34, 177–189.
  • Henson, R. K., Kogan, L. R., & Vacha-Haase, T. (2001). A reliability generalization study of the Teacher Efficacy Scale and related instruments. Educational and Psychological Measurement, 61, 404–420.
  • Hogan, T. P., Benjamin, A., & Brezinski, K. L. (2000). Reliability methods: A note on the frequency of use of various types. Educational and Psychological Methods, 60, 523–531.
  • Hoyt, C. (1941). Test reliability obtained by analysis of variance. Psychometrika, 6, 153–160.
  • Huck, S. W. (2000). Reading statistics and research (3rd ed.). New York: Addison Wesley Longman.
  • Kelley, T. L. (1921). The reliability of test scores. Journal of Educational Research, 3, 370–379.
  • Kuder, G. F., & Richardson, M. W. (1937). The theory of the estimation of test reliability. Psychometrika, 2, 151–160.
  • Johnson, H. G. (1950). Test reliability and correction for attenuation. Psychometrika, 15, 115–119.
  • Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
  • Magnusson, D. (1967). Test theory. Boston: Addison-Wesley.
  • Muchinsky, P. M. (1996). The correction for attenuation. Educational and Psychological Measurement, 56, 63–75.
  • Nunnally, J. C., & Bernstein, I. H. (1994). Psychometric theory (3rd ed.). New York: McGraw-Hill.
  • Onwuegbuzie, A. J., Bailey, P., & Daley, C. E. (2002). The role of foreign language anxiety and students' expectations in foreign language learning. Research in the Schools, 9(1), 33–50.
  • Onwuegbuzie, A. J., & Daniel, L. G. (2000, November). Reliability generalization: The importance of considering sample specificity, confidence intervals, and subgroup differences. Paper presented at the annual meeting of the Mid-South Educational Research Association, Bowling Green, K.Y.
  • Onwuegbuzie, A. J., & Daniel, L. G. (2002). Uses and misuses of the correlation coefficient. Research in the Schools, 9(1), 73–90.
  • Onwuegbuzie, A. J., Daniel, L. G., & Roberts, J. K. (2001, November). A proposed new “what if” reliability analysis for assessing the statistical significance of bivariate relationships. Paper presented at the annual meeting of the Mid-South Educational Research Association, Little Rock, AR.
  • Pedhazur, E. J., & Schmelkin, L. P. (1991). Measurement, design, and analysis: An integrated approach. Hillsdale, NJ: Erlbaum.
  • Roberts, J. K., & Onwuegbuzie, A. J. (in press). Alternative approaches for interpreting alpha with homogeneous subsamples. Research in the Schools.
  • Rulon, P. J. (1939). A simplified procedure for determining the reliability of a test by split-halves. Harvard Educational Review, 9, 99–103.
  • Sawilowsky, S. S. (2000a). Psychometrics vs. datametrics: Comments on Vacha-Haase's “reliability generalization” method and some EPM editorial policies. Educational and Psychological Measurement, 60, 157–173.
  • Sawilowsky, S. S. (2000b). Reliability: Rejoinder to Thompson and Vacha-Haase. Educational and Psychological Measurement, 60, 196–200.
  • Schmidt, F. L., & Hunter, J. E. (1977). Development of a general solution to the problem of validity generalization. Journal of Applied Psychology, 62, 529–540.
  • Simmelink, S., & Vacha-Haase, T. (1999). Reliability generalization with the Rosenberg Self-Esteem Instrument. Paper presented at the annual meeting of the Rocky Mountain Psychological Association, Fort Collins, CO.
  • Spearman, C. (1910). Correlation calculated from faulty data. British Journal of Psychology, 3, 171–295.
  • Steiger, J. H., & Fouladi, R. T. (1992). R2: A computer program for interval estimation, power calculation, and hypothesis testing for the squared multiple correlation. Behavior Research Methods, Instruments, and Computers, 24, 581–582.
  • Steiger, J. H., & Fouladi, R. T. (1997). Non-centrality interval estimation and the evaluation of statistical models. In L. L. Harlow, S. A. Mulaik, & J. H. Steiger (Eds.), What if there were no significance tests? (pp. 221–257). Mahwah, NJ: Erlbaum.
  • Swearingen, D. L. (1999, April). Consequences of the midpoint response choice for survey researchers. Paper presented at the annual meeting of the American Educational Research Association, Montreal, Ontario, Canada.
  • Thompson, B. (2002). What future quantitative social science research could look like: Confidence intervals for effect sizes. Educational Researcher, 31(3), 25–32.
  • Thompson, B., & Vacha-Haase, T. (2000). Psychometrics is datametrics: The test is not reliable. Educational and Psychological Measurement, 60, 174–195.
  • Tremblay, P. F., & Gardner, R. C. (1995). Expanding the motivation construct in language learning. The Modern Language Journal, 79, 505–518.
  • Vacha-Haase, T. (1998). Reliability generalization: Exploring variance in measurement error affecting score reliability across studies. Educational and Psychological Measurement, 58, 6–20.
  • Vacha-Haase, T., Kogan, L. R., & Thompson, B. (2000). Sample compositions and variabilities in published studies versus those in test manuals: Validity of score reliability inductions. Educational and Psychological Measurement, 60, 509–522.
  • Vacha-Haase, T., Ness, C., Nilsson, J., & Reetz, D. (1999). Practices regarding reporting of reliability coefficients. A review of three journals. The Journal of Experimental Education, 67, 335–341.
  • Viswesvaran, C., & Ones, D. S. (2000). Measurement error in “Big Five Factors” personality assessment: Reliability generalization across studies and measures. Educational and Psychological Measurement, 60, 224–235.
  • Weems, G. H., & Onwuegbuzie, A. J. (2000, November). Characteristics of item respondents who frequently utilize midpoint response categories on rating scales. Paper presented at the annual meeting of the Mid-South Educational Research Association, Bowling Green, KY.
  • Weems, G. H., & Onwuegbuzie, A. J. (2001). The impact of midpoint responses and reverse coding on survey data. Measurement and Evaluation in Counseling and Development, 34, 166–176.
  • Wilkinson, L., & American Psychological Association Task Force on Statistical Inference. (1999). Statistical methods in psychology journals: Guidelines and explanations. American Psychologist, 54, 594–604. (Reprint available through the APA Home Page: http://www.apa.org/journals/amp/amp548594.html)
  • Yin, P., & Fan, X. (2000). Assessing the reliability of Beck Depression Inventory scores: Reliability generalization across studies. Educational and Psychological Measurement, 60, 201–223.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.