REFERENCES
- Allen, M. J., & Yen, W. M. (1979). Introduction to measurement theory. Monterey, CA: Brooks/Cole.
- Alreck, P. L., & Settle, R. B. (1985). The survey research handbook. Homewood, IL: Irwin.
- American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (1999). Standards for educational and psychological testing (Rev. ed.). Washington, DC: American Educational Research Association.
- Barnette, J. J. (2000). Effects of stem and Likert response option reversals on survey internal consistency: If you feel the need, there is a better alternative to using those negatively worded stems. Educational and Psychological Measurement, 60, 361–370.
- Bird, K. D. (2002). Confidence intervals for effect sizes in analysis of variance. Educational and Psychological Measurement, 62, 197–226.
- Caruso, J. C. (2000). Reliability generalization of the NEO personality scales. Educational and Psychological Measurement, 60, 236–254.
- Chandler, R. E. (1957). The statistical concepts of confidence and significance. Psychological Bulletin, 54, 429–430.
- Crocker, L., & Algina, J. (1986). Introduction to classical and modern test theory. Orlando, FL: Holt, Rinehart & Winston.
- Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika, 16, 297–334.
- Cronbach, L. J., Gleser, G. C., & Rajaratnam, N. (1963). Theory of generalizability: A liberalization of reliability theory. British Journal of Mathematical and Statistical Psychology, 16, 137–173.
- Cumming, G., & Finch, S. (2001). A primer on the understanding, use and calculation of confidence intervals that are based on central and noncentral distributions. Educational and Psychological Measurement, 61, 532–575.
- Fan, X., & Thompson, B. (2001). Confidence intervals about score reliability coefficients, please: An EPM guidelines editorial. Educational and Psychological Measurement, 61, 517–531.
- Feldt, L. S., & Brennan, R. L. (1989). Reliability. In R. L. Linn (Ed.), Educational measurement (3rd ed., pp. 105–146). New York: American Council on Education.
- Fleishman, A. I. (1980). Confidence intervals for correlation ratios. Educational and Psychological Measurement, 40, 659–670.
- Gay, L. R., & Airasian, P. W. (2000). Educational research: Competencies for analysis and application (6th ed.). Englewood Cliffs, NJ: Prentice Hall.
- Guilford, J. P. (1954). Psychometric methods (2nd ed.). New York: McGraw-Hill.
- Gulliksen, H. (1950). Theory of mental tests. New York: Wiley.
- Henson, R. K. (2001). Understanding internal consistency reliability estimates: A conceptual primer on coefficient alpha. Measurement and Evaluation in Counseling and Development, 34, 177–189.
- Henson, R. K., Kogan, L. R., & Vacha-Haase, T. (2001). A reliability generalization study of the Teacher Efficacy Scale and related instruments. Educational and Psychological Measurement, 61, 404–420.
- Hogan, T. P., Benjamin, A., & Brezinski, K. L. (2000). Reliability methods: A note on the frequency of use of various types. Educational and Psychological Methods, 60, 523–531.
- Hoyt, C. (1941). Test reliability obtained by analysis of variance. Psychometrika, 6, 153–160.
- Huck, S. W. (2000). Reading statistics and research (3rd ed.). New York: Addison Wesley Longman.
- Kelley, T. L. (1921). The reliability of test scores. Journal of Educational Research, 3, 370–379.
- Kuder, G. F., & Richardson, M. W. (1937). The theory of the estimation of test reliability. Psychometrika, 2, 151–160.
- Johnson, H. G. (1950). Test reliability and correction for attenuation. Psychometrika, 15, 115–119.
- Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
- Magnusson, D. (1967). Test theory. Boston: Addison-Wesley.
- Muchinsky, P. M. (1996). The correction for attenuation. Educational and Psychological Measurement, 56, 63–75.
- Nunnally, J. C., & Bernstein, I. H. (1994). Psychometric theory (3rd ed.). New York: McGraw-Hill.
- Onwuegbuzie, A. J., Bailey, P., & Daley, C. E. (2002). The role of foreign language anxiety and students' expectations in foreign language learning. Research in the Schools, 9(1), 33–50.
- Onwuegbuzie, A. J., & Daniel, L. G. (2000, November). Reliability generalization: The importance of considering sample specificity, confidence intervals, and subgroup differences. Paper presented at the annual meeting of the Mid-South Educational Research Association, Bowling Green, K.Y.
- Onwuegbuzie, A. J., & Daniel, L. G. (2002). Uses and misuses of the correlation coefficient. Research in the Schools, 9(1), 73–90.
- Onwuegbuzie, A. J., Daniel, L. G., & Roberts, J. K. (2001, November). A proposed new “what if” reliability analysis for assessing the statistical significance of bivariate relationships. Paper presented at the annual meeting of the Mid-South Educational Research Association, Little Rock, AR.
- Pedhazur, E. J., & Schmelkin, L. P. (1991). Measurement, design, and analysis: An integrated approach. Hillsdale, NJ: Erlbaum.
- Roberts, J. K., & Onwuegbuzie, A. J. (in press). Alternative approaches for interpreting alpha with homogeneous subsamples. Research in the Schools.
- Rulon, P. J. (1939). A simplified procedure for determining the reliability of a test by split-halves. Harvard Educational Review, 9, 99–103.
- Sawilowsky, S. S. (2000a). Psychometrics vs. datametrics: Comments on Vacha-Haase's “reliability generalization” method and some EPM editorial policies. Educational and Psychological Measurement, 60, 157–173.
- Sawilowsky, S. S. (2000b). Reliability: Rejoinder to Thompson and Vacha-Haase. Educational and Psychological Measurement, 60, 196–200.
- Schmidt, F. L., & Hunter, J. E. (1977). Development of a general solution to the problem of validity generalization. Journal of Applied Psychology, 62, 529–540.
- Simmelink, S., & Vacha-Haase, T. (1999). Reliability generalization with the Rosenberg Self-Esteem Instrument. Paper presented at the annual meeting of the Rocky Mountain Psychological Association, Fort Collins, CO.
- Spearman, C. (1910). Correlation calculated from faulty data. British Journal of Psychology, 3, 171–295.
- Steiger, J. H., & Fouladi, R. T. (1992). R2: A computer program for interval estimation, power calculation, and hypothesis testing for the squared multiple correlation. Behavior Research Methods, Instruments, and Computers, 24, 581–582.
- Steiger, J. H., & Fouladi, R. T. (1997). Non-centrality interval estimation and the evaluation of statistical models. In L. L. Harlow, S. A. Mulaik, & J. H. Steiger (Eds.), What if there were no significance tests? (pp. 221–257). Mahwah, NJ: Erlbaum.
- Swearingen, D. L. (1999, April). Consequences of the midpoint response choice for survey researchers. Paper presented at the annual meeting of the American Educational Research Association, Montreal, Ontario, Canada.
- Thompson, B. (2002). What future quantitative social science research could look like: Confidence intervals for effect sizes. Educational Researcher, 31(3), 25–32.
- Thompson, B., & Vacha-Haase, T. (2000). Psychometrics is datametrics: The test is not reliable. Educational and Psychological Measurement, 60, 174–195.
- Tremblay, P. F., & Gardner, R. C. (1995). Expanding the motivation construct in language learning. The Modern Language Journal, 79, 505–518.
- Vacha-Haase, T. (1998). Reliability generalization: Exploring variance in measurement error affecting score reliability across studies. Educational and Psychological Measurement, 58, 6–20.
- Vacha-Haase, T., Kogan, L. R., & Thompson, B. (2000). Sample compositions and variabilities in published studies versus those in test manuals: Validity of score reliability inductions. Educational and Psychological Measurement, 60, 509–522.
- Vacha-Haase, T., Ness, C., Nilsson, J., & Reetz, D. (1999). Practices regarding reporting of reliability coefficients. A review of three journals. The Journal of Experimental Education, 67, 335–341.
- Viswesvaran, C., & Ones, D. S. (2000). Measurement error in “Big Five Factors” personality assessment: Reliability generalization across studies and measures. Educational and Psychological Measurement, 60, 224–235.
- Weems, G. H., & Onwuegbuzie, A. J. (2000, November). Characteristics of item respondents who frequently utilize midpoint response categories on rating scales. Paper presented at the annual meeting of the Mid-South Educational Research Association, Bowling Green, KY.
- Weems, G. H., & Onwuegbuzie, A. J. (2001). The impact of midpoint responses and reverse coding on survey data. Measurement and Evaluation in Counseling and Development, 34, 166–176.
- Wilkinson, L., & American Psychological Association Task Force on Statistical Inference. (1999). Statistical methods in psychology journals: Guidelines and explanations. American Psychologist, 54, 594–604. (Reprint available through the APA Home Page: http://www.apa.org/journals/amp/amp548594.html)
- Yin, P., & Fan, X. (2000). Assessing the reliability of Beck Depression Inventory scores: Reliability generalization across studies. Educational and Psychological Measurement, 60, 201–223.