Educational Research and Evaluation
An International Journal on Theory and Practice
Volume 21, 2015 - Issue 3
Articles

Dimensions of test performance in English as a foreign language in different European settings: a two-level confirmatory factor analytical approach

Pages 188-208 | Received 17 Feb 2014, Accepted 14 Jan 2015, Published online: 20 Mar 2015

References

  • Åberg-Bengtsson, L. (2004). Do small rural schools differ? A comparative two-level model of reading achievement among Swedish 9-year-olds. Scandinavian Journal of Educational Research, 48, 19–33. doi: 10.1080/0031383032000149823
  • Åberg-Bengtsson, L., & Erickson, G. (2006). Dimensions of national test performance in language and mathematics: A two-level approach. Educational Research and Evaluation, 12, 469–488. doi: 10.1080/13803610600808115
  • American Educational Research Association. (2014). Standards for educational and psychological testing. Washington, DC: Author.
  • American Psychological Association. (1999). Standards for educational and psychological testing. Washington, DC: Author.
  • Bachman, L. F. (2000). Modern language testing at the turn of the century: Assuring that what we count counts. Language Testing, 17, 1–42.
  • Bachman, L. F. (2005). Building and supporting a case for test use. Language Assessment Quarterly, 2, 1–34. doi: 10.1207/s15434311laq0201_1
  • Bachman, L. F., & Palmer, A. (2010). Language assessment in practice. Oxford, UK: Oxford University Press.
  • Bae, J., & Bachman, L. F. (1998). A latent variable approach to listening and reading: Testing factorial invariance across two groups of children in the Korean/English Two-Way Immersion Program. Language Testing, 15, 380–414.
  • Balke, G. (1995, April). Decomposing of reading comprehension: Analysis of the IEA literacy tests. Paper presented at the Annual Meeting of the American Educational Research Association, San Francisco, CA.
  • Bonnet, G. (Ed.). (2004). The assessment of pupils’ skills in English in eight European countries 2002. Retrieved from http://www.eva.dk/projekter/2002/evaluering-af-faget-engelsk-i-grundskolen/projektprodukter/assessmentofenglish.pdf
  • Bourdieu, P. (1991). Language and symbolic power. Cambridge, MA: Harvard University Press.
  • Broadfoot, P. (1996). Education, assessment and society: A sociological analysis. Buckingham, UK: Open University Press.
  • Browne, M., & Cudeck, R. (1993). Alternative ways of assessing model fit. In K. A. Bollen & J. S. Long (Eds.), Testing structural equation models (pp. 136–162). Thousand Oaks, CA: Sage.
  • Carr, N. T. (2006). The factor structure of test task characteristics and examinee performance. Language Testing, 23, 269–289. doi: 10.1191/0265532206lt328oa
  • Carroll, J. B. (1993). Human cognitive abilities: A survey of factor-analytic studies. New York, NY: Cambridge University Press.
  • Cheung, G. W., & Rensvold, R. B. (2002). Evaluating goodness-of-fit indexes for testing measurement invariance. Structural Equation Modeling, 9, 233–255. doi: 10.1207/S15328007SEM0902_5
  • Chudgar, A., & Luschei, T. L. (2009). National income, income inequality, and the importance of schools: A hierarchical cross-national comparison. American Educational Research Journal, 46, 626–658. doi: 10.3102/0002831209340043
  • Council of Europe. (2001). Common European framework of reference for languages: Learning, teaching, assessment. Cambridge, UK: Cambridge University Press.
  • European Commission. (2012). First European Survey on Language Competences: Final report. Retrieved from http://ec.europa.eu/languages/policy/strategic-framework/documents/language-survey-final-report_en.pdf
  • Foucault, M. (1979). Discipline and punish. New York, NY: Vintage Books.
  • Gipps, C. V. (1994). Beyond testing: Towards a theory of educational assessment. London, UK: The Falmer Press.
  • Gustafsson, J.-E. (1988). Hierarchical models of individual differences and cognitive abilities. In R. J. Sternberg (Ed.), Advances in the psychology of human intelligence (Vol. 4, pp. 35–71). Hillsdale, NJ: Lawrence Erlbaum.
  • Gustafsson, J.-E. (1997). Measurement characteristics of the IEA Reading Literacy Scales for 9- and 10-year-olds at country and individual levels. Journal of Educational Measurement, 34, 233–251. doi: 10.1111/j.1745-3984.1997.tb00517.x
  • Gustafsson, J.-E. (2001). On the hierarchical structure of intelligence and personality. In J. M. Collis & S. J. Messick (Eds.), Intelligence and personality: Bridging the gap in theory and measurement (pp. 25–42). Hillsdale, NJ: Erlbaum.
  • Gustafsson, J.-E., & Rosén, M. (2006). The dimensional structure of reading assessment tasks in the IEA Reading Literacy Study 1991 and the Progress in International Reading Literacy Study 2001. Educational Research and Evaluation, 12, 445–468. doi: 10.1080/13803610600697179
  • Gustafsson, J.-E., & Stahl, P. A. (2005). STREAMS user's guide: Version 3.0 for Windows. Mölndal, Sweden: MultivariateWare.
  • Horn, J. L., & Cattell, R. B. (1966). Refinement and test of the theory of fluid and crystallized general intelligences. Journal of Educational Psychology, 57, 253–270. doi: 10.1037/h0023816
  • Hox, J. J. (2010). Multilevel analysis: Techniques and applications (2nd ed.). Mahwah, NJ: Erlbaum.
  • In'nami, Y., & Koizumi, R. (2011). Factor structure of a revised TOEIC® test: A multiple-sample analysis. Language Testing, 29, 131–152. doi: 10.1177/0265532211413444
  • Jöreskog, K. G. (1993). Testing structural equation models. In K. A. Bollen & J. S. Long (Eds.), Testing structural equation models (pp. 294–317). Newbury Park, CA: Sage.
  • Kane, M. T. (2006). Validation. In R. L. Brennan (Ed.), Educational measurement (pp. 17–64). Westport, CT: Praeger.
  • Krashen, S. D. (1981). Second language acquisition and second language learning. Oxford, UK: Pergamon.
  • Lundberg, I., & Rosén, M. (1995, April). Two-level structural modeling of reading achievement as a basis for evaluating teaching effects. Paper presented at the Annual Meeting of the American Educational Research Association, San Francisco, CA.
  • Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational measurement (3rd ed., pp. 13–103). New York, NY: Macmillan.
  • Messick, S. (1996). Validity and washback in language testing. Language Testing, 13, 241–256. doi: 10.1177/026553229601300302
  • Muthén, B. O. (1994). Multilevel covariance structure analysis. Sociological Methods & Research, 22, 376–398. doi: 10.1177/0049124194022003006
  • Muthén, B. O., & Muthén, L. (2006). Mplus user's guide (Version 4). Los Angeles, CA: Authors.
  • Rosén, M. (1995, April). Gender differences in reading performance on documents across countries. Paper presented at the Annual Meeting of the American Educational Research Association, San Francisco, CA.
  • Rosén, M. (1997, March). Country differences in reading performance: A reanalysis of the IEA reading literacy study. Paper presented at the Annual Meeting of the American Educational Research Association, Chicago, IL.
  • Rosén, M. (2001). Gender differences in reading performance on documents across countries. Reading & Writing, 14, 1–38. doi: 10.1023/A:1007995107442
  • Sawaki, Y., Stricker, L. J., & Oranje, A. H. (2009). Factor structure of the TOEFL Internet-based test. Language Testing, 26, 5–30. doi: 10.1177/0265532208097335
  • Shohamy, E. (2001). The power of tests: A critical perspective on the uses of language tests. Harlow, UK: Pearson Education/Longman.
  • Song, M.-Y. (2008). Do divisible subskills exist in second language (L2) comprehension? A structural equation modeling approach. Language Testing, 25, 435–464. doi: 10.1177/0265532208094272
  • Skolverket/Erickson, G. (2004). Engelska i åtta europeiska länder: En undersökning av ungdomars kunskaper och uppfattningar [English in eight European countries]. Stockholm, Sweden: Skolverket/Fritzes.
  • Stake, R. E. (1967). The countenance of educational evaluation. Teachers College Record, 68, 523–540.
  • Vandenberg, R. J., & Lance, C. E. (2000). A review and synthesis of the measurement invariance literature: Suggestions, practices, and recommendations for organizational research. Organizational Research Methods, 3, 4–70. doi: 10.1177/109442810031002
  • Yang, Y. (2003). Measuring socioeconomic status and its effects at individual and collective levels: A cross-country comparison (Göteborg Studies in Educational Sciences, 193). Göteborg, Sweden: Acta Universitatis Gothoburgensis.
