REFERENCES
- Alderson , J. C. 2005 . Diagnosing foreign language proficiency: The interface between learning and assessment , New York , NY : Continuum .
- Attali , Y. 2007 . Construct validity of e-rater® in scoring TOEFL® essays , Princeton , NJ : ETS .
- Attali , Y. and Burstein , J. 2006 . Automated essay scoring with E-rater V. 2.0 . The Journal of Technology, Learning, and Assessment , 4 : 13 – 18 .
- Bagozzi , R. P. and Yi , Y. 1988 . On the evaluation of structural equation models . Journal of the Academy of Marketing Science , 16 : 74 – 94 .
- Bentler , P. M. 1985–2007 . EQS 6.1 for Windows (Build 94) [Computer software] , Encino , CA : Multivariate Software, Inc. .
- Brown , T. A. 2006 . Confirmatory factor analysis for applied research , New York , NY : Guilford .
- Chapelle , C. , Grabe , W. and Berns , M. 2000 . Communicative language proficiency: Definition and implications for TOEFL 2000 (TOEFL Monograph Series MS-10) , Princeton , NJ : ETS .
- Cumming , A. , Kantor , R. , Baba , K. , Eouanzoui , K. , Erdosy , U. and James , M. 2006 . Analysis of discourse features and verification of scoring levels for independent and integrated prototype written tasks for the new TOEFL® test. (TOEFL Monograph Report No. MS-30) , Princeton , NJ : ETS .
- Enright , M. and Quinlan , T. 2010 . “ Complementing human judgment of essays written by English language learners with E-rater® scoring ” . In Language Testing Vol. 27 , 317 – 334 .
- ETS . 2007 . “ Test and score data summary for TOEFL Internet-based test: September 2005 ” . In December 2006 test data , Princeton , NJ : Author .
- ETS . 2011 . Reliability and comparability of TOEFL iBT™ scores. (TOEFL iBT Research Insight, Series 1 , Vol. 3 , Princeton , NJ : Author .
- Kline , R. B. 1998 . Principles and practice of structural equation modeling , New York , NY : Guilford .
- Knoch , U. 2009 . Diagnostic assessment of writing: A comparison of two rating scales . Language Testing , 26 ( 2 ) : 275 – 304 .
- Kondo-Brown , K. 2002 . “ A FACETS analysis of rater bias in measuring Japanese second language writing performance ” . In Language Testing Vol. 19 , 3 – 31 .
- Kunnan , A. J. and Jang , E. E. 2009 . “ Diagnostic feedback in language assessment ” . In Handbook of second and foreign language teaching , Edited by: Long , M. and Doughty , C. 610 – 626 . Walden , MA : Wiley-Blackwell .
- Lee , Y.-W. , Gentile , C. and Kantor , R. 2008 . Analytic scoring of TOEFL® CBT essays: Scores from humans and e-rater® (TOEFL Research Rep. No. 81) , Princeton , NJ : ETS .
- Luecht , R. M. , Gierl , M. J. , Tan , X. and Huff , K. April 2006 . Scalability and the development of useful diagnostic scales , April , San Francisco , CA : Paper presented at the annual meeting of the National Council on Measurement in Education .
- McNamara , T. 1990 . “ Item response theory and the validation of an ESP test for health professionals ” . In Language Testing Vol. 7 , 52 – 75 .
- Quinlan , T. , Higgins , D. and Wolff , S. 2009 . Evaluating the construct coverage of the e-rater scoring engine (RR-09-01) , Princeton , NJ : ETS .
- Sasaki , M. 1996 . Second language proficiency, foreign language aptitude, and intelligence: Quantitative and qualitative analyses , New York , NY : Lang .
- Satorra , A. 1990 . Robustness issues in structural equation modeling: A review of recent developments . Quality & Quantity , 24 : 367 – 386 .
- Satorra , A. and Bentler , P. 1999 . A scaled difference chi-square test statistic for moment structure analysis (UCLA Statistics Series, No. 260) , Los Angeles : University of California .
- Sawaki , Y. , Stricker , L. and Oranje , A. 2009 . “ Factor structure of the TOEFL Internet-based Test ” . In Language Testing, Vol. 26 , 5 – 30 .
- Shanahan , T. 1984 . “ Nature of the reading-writing relation: An exploratory multivariate analysis ” . In Journal of Educational Psychology Vol. 76 , 466 – 477 .
- Shanahan , T. and Lomax , R. G. 1986 . “ An analysis and comparison of theoretical models of the reading-writing relationship ” . In Journal of Educational Psychology Vol. 78 , 116 – 123 .
- Shanahan , T. and Lomax , R. 1988 . “ A developmental comparison of three theoretical models of the reading-writing relationship ” . In Research in the Teaching of English Vol. 22 , 196 – 212 .
- Shin , S.-K. 2005 . “ Did they take the same test? Examinee language proficiency and the structure of language tests ” . In Language Testing Vol. 22 , 31 – 57 .
- Stricker , L. and Rock , D. 2008 . Factor structure of TOEFL Internet-based test across subgroups. (TOEFL iBT Research Rep. TOEFLiBT-07) , Princeton , NJ : ETS .
- Urquhart , S. and Weir , C. J. 1998 . Reading in a second language: Process, product and practice , London , UK : Longman .
- Van Dijk , T. A. and Kintsch , W. 1983 . Strategies of discourse comprehension , New York , NY : Academic Press .
- Weigle , S. C. 1998 . “ Using FACETS to model rater training effects ” . In Language Testing Vol. 15 , 263 – 287 .
- Weir , C. J. 1990 . Communicative language testing , 2nd , London , UK : Prentice Hall .