REFERENCES
- Attali, Y. and Burstein, J. 2005. Automated essay scoring with e-rater® v.2.0. ETS Research Report No. RR-04-45. Princeton, NJ: Educational Testing Service.
- Bachman, L. F. 1990. Fundamental considerations in language testing, Oxford, UK: Oxford University Press.
- Bachman, L. F. Validity issues in a web-based language assessment system (WebLAS). Colloquium paper presented at the American Association for Applied Linguistics Conference. March, Arlington, VA.
- Bachman, L. F. and Palmer, A. S. 1996. Language testing in practice, Oxford, UK: Oxford University Press.
- Bennett, R. E. and Bejar, I. I. 1998. Validity and automated scoring: It's not only the scoring. Educational Measurement: Issues and Practice, 17: 9–17.
- Bennett, R. E., Steffen, M., Singley, M. K., Morley, M. and Jacquemin, D. 1997. Evaluating an automatically scorable, open-ended response type for measuring mathematical reasoning in computer-adaptive tests. Journal of Educational Measurement, 34: 162–176.
- Burstein, J. C., Kukich, K., Wolff, S., Lu, C., & Chodorow, M. (1998, April). Computer analysis of essays. Paper presented at the annual meeting of the National Council on Measurement in Education, San Diego, CA. http://www.ets.org/Media/Research/pdf/erater_ncmefinal.pdf (Accessed: 6 November 2007).
- Clauser, B. E., Harik, P. and Clyman, S. G. 2000. The generalizability of scores for a performance assessment scored with a computer-automated scoring system. Journal of Educational Measurement, 37: 245–261.
- Clauser, B. E., Kane, M. T. and Swanson, D. B. 2002. Validity issues for performance-based tests scored with computer-automated scoring systems. Applied Measurement in Education, 15: 413–432.
- Clauser, B. E., Margolis, M. J., Clyman, S. G. and Ross, L. P. 1997. Development of automated scoring algorithms for complex performance assessments: A comparison of two approaches. Journal of Educational Measurement, 34: 141–161.
- Clauser, B. E., Ross, L. P., Clyman, S. G., Rose, K. M., Margolis, M. J. and Nungester, R. J. 1997. Development of a scoring algorithm to replace a complex performance-based assessment. Applied Measurement in Education, 10: 345–358.
- Clauser, B. E., Subhiyah, R. G., Nungester, R. J., Ripkey, D. R., Clyman, S. G. and McKinley, D. 1995. Scoring a performance-based assessment by modeling the judgments of experts. Journal of Educational Measurement, 32: 397–415.
- Clauser, B. E., Swanson, D. B. and Clyman, S. G. 2000. A comparison of the generalizability of scores produced by expert raters and automated scoring systems. Applied Measurement in Education, 12: 281–299.
- Educational Testing Service. (2005). Criterion details: English language learning: Frequently asked questions about Criterion. http://www.ets.org/portal/site/ets/menuitem.1488512ecfd5b8849a77b13bc3921509/?vgnextoid=e9872d3631df4010VgnVCM10000022f95190RCRD&vgnextchannel=547f253b164f4010VgnVCM10000022f95190RCRD (Accessed: 22 September 2007).
- Fitzpatrick, S., & Triscari, R. (2005, April). Comparability studies of the Virginia computer-delivered tests. Paper presented at the annual meeting of the American Educational Research Association, Montreal, Canada. http://www.doe.virginia.gov/VDOE/Assessment/Compstudy_AERA2005.pdf (Accessed: 6 April 2008).
- Higgins, D., Burstein, J., Marcu, D., & Gentile, C. (2004). Evaluating multiple aspects of coherence in student essays. In Proceedings of the Annual Meeting of HLT/NAACL, Boston, MA. http://ftp.ets.org/pub/res/erater_higgins_dis_coh.pdf (Accessed: 19 October 2007).
- Hirschman, L., Breck, E., Light, M., Burger, J. D. and Ferro, L. 2000. Automated grading of short-answer tests. IEEE Intelligent Systems, 15(5): 31–35.
- Leacock, C. (2004). Scoring free-responses automatically: A case study of a large-scale assessment. Examens, 1(3). English version: http://www.ets.org/Media/Research/pdf/erater_examens_leacock.pdf (Accessed: 12 April 2008).
- Leacock, C. and Chodorow, M. 2003. C-rater: Automated scoring of short answer questions. Computers and the Humanities, 37: 389–405.
- Nichols, P. (2005). Evidence for the interpretation and use of scores from an automated essay scorer (Pearson Educational Measurement Research Rep. 0502). http://www.pearsonedmeasurement.com/research/research.htm (Accessed: 6 April 2008).
- Powers, D. E., Burstein, J. C., Chodorow, M., Fowles, M. E. and Kukich, K. 2001. Stumping E-Rater: Challenging the validity of automated essay scoring. ETS Research Rep. 01–03. Princeton, NJ: Educational Testing Service.
- Shaw, S. D. 2004. Automated writing assessment: A review of four conceptual models. Research Notes, 17: 13–18.
- UCLA Department of Applied Linguistics and TESL & Center for Digital Humanities. (2003). WebLAS (Web-based Language Assessment System). http://www.weblas.ucla.edu/ (Accessed: 10 August 2005).
- Vongpumivitch, V. 2004. Measuring the knowledge of text structure in academic English as a second language (ESL), Los Angeles: University of California. Unpublished doctoral dissertation.
- Williamson, D. M., Bejar, I. I. and Hone, A. S. 1999. “Mental model” comparison of automated and human scoring. Journal of Educational Measurement, 36: 158–184.
- Williamson, D. M., Bejar, I. I. and Saxe, A. 2004. Automated tools for subject matter expert evaluation of automated scoring. Applied Measurement in Education, 17: 323–357.
- Yang, Y., Buckendahl, C. W., Juszkiewicz, P. J. and Bhola, D. S. 2002. A review of strategies for validating computer-automated scoring. Applied Measurement in Education, 15: 391–412.