References
- Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. In B. N. Petrov & F. Csaki (Eds.), Second international symposium on information theory (pp. 267–281). Akademiai Kiado.
- Albrecht, S., Cullen, C., Davies, K., Dunlop, M., Elliott, M., & Stevenson, L. (2018). Cambridge Assessment English. Cambridge University Press. https://www.cambridgeenglish.org/Images/461823-young-learners-revision-publication.pdf
- Alim, S. A., & Rashid, N. K. A. (2018). Some commonly used speech feature extraction algorithms. In R. Lopez-Ruiz (Ed.), From natural to artificial intelligence: Algorithms and applications (pp. 2–19). IntechOpen. https://doi.org/http://dx.doi.org/10.5772/intechopen.80419
- Amazon Web Services (AWS). (2016). Amazon Polly [Text-to-Speech Service]. www.aws.amazon.com/polly/
- Bachman, L. F., & Palmer, A. S. (2010). Language assessment in practice: Developing language assessments and justifying their use in the real world. Oxford University Press.
- Barzilay, R., & Lapata, M. (2005). Modeling local coherence: An entity-based approach. Proceedings of the 43rd annual meeting of the association for computational linguistics Ann Arbor, MI, 141–148: Association for Computational Linguistics. https://doi.org/https://doi.org/10.3115/1219840.1219858
- Barzilay, R., & Lapata, M. (2008). Modeling local coherence: An entity-based approach. Computational Linguistics, 34(1), 1–34. https://doi.org/https://doi.org/10.1162/coli.2008.34.1.1
- Bernstein, J., Van Moere,A., & Cheng, J. (2010). Validating automated speaking tests. Language Testing, 27(3), 355–377. https://doi.org/https://doi.org/10.1177/0265532210364404
- Bishop, C. M. (2006). Pattern recognition and machine learning. Springer.
- Bolaños, D., Cole, R. A., Ward, W. H., Tindal, G. A., Hasbrouck, J., & Schwanenflugel, P. J. (2013). Human and automated assessment of oral reading fluency. Journal of Educational Psychology, 105(4), 1142–1151. https://doi.org/https://doi.org/10.1037/a0031479
- Burstein, J. (2009). Opportunities for natural language processing research in education. In A. Gelbukh (Ed.), Lecture notes in computer science: Vol. 5449. Computational linguistics and intelligent text processing (pp. 6–27). Springer. https://doi.org/https://doi.org/10.1007/978-3-642-00382-0_2
- Cain, K., & Oakhill, J. (2007). Cognitive bases of children’s language comprehension difficulties. In K. Cain & J. Oakhill (Eds.), Children’s comprehension problems in oral and written language: A cognitive perspective (pp. 283–295). Guilford Press.
- Cambridge Assessment English (CAE). (2021). Pre a1 starters, a1 movers and a2 flyers handbook for teachers. https://www.cambridgeenglish.org/Images/357180-starters-movers-and-flyers-handbook-for-teachers-2021.pdf
- Campbell, J.P. (1997). Speaker recognition: A tutorial. Proceedings of the IEEE, 85 (8), 1437–1462 https://doi.org/https://doi.org/10.1109/5.628714.
- Catts, H. W., Fey, M. E., Zhang, X., & Tomblin, J.B. (2001). Estimating the risk of future reading difficulties in kindergarten children. Language, Speech, and Hearing Services in Schools, 32(1), 38–50. https://doi.org/https://doi.org/10.1044/0161-1461(2001/004)
- Chapelle, C. A., & Chung, Y. R. (2010). The promise of NLP and speech processing technologies in language assessment. Language Testing, 27(3), 301–315. https://doi.org/https://doi.org/10.1177/0265532210364405
- Chen, H., & He, B. (2013). Automated essay scoring by maximising human-machine agreement. In D. Yarowsky, T. Baldwin, A. Korhonen, K. Livescu, and S. Bethard (Eds.), Proceedings of the 2013 conference on empirical methods in natural language processing, Seattle, WA, 1741–1752: Association for computational linguistics. https://aclanthology.org/D13-1180
- Chen, L., Zechner, K., Yoon, S. Y., Evanini, K., Wang, X., Loukina, A., Tao, J., Davis, L., Lee, C. M., Mundkowsky, R., Lu, C., Leong, C. W., & Gyawali, B. (2018). Automated scoring of nonnative speech using the SpeechRaterSM v. 5.0 engine. ETS Research Report Series, 1, 1-31: Educational Testing Service. https://doi.org/https://doi.org/10.1002/ets2.12198
- Chodorow, M., & Burstein, J. (2004). Beyond essay length: Evaluating e-rater’s performance on TOEFL essays. TOEFL Research Report Series, 1, i-83.Educational Testing Service . https://doi.org/https://doi.org/10.1002/j.2333-8504.2004.tb01931.x
- Collins-Thompson, K., & Callan, J. P. (2004). A language modeling approach to predicting reading difficulty. Proceedings of the human language technology conference of the North American chapter of the association for computational linguistics, Boston, MA, 193–200. Association for Computational Linguistics. https://aclanthology.org/N04-1025
- Cordeiro, J., & Brazdil, P. (2004). Learning text extraction rules, without ignoring stop words. In A. Fred (Ed.), Proceedings of 4th international workshop on pattern recognition in information systems (pp. 128–138). SciTePress. https://doi.org/https://doi.org/10.5220/0002681601280138
- Covington, M. A., & McFall, J. D. (2010). Cutting the Gordian knot: The moving-average type–token ratio (MATTR). Journal of Quantitative Linguistics, 17(2), 94–100. https://doi.org/https://doi.org/10.1080/09296171003643098
- Crystal, D. (2008). A dictionary of linguistics and phonetics (6th ed.). Blackwell Pub.
- Cucchiarini, C., Strik, H., & Boves, L. W. (2002). Quantitative assessment of second language learners’ fluency: Comparisons between read and spontaneous speech. The Journal of the Acoustical Society of America, 111(6), 2862–2873. https://doi.org/https://doi.org/10.1121/1.1471894
- Cumbal, R., Moell, B., Lopes, J., & Engwall, O. (2021). “You don’t understand me!”: Comparing ASR Results for L1 and L2 Speakers of Swedish. Proceedings of Interspeech 2021, Brno, Czechia, 30 August - 3 September 2021. International Speech Communication Association. 4463–4467. https://doi.org/http://dx.doi.org/10.21437/Interspeech.2021-2140
- Döllinger, M., Dubrovskiy, D., & Patel, R. (2012). Spatiotemporal analysis of vocal fold vibrations between children and adults. The Laryngoscope, 122(11), 2511–2518. https://doi.org/https://doi.org/10.1002/lary.23568
- Dong, G., & Liu, H. (2018). Feature engineering for machine learning and data analytics (Vol. 1, 1st ed.). CRC Press. https://doi.org/https://doi.org/10.1201/9781315181080
- Dong, L. (2011). Time series analysis of jitter in sustained vowels. Proceedings of the17th international congress of phonetic sciences meeting, August 17-21, 2011, Hong Kong, 603-606: Cambridge University Press. . https://www.internationalphoneticassociation.org/icphs-proceedings/ICPhS2011/OnlineProceedings/RegularSession/Dong/Dong.pdf
- Educational Testing Services (ETS). (2019). Handbook for the TOEFL primary tests. https://www.ets.org/s/toefl_primary/pdf/toefl-primary-handbook-2019.pdf
- Evanini, K., & Zechner, K. (2019). Overview of automated speech scoring. In K. Zechner & K. Evanini (Eds.), Automated speaking assessment: Using language technologies to score spontaneous speech (pp. 3–20). Routledge.
- Eyben, F., Weninger, F., Gross, F., & Schuller, B. (2013). Recent developments in openSMILE, the Munich open-source multimedia feature extractor. Proceedings of the 21st ACM international conference on multimedia, New York, NY, 835–838.: Association for Computing Machinery. https://doi.org/https://doi.org/10.1145/2502081.2502224
- Farrús, M., Hernando, J., & Ejarque, P. (2007). Jitter and shimmer measurements for speaker recognition. Proceedings of the 8th annual conference of the international speech communication association, Antwerp, Belgium, 778–781: International Speech Communication Association. http://hdl.handle.net/10230/28250
- Fergadiotis, G., Wright, H. H., & West, T. M. (2013). Measuring lexical diversity in narrative discourse of people with aphasia. American Journal of Speech-Language Pathology, 22(2), 397–408. https://doi.org/https://doi.org/10.1044/1058-0360(2013/12-0083)
- Fielding, L., Kerr, N., & Rosier, P. (2007). Annual growth for all students: Catch-up growth for those who are behind. New Foundation Press.
- Fillmore, L. W., & Snow, C. E. (2018). What Teachers Need to Know About Language. In C. T. Adger, D. Christian, & C. E. Snow Eds., What teachers need to know about language (2nd ed., pp. 8–51). Multilingual Matters. https://doi.org/https://doi.org/10.21832/9781788920193-003.
- Firth, J. R. (1957). Papers in linguistics, 1934–1951. Oxford University Press.
- Forero, C. G., & Maydeu-Olivares, A. (2009). Estimation of IRT graded response models: Limited versus full information methods. Psychological Methods, 14(3), 275–299. https://doi.org/https://doi.org/10.1037/a0015825
- Fuchs, L. S., Fuchs, D., Hosp, M. K., & Jenkins, J. R. (2001). Oral reading fluency as an indicator of reading competence: A theoretical, empirical, and historical analysis. Scientific Studies of Reading, 5(3), 239–256. https://doi.org/https://doi.org/10.1207/S1532799XSSR0503_3
- Gerosa, M., Giuliani, D., Narayanan, S., & Potamianos, A. (2009). A review of ASR technologies for children’s speech. Proceedings of the 2nd workshop on child, computer and interaction New York, NY, 1–8: Association for Computing Machinery. https://doi.org/https://doi.org/10.1145/1640377.1640384
- Geva, E., & Wiener, J. (2014). Psychological assessment of culturally and linguistically diverse children and adolescents: A practitioner’s guide. Springer.
- Gillam, R. B., & Johnston, J. R. (1992). Spoken and written language relationships in language/learning-impaired and normally achieving school-age children. Journal of Speech, Language, and Hearing Research, 35(6), 1303–1315. https://doi.org/https://doi.org/10.1044/jshr.3506.1303
- Good III, R. H., Simmons, D. C., & Kame’enui, E. J. (2001). The importance and decision-making utility of a continuum of fluency-based indicators of foundational reading skills for third-grade high-stakes outcomes. Scientific Studies of Reading, 5(3), 257–288. https://doi.org/https://doi.org/10.1207/S1532799XSSR0503_4
- Gràcia, M., Vega, F., Jarque, S., Adam, A. L., Jarque, M. J., & Hui, S. K. F. (2021). Teaching practices for developing oral language skills in Catalan schools. Cogent Education, 8(1), 1935647. https://doi.org/https://doi.org/10.1080/2331186X.2021.1935647
- Gupta, D., Bansal, P., & Choudhary, K. (2018). The state of the art of feature extraction techniques in speech recognition. In S. Agrawal, A. Devi, R. Wason, & P. Bansal (Eds.), Advances in intelligent systems and computing, Vol 664. Speech and language processing for human-machine communications (pp. 195–207). Springer. https://doi.org/https://doi.org/10.1007/978-981-10-6626-9_22
- Hacki, T., & Heitmüller, S. (1999). Development of the child’s voice: Premutation, mutation. International Journal of Pediatric Otorhinolaryngology, 49(1), S141–S144. https://doi.org/https://doi.org/10.1016/S0165-5876(99)00150-0
- Hagen, A., Pellom, B., & Cole, R. (2007). Highly accurate children’s speech recognition for interactive reading tutors using subword units. Speech Communication, 49(12), 861–873. https://doi.org/https://doi.org/10.1016/j.specom.2007.05.004
- Han, J., Pei, J., & Kamber, M. (2011). Data mining: Concepts and techniques. Elsevier.
- Hannah, L., Kim, H., & Jang, E. E. (2021). Investigating the effects of task type and linguistic background on accuracy in automated speech recognition systems: Implications for use in language assessment of young learners. [Manuscript submitted for publication]. Department of Applied Psychology and Human Development, Ontario Institute for Studies in Education, University of Toronto.
- Hasselgreen, A., & Caudwell, C. (2016). Assessing the language of young learners. Language Testing, 22(3), 337–354. https://doi.org/https://doi.org/10.1191/0265532205lt312oa
- Heilmann, J., Miller, J. F., Nockerts, A., & Dunaway, C. (2010). Properties of the narrative scoring scheme using narrative re-tells in young school-age children. American Journal of Speech-Language Pathology, 19(2), 154–166. https://doi.org/https://doi.org/10.1044/1058-0360(2009/08-0024)
- Hsieh, C., & Wang, Y. (2019). Speaking proficiency of young language students: A discourse-analytic study. Language Testing, 36(1), 27–50. https://doi.org/https://doi.org/10.1177/0265532217734240
- Huber, J. E., Stathopoulos, E. T., Curione, G. M., Ash, T. A., & Johnson, K. (1999). Formants of children, women, and men: The effects of vocal intensity variation. The Journal of the Acoustical Society of America, 106(3), 1532–1542. https://doi.org/https://doi.org/10.1121/1.427150
- Hymes, D. H. (1972). On communicative competence. In J. B. Pride & J. Holmes (Eds.), Sociolinguistics. Selected readings (pp. 269–293). Penguin.
- Jones, S., Fox, C., Gillam, S., Gillam, R. B., & Baxter, G. J. (2019). An exploration of automated narrative analysis via machine learning. PloS One, 14(10), 1–14. https://doi.org/https://doi.org/10.1371/journal.pone.0224634
- Juel, C., Biancarosa, G., Coker, D., & Deffes, R. (2003). Walking with Rosie: A cautionary tale of early reading instruction. Educational Leadership, 60(7), 12–18.
- Kang, O., & Ginther, A. (Eds.). (2017). Assessment in second language pronunciation. Taylor & Francis.
- Kang, O., & Johnson, D. (2018). The roles of suprasegmental features in predicting English oral proficiency with an automated system. Language Assessment Quarterly, 15(2), 150–168. https://doi.org/https://doi.org/10.1080/15434303.2018.1451531
- Kincaid, J. (2018, August 20). Challenges in measuring automatic transcription accuracy. Descript. https://medium.com/descript/challenges-in-measuring-automatic-transcription-accuracy-f322bf5994f
- Kohavi, R., & John, G. H. (1998). The wrapper approach. In H. Liu & H. Motoda (Eds.), The springer international series in engineering and computer science, Vol 453. Feature extraction, construction and selection (pp. 33–50). Springer. https://doi.org/https://doi.org/10.1007/978-1-4615-5725-8_3
- Kusner, M., Sun, Y., Kolkin, N., & Weinberger, K. (2015). From word embeddings to document distances. Proceedings of the 32nd international conference on machine learning, Lille, France, 957–966. Journal of Machine Learning Research. https://dl.acm.org/doi/abs/10.5555/3045118.3045221
- Landauer, T. K., Laham, D., & Foltz, P. W. (2002). Automated scoring and annotation of essays with the intelligent essay assessor. In M. D. Shermis & J. C. Burstein (Eds.), Automated essay scoring: A cross-disciplinary perspective (1st ed., pp. 305–329). Routledge.
- Lee, S., Potamianos, A., & Narayanan, S. (1999). Acoustics of children’s speech: Developmental changes of temporal and spectral parameters. The Journal of the Acoustical Society of America, 105(3), 1455–1468. https://doi.org/https://doi.org/10.1121/1.426686
- Lennox, M., Westerveld, M. F., & Trembath, D. (2017). Should we use sentence-or text-level tasks to measure oral language proficiency in year-one students following whole-class intervention? Folia Phoniatrica Et Logopaedica, 69(4), 169–179. https://doi.org/https://doi.org/10.1159/000485974
- Lin, C. Y. (2004). Rouge: A package for automatic evaluation of summaries. Preceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, 74-81: Association for Computational Linguistics.https://aclanthology.org/W04-1013
- Lin, T. H., & Dayton, C. M. (1997). Model selection information criteria for non-nested latent class models. Journal of Educational and Behavioral Statistics, 22(3), 249–264. https://doi.org/https://doi.org/10.3102/10769986022003249
- Lord, F. M. (1980). Applications of item response theory to practical testing problems. Routledge. https://doi.org/https://doi.org/10.4324/9780203056615
- Luoma, S. (2004). Assessing speaking. Cambridge University Press. https://doi.org/https://doi.org/10.1017/CBO9780511733017
- Malec, A., Peterson, S. S., & Elshereif, H. (2017). Assessing young children’s oral language: Recommendations for classroom practice and policy. Canadian Journal of Education/Revue Canadienne De L’éducation, 40(3), 362–392https://www.jstor.org/stable/90014782.
- Mandler, J. M., Scribner, S., Cole, M., & DeForest, M. (1980). Cross-cultural invariance in story recall. Child Development, 51(1), 19–26. https://doi.org/https://doi.org/10.2307/1129585
- Manning, C., Raghavan, P., & Schütze, H. (2008). Introduction to information retrieval. Cambridge University Press. https://nlp.stanford.edu/IR-book/information-retrieval-book.html
- McCarthy, P. M. (2005). An assessment of the range and usefulness of lexical diversity measures and the potential of the measure of textual, lexical diversity (MTLD) [Unpublished doctoral dissertation]. The University of Memphis.
- McKay, P. (2006). Assessing young language learners. Cambridge University Press. https://doi.org/https://doi.org/10.1017/CBO9780511733093
- McNamara, D. S., Crossley, S. A., & Roscoe, R. (2013). Natural language processing in an intelligent writing strategy tutoring system. Behavior Research Methods, 45(2), 499–515. https://doi.org/https://doi.org/10.3758/s13428-012-0258-1
- MetaMetrics Inc. (2021). Lexile grade level charts. https://hub.lexile.com/lexile-grade-level-charts
- Miller, R. D., Correa, V. I., & Katsiyannis, A. (2018). Effects of a story grammar intervention with repeated retells for English learners with language impairments. Communication Disorders Quarterly, 40(1), 15–27. https://doi.org/https://doi.org/10.1177/1525740117751897
- Milton, J. (2013). Measuring the contribution of vocabulary knowledge to proficiency in the four skills. In C. Bardel, C. Lindqvist, & B. Laufer (Eds.), L2 vocabulary acquisition, knowledge and use: New perspectives on assessment and corpus analysis (pp. 57–78). Eurosla Monographs Series, 2.
- Mladenic, D., & Grobelnik, M. (1999). Feature selection for unbalanced class distribution and naïve Bayes. In I. Bratko, and S. Dzeroski (Eds.), Proceedings of the sixteenth international conference on machine learning, San Francisco, CA,(pp. 258–267). Kaufmann.
- Mohsen, M. A., & Almudawis, S. (2020). Second language vocabulary gains from listening versus reading comprehension input: A comparative study. Journal of Psycholinguistic Research, 50(3), 543–562. https://doi.org/https://doi.org/10.1007/s10936-020-09690-y
- Morris, J., & Hirst, G. (1991). Lexical cohesion computed by thesaural relations as an indicator of the structure of text. Computational Linguistics, 17(1), 21–48. https://dl.acm.org/doi/abs/10.5555/971738.971740
- Napolitano, D., Sheehan, K. M., & Mundkowsky, R. (2015). Online readability and text complexity analysis with TextEvaluator. Proceedings of the 2015 conference of the North American chapter of the association for computational linguistics: Demonstrations, Denver, CO, 96–100: Association for Computational Linguistics. https://aclanthology.org/N15-3020.pdf
- Nippold, M. A. (2016). Later language development: School-age children, adolescents, and young adults (4th ed.). PRO-ED.
- Norbury, C. F., & Bishop, D. V. (2003). Narrative skills of children with communication impairments. International Journal of Language & Communication Disorders, 38(3), 287–313. https://doi.org/https://doi.org/10.1080/136820310000108133
- Pearson Education. (2019). Versant English test: Test description and validation summary. . . . https://www.pearson.com/content/dam/one-dot-com/one-dot-com/english/SupportingDocs/Versant/ValidationSummary/Versant-English-Test-Description-Validation-Report.pdf
- Pearson Education. (March, 2021). Global scale of English assessment framework for young learners. https://www.pearson.com/content/dam/one-dot-com/one-dot-com/english/SupportingDocs/GSE_Assessment_Young.pdf
- Perelman, L. (2014). When “the state of the art” is counting words. Assessing Writing, 21(1), 104–111. https://doi.org/https://doi.org/10.1016/j.asw.2014.05.001
- Petersen, D. B., & Spencer, T. D. (2012). The narrative language measures: Tools for language screening, progress monitoring, and intervention planning. Perspectives on Language Learning and Education, 19(4), 119–129. https://doi.org/https://doi.org/10.1044/lle19.4.119
- Peterson, S. S. (2016). Supporting young children’s vocabulary through play. What Works? Research into Practice, 62, 1–4. http://thelearningexchange.ca/wp-content/uploads/2017/02/ww_vocabulary.pdf
- Python Software Foundation (2021). The Python Language Reference. [Computer software]. https://docs.python.org/3/reference/
- Quinlan, T., Higgins, D., & Wolff, S. (2009). Evaluating the construct‐coverage of the e‐rater® scoring engine. ETS Research Report Series, 1, 1-35: Educational Testing Service, . https://doi.org/https://doi.org/10.1002/j.2333-8504.2009.tb02158.x
- Rahimi, Z., Litman, D., Correnti, R., Wang, E., & Matsumura, L. C. (2017). Assessing students’ use of evidence and organisation in response-to-text writing: Using natural language processing for rubric-based automated scoring. International Journal of Artificial Intelligence Education, 27(4), 694–728. https://doi.org/https://doi.org/10.1007/s40593-017-0143-2
- Rubio, V. J., Aguado, D., Hontangas, P. M., & Hernández, J. M. (2007). Psychometric properties of an emotional adjustment measure: An application of the graded response model. European Journal of Psychological Assessment, 23(1), 39–46. https://doi.org/https://doi.org/10.1027/1015-5759.23.1.39
- Sahidullah, M., Chakroborty, S., & Saha, G. (2010). On the use of perceptual line spectral pairs frequencies and higher-order residual moments for speaker identification. International Journal of Biometrics, 2(4), 358–378. https://doi.org/https://doi.org/10.1504/IJBM.2010.035450
- Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometrika Monograph Supplement, 34(1), 1–97. https://doi.org/https://doi.org/10.1007/BF03372160
- Schwanenflugel, P. J., Hamilton, A. M., Kuhn, M. R., Wisenbaker, J. M., & Stahl, S. A. (2004). Becoming a fluent reader: Reading skill and prosodic features in the oral reading of young readers. Journal of Educational Psychology, 96(1), 119–129. https://doi.org/https://doi.org/10.1037/0022-0663.96.1.119
- Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6(2), 461–464. https://doi.org/https://doi.org/10.1214/aos/1176344136
- Sebastiani, F. (2002). Machine learning in automated text categorisation. ACM Computing Surveys, 34(1), 1–47. https://doi.org/https://doi.org/10.1145/505282.505283
- Shermis, M. D. (2014). State-of-the-art automated essay scoring: Competition, results, and future directions from a United States demonstration. Assessing Writing, 20, 53–76. https://doi.org/https://doi.org/10.1016/j.asw.2013.04.001
- Smith, B. L., Kenney, M. K., & Hussain, S. (1996). A longitudinal investigation of duration and temporal variability in children’s speech production. Journal of the Acoustical Society of America, 99(4), 2344–2349. https://doi.org/https://doi.org/10.1121/1.415421
- So, Y., Wolf, M. K., Hauck, M. C., Mollaun, P., Rybinski, P., Tumposky, D., & Wang, J. (2015). TOEFL Junior® design framework. Educational Testing Service. ETS Research Report Series, TOEFL Junior Research Report No. 02. https://files.eric.ed.gov/fulltext/EJ1109688.pdf
- spaCy. (2020). spaCy (Version 2.0) [Natural language processing library]. https://spacy.io
- Stein, N. L., Glenn, C. G., & Freedle, R. (1979). An analysis of story comprehension in elementary school children. In R. O. Freedle (Ed.), New directions in discourse processing (pp. 53–120). Ablex.
- Teixeira, J. P., Oliveira, C., & Lopes, C. (2013). Vocal acoustic analysis–jitter, shimmer and HNR parameters. Procedia Technology, 9, 1112–1122. https://doi.org/https://doi.org/10.1016/j.protcy.2013.12.124
- Uchikoshi, Y., Yang, L., Lohr, B., & Leung, G. (2016). Role of oral proficiency on reading comprehension: Within-language and cross-language relationships. Literacy Research, 65(1), 236–252. https://doi.org/https://doi.org/10.1177/2381336916661538
- Vector Psychometric Group (VPG). (2020). IRTPRO (Version 5.1) [Computer software].
- Wang, J., & Wang, X. (2019). Structural equation modeling: Applications using Mplus. John Wiley & Sons.
- Wilson, R. M., Gambrell, L. B., & Pfeiffer, W. R. (1985). The effects of re-telling upon reading comprehension and recall of text information. The Journal of Educational Research, 78(4), 216–220. https://doi.org/https://doi.org/10.1080/00220671.1985.10885604
- Witten, I. H., Frank, E., Hall, M. A., & Pal, C. J. (2011). Data mining: Practical machine learning tools and techniques (3rd ed.). Elsevier.
- Witten, I. H., & Frank, E. (2005). Data mining: Practical machine learning tools and techniques (2nd ed.). Elsevier.
- Wolf, M., & Katzir-Cohen, T. (2001). Reading fluency and its intervention. Scientific Studies of Reading, 5(3), 211–239. https://doi.org/https://doi.org/10.1207/S1532799XSSR0503_2
- Xi, X., Higgins, D., Zechner, K., & Williamson, D. (2012). A comparison of two scoring methods for an automated speech scoring system. Language Testing, 29(3), 371–394. https://doi.org/https://doi.org/10.1177/0265532211425673
- Yoon, S., & Bhat, S. (2018). A comparison of grammatical proficiency measures in the automated assessment of spontaneous speech. Speech Communication, 99, 221–230. https://doi.org/https://doi.org/10.1016/j.specom.2018.04.003
- Zechner, K., Chen, L., Davis, L., Evanini, K., Lee, C. M., Leong, C. W., Wang, X., & Yoon, S. Y. (2015). Automated scoring of speaking tasks in the test of English‐for‐Teaching (TEFT™). ETS Research Report Series, 2, 1-17: Education Testing Services. , . https://doi.org/https://doi.org/10.1002/ets2.12080
- Zheng, A., & Casari, A. (2018). Feature engineering for machine learning: Principles and techniques for data scientists. O’Reilly.
- Zupanc, K., & Bosnić, Z. (2017). Automated essay evaluation with semantic analysis. Knowledge-Based Systems, 120, 118–132. https://doi.org/https://doi.org/10.1016/j.knosys.2017.01.006