Assessment in Education: Principles, Policy & Practice Volume 28, 2021 - Issue 4: Use of Innovative Technology in Oral Language Assessment

Investigating the potential of NLP-driven linguistic and acoustic features for predicting human scores of children’s oral language proficiency

Melissa R. Hunte, Applied Psychology and Human Development, University of Toronto, Toronto, ON, Canada. Correspondence: [email protected]

https://orcid.org/0000-0003-2070-266X

Samantha McCormick, Applied Psychology and Human Development, University of Toronto, Toronto, ON, Canada

https://orcid.org/0000-0002-6324-6636

Maitree Shah, Electrical & Computer Engineering, University of Toronto, Toronto, ON, Canada

https://orcid.org/0000-0001-6032-8903

Clarissa Lau, Applied Psychology and Human Development, University of Toronto, Toronto, ON, Canada

https://orcid.org/0000-0002-8553-8242

Eunice Eunhee Jang, Applied Psychology and Human Development, University of Toronto, Toronto, ON, Canada

Pages 477-505 | Received 16 Jul 2020, Accepted 19 Oct 2021, Published online: 27 Dec 2021

https://doi.org/10.1080/0969594X.2021.1999209

References

Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. In B. N. Petrov & F. Csaki (Eds.), Second international symposium on information theory (pp. 267–281). Akademiai Kiado.
Albrecht, S., Cullen, C., Davies, K., Dunlop, M., Elliott, M., & Stevenson, L. (2018). Cambridge Assessment English. Cambridge University Press. https://www.cambridgeenglish.org/Images/461823-young-learners-revision-publication.pdf
Alim, S. A., & Rashid, N. K. A. (2018). Some commonly used speech feature extraction algorithms. In R. Lopez-Ruiz (Ed.), From natural to artificial intelligence: Algorithms and applications (pp. 2–19). IntechOpen. https://doi.org/10.5772/intechopen.80419
Amazon Web Services (AWS). (2016). Amazon Polly [Text-to-Speech Service]. www.aws.amazon.com/polly/
Bachman, L. F., & Palmer, A. S. (2010). Language assessment in practice: Developing language assessments and justifying their use in the real world. Oxford University Press.
Barzilay, R., & Lapata, M. (2005). Modeling local coherence: An entity-based approach. Proceedings of the 43rd annual meeting of the association for computational linguistics, Ann Arbor, MI, 141–148. Association for Computational Linguistics. https://doi.org/10.3115/1219840.1219858
Barzilay, R., & Lapata, M. (2008). Modeling local coherence: An entity-based approach. Computational Linguistics, 34(1), 1–34. https://doi.org/10.1162/coli.2008.34.1.1
Bernstein, J., Van Moere, A., & Cheng, J. (2010). Validating automated speaking tests. Language Testing, 27(3), 355–377. https://doi.org/10.1177/0265532210364404
Bishop, C. M. (2006). Pattern recognition and machine learning. Springer.
Bolaños, D., Cole, R. A., Ward, W. H., Tindal, G. A., Hasbrouck, J., & Schwanenflugel, P. J. (2013). Human and automated assessment of oral reading fluency. Journal of Educational Psychology, 105(4), 1142–1151. https://doi.org/10.1037/a0031479
Burstein, J. (2009). Opportunities for natural language processing research in education. In A. Gelbukh (Ed.), Lecture notes in computer science: Vol. 5449. Computational linguistics and intelligent text processing (pp. 6–27). Springer. https://doi.org/10.1007/978-3-642-00382-0_2
Cain, K., & Oakhill, J. (2007). Cognitive bases of children’s language comprehension difficulties. In K. Cain & J. Oakhill (Eds.), Children’s comprehension problems in oral and written language: A cognitive perspective (pp. 283–295). Guilford Press.
Cambridge Assessment English (CAE). (2021). Pre A1 Starters, A1 Movers and A2 Flyers handbook for teachers. https://www.cambridgeenglish.org/Images/357180-starters-movers-and-flyers-handbook-for-teachers-2021.pdf
Campbell, J. P. (1997). Speaker recognition: A tutorial. Proceedings of the IEEE, 85(8), 1437–1462. https://doi.org/10.1109/5.628714
Catts, H. W., Fey, M. E., Zhang, X., & Tomblin, J. B. (2001). Estimating the risk of future reading difficulties in kindergarten children. Language, Speech, and Hearing Services in Schools, 32(1), 38–50. https://doi.org/10.1044/0161-1461(2001/004)
Chapelle, C. A., & Chung, Y. R. (2010). The promise of NLP and speech processing technologies in language assessment. Language Testing, 27(3), 301–315. https://doi.org/10.1177/0265532210364405
Chen, H., & He, B. (2013). Automated essay scoring by maximising human-machine agreement. In D. Yarowsky, T. Baldwin, A. Korhonen, K. Livescu, & S. Bethard (Eds.), Proceedings of the 2013 conference on empirical methods in natural language processing, Seattle, WA, 1741–1752. Association for Computational Linguistics. https://aclanthology.org/D13-1180
Chen, L., Zechner, K., Yoon, S. Y., Evanini, K., Wang, X., Loukina, A., Tao, J., Davis, L., Lee, C. M., Mundkowsky, R., Lu, C., Leong, C. W., & Gyawali, B. (2018). Automated scoring of nonnative speech using the SpeechRater℠ v. 5.0 engine. ETS Research Report Series, 1, 1–31. Educational Testing Service. https://doi.org/10.1002/ets2.12198
Chodorow, M., & Burstein, J. (2004). Beyond essay length: Evaluating e-rater’s performance on TOEFL essays. TOEFL Research Report Series, 1, i–83. Educational Testing Service. https://doi.org/10.1002/j.2333-8504.2004.tb01931.x
Collins-Thompson, K., & Callan, J. P. (2004). A language modeling approach to predicting reading difficulty. Proceedings of the human language technology conference of the North American chapter of the association for computational linguistics, Boston, MA, 193–200. Association for Computational Linguistics. https://aclanthology.org/N04-1025
Cordeiro, J., & Brazdil, P. (2004). Learning text extraction rules, without ignoring stop words. In A. Fred (Ed.), Proceedings of 4th international workshop on pattern recognition in information systems (pp. 128–138). SciTePress. https://doi.org/10.5220/0002681601280138
Covington, M. A., & McFall, J. D. (2010). Cutting the Gordian knot: The moving-average type–token ratio (MATTR). Journal of Quantitative Linguistics, 17(2), 94–100. https://doi.org/10.1080/09296171003643098
Crystal, D. (2008). A dictionary of linguistics and phonetics (6th ed.). Blackwell Pub.
Cucchiarini, C., Strik, H., & Boves, L. W. (2002). Quantitative assessment of second language learners’ fluency: Comparisons between read and spontaneous speech. The Journal of the Acoustical Society of America, 111(6), 2862–2873. https://doi.org/10.1121/1.1471894
Cumbal, R., Moell, B., Lopes, J., & Engwall, O. (2021). “You don’t understand me!”: Comparing ASR results for L1 and L2 speakers of Swedish. Proceedings of Interspeech 2021, Brno, Czechia, 30 August–3 September 2021, 4463–4467. International Speech Communication Association. https://doi.org/10.21437/Interspeech.2021-2140
Döllinger, M., Dubrovskiy, D., & Patel, R. (2012). Spatiotemporal analysis of vocal fold vibrations between children and adults. The Laryngoscope, 122(11), 2511–2518. https://doi.org/10.1002/lary.23568
Dong, G., & Liu, H. (2018). Feature engineering for machine learning and data analytics (Vol. 1, 1st ed.). CRC Press. https://doi.org/10.1201/9781315181080
Dong, L. (2011). Time series analysis of jitter in sustained vowels. Proceedings of the 17th international congress of phonetic sciences meeting, August 17–21, 2011, Hong Kong, 603–606. Cambridge University Press. https://www.internationalphoneticassociation.org/icphs-proceedings/ICPhS2011/OnlineProceedings/RegularSession/Dong/Dong.pdf
Educational Testing Service (ETS). (2019). Handbook for the TOEFL primary tests. https://www.ets.org/s/toefl_primary/pdf/toefl-primary-handbook-2019.pdf
Evanini, K., & Zechner, K. (2019). Overview of automated speech scoring. In K. Zechner & K. Evanini (Eds.), Automated speaking assessment: Using language technologies to score spontaneous speech (pp. 3–20). Routledge.
Eyben, F., Weninger, F., Gross, F., & Schuller, B. (2013). Recent developments in openSMILE, the Munich open-source multimedia feature extractor. Proceedings of the 21st ACM international conference on multimedia, New York, NY, 835–838. Association for Computing Machinery. https://doi.org/10.1145/2502081.2502224
Farrús, M., Hernando, J., & Ejarque, P. (2007). Jitter and shimmer measurements for speaker recognition. Proceedings of the 8th annual conference of the international speech communication association, Antwerp, Belgium, 778–781. International Speech Communication Association. http://hdl.handle.net/10230/28250
Fergadiotis, G., Wright, H. H., & West, T. M. (2013). Measuring lexical diversity in narrative discourse of people with aphasia. American Journal of Speech-Language Pathology, 22(2), 397–408. https://doi.org/10.1044/1058-0360(2013/12-0083)
Fielding, L., Kerr, N., & Rosier, P. (2007). Annual growth for all students: Catch-up growth for those who are behind. New Foundation Press.
Fillmore, L. W., & Snow, C. E. (2018). What teachers need to know about language. In C. T. Adger, D. Christian, & C. E. Snow (Eds.), What teachers need to know about language (2nd ed., pp. 8–51). Multilingual Matters. https://doi.org/10.21832/9781788920193-003
Firth, J. R. (1957). Papers in linguistics, 1934–1951. Oxford University Press.
Forero, C. G., & Maydeu-Olivares, A. (2009). Estimation of IRT graded response models: Limited versus full information methods. Psychological Methods, 14(3), 275–299. https://doi.org/10.1037/a0015825
Fuchs, L. S., Fuchs, D., Hosp, M. K., & Jenkins, J. R. (2001). Oral reading fluency as an indicator of reading competence: A theoretical, empirical, and historical analysis. Scientific Studies of Reading, 5(3), 239–256. https://doi.org/10.1207/S1532799XSSR0503_3
Gerosa, M., Giuliani, D., Narayanan, S., & Potamianos, A. (2009). A review of ASR technologies for children’s speech. Proceedings of the 2nd workshop on child, computer and interaction, New York, NY, 1–8. Association for Computing Machinery. https://doi.org/10.1145/1640377.1640384
Geva, E., & Wiener, J. (2014). Psychological assessment of culturally and linguistically diverse children and adolescents: A practitioner’s guide. Springer.
Gillam, R. B., & Johnston, J. R. (1992). Spoken and written language relationships in language/learning-impaired and normally achieving school-age children. Journal of Speech, Language, and Hearing Research, 35(6), 1303–1315. https://doi.org/10.1044/jshr.3506.1303
Good III, R. H., Simmons, D. C., & Kame’enui, E. J. (2001). The importance and decision-making utility of a continuum of fluency-based indicators of foundational reading skills for third-grade high-stakes outcomes. Scientific Studies of Reading, 5(3), 257–288. https://doi.org/10.1207/S1532799XSSR0503_4
Gràcia, M., Vega, F., Jarque, S., Adam, A. L., Jarque, M. J., & Hui, S. K. F. (2021). Teaching practices for developing oral language skills in Catalan schools. Cogent Education, 8(1), 1935647. https://doi.org/10.1080/2331186X.2021.1935647
Gupta, D., Bansal, P., & Choudhary, K. (2018). The state of the art of feature extraction techniques in speech recognition. In S. Agrawal, A. Devi, R. Wason, & P. Bansal (Eds.), Advances in intelligent systems and computing: Vol. 664. Speech and language processing for human-machine communications (pp. 195–207). Springer. https://doi.org/10.1007/978-981-10-6626-9_22
Hacki, T., & Heitmüller, S. (1999). Development of the child’s voice: Premutation, mutation. International Journal of Pediatric Otorhinolaryngology, 49(1), S141–S144. https://doi.org/10.1016/S0165-5876(99)00150-0
Hagen, A., Pellom, B., & Cole, R. (2007). Highly accurate children’s speech recognition for interactive reading tutors using subword units. Speech Communication, 49(12), 861–873. https://doi.org/10.1016/j.specom.2007.05.004
Han, J., Pei, J., & Kamber, M. (2011). Data mining: Concepts and techniques. Elsevier.
Hannah, L., Kim, H., & Jang, E. E. (2021). Investigating the effects of task type and linguistic background on accuracy in automated speech recognition systems: Implications for use in language assessment of young learners. [Manuscript submitted for publication]. Department of Applied Psychology and Human Development, Ontario Institute for Studies in Education, University of Toronto.
Hasselgreen, A., & Caudwell, C. (2016). Assessing the language of young learners. Language Testing, 22(3), 337–354. https://doi.org/10.1191/0265532205lt312oa
Heilmann, J., Miller, J. F., Nockerts, A., & Dunaway, C. (2010). Properties of the narrative scoring scheme using narrative re-tells in young school-age children. American Journal of Speech-Language Pathology, 19(2), 154–166. https://doi.org/10.1044/1058-0360(2009/08-0024)
Hsieh, C., & Wang, Y. (2019). Speaking proficiency of young language students: A discourse-analytic study. Language Testing, 36(1), 27–50. https://doi.org/10.1177/0265532217734240
Huber, J. E., Stathopoulos, E. T., Curione, G. M., Ash, T. A., & Johnson, K. (1999). Formants of children, women, and men: The effects of vocal intensity variation. The Journal of the Acoustical Society of America, 106(3), 1532–1542. https://doi.org/10.1121/1.427150
Hymes, D. H. (1972). On communicative competence. In J. B. Pride & J. Holmes (Eds.), Sociolinguistics. Selected readings (pp. 269–293). Penguin.
Jones, S., Fox, C., Gillam, S., Gillam, R. B., & Baxter, G. J. (2019). An exploration of automated narrative analysis via machine learning. PLoS ONE, 14(10), 1–14. https://doi.org/10.1371/journal.pone.0224634
Juel, C., Biancarosa, G., Coker, D., & Deffes, R. (2003). Walking with Rosie: A cautionary tale of early reading instruction. Educational Leadership, 60(7), 12–18.
Kang, O., & Ginther, A. (Eds.). (2017). Assessment in second language pronunciation. Taylor & Francis.
Kang, O., & Johnson, D. (2018). The roles of suprasegmental features in predicting English oral proficiency with an automated system. Language Assessment Quarterly, 15(2), 150–168. https://doi.org/10.1080/15434303.2018.1451531
Kincaid, J. (2018, August 20). Challenges in measuring automatic transcription accuracy. Descript. https://medium.com/descript/challenges-in-measuring-automatic-transcription-accuracy-f322bf5994f
Kohavi, R., & John, G. H. (1998). The wrapper approach. In H. Liu & H. Motoda (Eds.), The Springer international series in engineering and computer science: Vol. 453. Feature extraction, construction and selection (pp. 33–50). Springer. https://doi.org/10.1007/978-1-4615-5725-8_3
Kusner, M., Sun, Y., Kolkin, N., & Weinberger, K. (2015). From word embeddings to document distances. Proceedings of the 32nd international conference on machine learning, Lille, France, 957–966. Journal of Machine Learning Research. https://dl.acm.org/doi/abs/10.5555/3045118.3045221
Landauer, T. K., Laham, D., & Foltz, P. W. (2002). Automated scoring and annotation of essays with the intelligent essay assessor. In M. D. Shermis & J. C. Burstein (Eds.), Automated essay scoring: A cross-disciplinary perspective (1st ed., pp. 305–329). Routledge.
Lee, S., Potamianos, A., & Narayanan, S. (1999). Acoustics of children’s speech: Developmental changes of temporal and spectral parameters. The Journal of the Acoustical Society of America, 105(3), 1455–1468. https://doi.org/10.1121/1.426686
Lennox, M., Westerveld, M. F., & Trembath, D. (2017). Should we use sentence- or text-level tasks to measure oral language proficiency in year-one students following whole-class intervention? Folia Phoniatrica Et Logopaedica, 69(4), 169–179. https://doi.org/10.1159/000485974
Lin, C. Y. (2004). ROUGE: A package for automatic evaluation of summaries. Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, 74–81. Association for Computational Linguistics. https://aclanthology.org/W04-1013
Lin, T. H., & Dayton, C. M. (1997). Model selection information criteria for non-nested latent class models. Journal of Educational and Behavioral Statistics, 22(3), 249–264. https://doi.org/10.3102/10769986022003249
Lord, F. M. (1980). Applications of item response theory to practical testing problems. Routledge. https://doi.org/10.4324/9780203056615
Luoma, S. (2004). Assessing speaking. Cambridge University Press. https://doi.org/10.1017/CBO9780511733017
Malec, A., Peterson, S. S., & Elshereif, H. (2017). Assessing young children’s oral language: Recommendations for classroom practice and policy. Canadian Journal of Education/Revue Canadienne De L’éducation, 40(3), 362–392. https://www.jstor.org/stable/90014782
Mandler, J. M., Scribner, S., Cole, M., & DeForest, M. (1980). Cross-cultural invariance in story recall. Child Development, 51(1), 19–26. https://doi.org/10.2307/1129585
Manning, C., Raghavan, P., & Schütze, H. (2008). Introduction to information retrieval. Cambridge University Press. https://nlp.stanford.edu/IR-book/information-retrieval-book.html
McCarthy, P. M. (2005). An assessment of the range and usefulness of lexical diversity measures and the potential of the measure of textual, lexical diversity (MTLD) [Unpublished doctoral dissertation]. The University of Memphis.
McKay, P. (2006). Assessing young language learners. Cambridge University Press. https://doi.org/10.1017/CBO9780511733093
McNamara, D. S., Crossley, S. A., & Roscoe, R. (2013). Natural language processing in an intelligent writing strategy tutoring system. Behavior Research Methods, 45(2), 499–515. https://doi.org/10.3758/s13428-012-0258-1
MetaMetrics Inc. (2021). Lexile grade level charts. https://hub.lexile.com/lexile-grade-level-charts
Miller, R. D., Correa, V. I., & Katsiyannis, A. (2018). Effects of a story grammar intervention with repeated retells for English learners with language impairments. Communication Disorders Quarterly, 40(1), 15–27. https://doi.org/10.1177/1525740117751897
Milton, J. (2013). Measuring the contribution of vocabulary knowledge to proficiency in the four skills. In C. Bardel, C. Lindqvist, & B. Laufer (Eds.), L2 vocabulary acquisition, knowledge and use: New perspectives on assessment and corpus analysis (pp. 57–78). Eurosla Monographs Series, 2.
Mladenic, D., & Grobelnik, M. (1999). Feature selection for unbalanced class distribution and naïve Bayes. In I. Bratko & S. Dzeroski (Eds.), Proceedings of the sixteenth international conference on machine learning, San Francisco, CA (pp. 258–267). Kaufmann.
Mohsen, M. A., & Almudawis, S. (2020). Second language vocabulary gains from listening versus reading comprehension input: A comparative study. Journal of Psycholinguistic Research, 50(3), 543–562. https://doi.org/10.1007/s10936-020-09690-y
Morris, J., & Hirst, G. (1991). Lexical cohesion computed by thesaural relations as an indicator of the structure of text. Computational Linguistics, 17(1), 21–48. https://dl.acm.org/doi/abs/10.5555/971738.971740
Napolitano, D., Sheehan, K. M., & Mundkowsky, R. (2015). Online readability and text complexity analysis with TextEvaluator. Proceedings of the 2015 conference of the North American chapter of the association for computational linguistics: Demonstrations, Denver, CO, 96–100. Association for Computational Linguistics. https://aclanthology.org/N15-3020.pdf
Nippold, M. A. (2016). Later language development: School-age children, adolescents, and young adults (4th ed.). PRO-ED.
Norbury, C. F., & Bishop, D. V. (2003). Narrative skills of children with communication impairments. International Journal of Language & Communication Disorders, 38(3), 287–313. https://doi.org/10.1080/136820310000108133
Pearson Education. (2019). Versant English test: Test description and validation summary. https://www.pearson.com/content/dam/one-dot-com/one-dot-com/english/SupportingDocs/Versant/ValidationSummary/Versant-English-Test-Description-Validation-Report.pdf
Pearson Education. (2021, March). Global scale of English assessment framework for young learners. https://www.pearson.com/content/dam/one-dot-com/one-dot-com/english/SupportingDocs/GSE_Assessment_Young.pdf
Perelman, L. (2014). When “the state of the art” is counting words. Assessing Writing, 21(1), 104–111. https://doi.org/10.1016/j.asw.2014.05.001
Petersen, D. B., & Spencer, T. D. (2012). The narrative language measures: Tools for language screening, progress monitoring, and intervention planning. Perspectives on Language Learning and Education, 19(4), 119–129. https://doi.org/10.1044/lle19.4.119
Peterson, S. S. (2016). Supporting young children’s vocabulary through play. What Works? Research into Practice, 62, 1–4. http://thelearningexchange.ca/wp-content/uploads/2017/02/ww_vocabulary.pdf
Python Software Foundation. (2021). The Python language reference [Computer software]. https://docs.python.org/3/reference/
Quinlan, T., Higgins, D., & Wolff, S. (2009). Evaluating the construct‐coverage of the e‐rater® scoring engine. ETS Research Report Series, 1, 1–35. Educational Testing Service. https://doi.org/10.1002/j.2333-8504.2009.tb02158.x
Rahimi, Z., Litman, D., Correnti, R., Wang, E., & Matsumura, L. C. (2017). Assessing students’ use of evidence and organisation in response-to-text writing: Using natural language processing for rubric-based automated scoring. International Journal of Artificial Intelligence in Education, 27(4), 694–728. https://doi.org/10.1007/s40593-017-0143-2
Rubio, V. J., Aguado, D., Hontangas, P. M., & Hernández, J. M. (2007). Psychometric properties of an emotional adjustment measure: An application of the graded response model. European Journal of Psychological Assessment, 23(1), 39–46. https://doi.org/10.1027/1015-5759.23.1.39
Sahidullah, M., Chakroborty, S., & Saha, G. (2010). On the use of perceptual line spectral pairs frequencies and higher-order residual moments for speaker identification. International Journal of Biometrics, 2(4), 358–378. https://doi.org/10.1504/IJBM.2010.035450
Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometrika Monograph Supplement, 34(1), 1–97. https://doi.org/10.1007/BF03372160
Schwanenflugel, P. J., Hamilton, A. M., Kuhn, M. R., Wisenbaker, J. M., & Stahl, S. A. (2004). Becoming a fluent reader: Reading skill and prosodic features in the oral reading of young readers. Journal of Educational Psychology, 96(1), 119–129. https://doi.org/10.1037/0022-0663.96.1.119
Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6(2), 461–464. https://doi.org/10.1214/aos/1176344136
Sebastiani, F. (2002). Machine learning in automated text categorisation. ACM Computing Surveys, 34(1), 1–47. https://doi.org/10.1145/505282.505283
Shermis, M. D. (2014). State-of-the-art automated essay scoring: Competition, results, and future directions from a United States demonstration. Assessing Writing, 20, 53–76. https://doi.org/10.1016/j.asw.2013.04.001
Smith, B. L., Kenney, M. K., & Hussain, S. (1996). A longitudinal investigation of duration and temporal variability in children’s speech production. Journal of the Acoustical Society of America, 99(4), 2344–2349. https://doi.org/10.1121/1.415421
So, Y., Wolf, M. K., Hauck, M. C., Mollaun, P., Rybinski, P., Tumposky, D., & Wang, J. (2015). TOEFL Junior® design framework. Educational Testing Service. ETS Research Report Series, TOEFL Junior Research Report No. 02. https://files.eric.ed.gov/fulltext/EJ1109688.pdf
spaCy. (2020). spaCy (Version 2.0) [Natural language processing library]. https://spacy.io
Stein, N. L., Glenn, C. G., & Freedle, R. (1979). An analysis of story comprehension in elementary school children. In R. O. Freedle (Ed.), New directions in discourse processing (pp. 53–120). Ablex.
Teixeira, J. P., Oliveira, C., & Lopes, C. (2013). Vocal acoustic analysis–jitter, shimmer and HNR parameters. Procedia Technology, 9, 1112–1122. https://doi.org/10.1016/j.protcy.2013.12.124
Uchikoshi, Y., Yang, L., Lohr, B., & Leung, G. (2016). Role of oral proficiency on reading comprehension: Within-language and cross-language relationships. Literacy Research, 65(1), 236–252. https://doi.org/10.1177/2381336916661538
Vector Psychometric Group (VPG). (2020). IRTPRO (Version 5.1) [Computer software].
Wang, J., & Wang, X. (2019). Structural equation modeling: Applications using Mplus. John Wiley & Sons.
Wilson, R. M., Gambrell, L. B., & Pfeiffer, W. R. (1985). The effects of re-telling upon reading comprehension and recall of text information. The Journal of Educational Research, 78(4), 216–220. https://doi.org/10.1080/00220671.1985.10885604
Witten, I. H., Frank, E., Hall, M. A., & Pal, C. J. (2011). Data mining: Practical machine learning tools and techniques (3rd ed.). Elsevier.
Witten, I. H., & Frank, E. (2005). Data mining: Practical machine learning tools and techniques (2nd ed.). Elsevier.
Wolf, M., & Katzir-Cohen, T. (2001). Reading fluency and its intervention. Scientific Studies of Reading, 5(3), 211–239. https://doi.org/10.1207/S1532799XSSR0503_2
Xi, X., Higgins, D., Zechner, K., & Williamson, D. (2012). A comparison of two scoring methods for an automated speech scoring system. Language Testing, 29(3), 371–394. https://doi.org/10.1177/0265532211425673
Yoon, S., & Bhat, S. (2018). A comparison of grammatical proficiency measures in the automated assessment of spontaneous speech. Speech Communication, 99, 221–230. https://doi.org/10.1016/j.specom.2018.04.003
Zechner, K., Chen, L., Davis, L., Evanini, K., Lee, C. M., Leong, C. W., Wang, X., & Yoon, S. Y. (2015). Automated scoring of speaking tasks in the test of English‐for‐Teaching (TEFT™). ETS Research Report Series, 2, 1–17. Educational Testing Service. https://doi.org/10.1002/ets2.12080
Zheng, A., & Casari, A. (2018). Feature engineering for machine learning: Principles and techniques for data scientists. O’Reilly.
Zupanc, K., & Bosnić, Z. (2017). Automated essay evaluation with semantic analysis. Knowledge-Based Systems, 120, 118–132. https://doi.org/10.1016/j.knosys.2017.01.006
