ABSTRACT
The current article offers an overview of scholarship on additional-language (e.g., second-language, heritage-language) users of Spanish that has been carried out using learner corpora in the last decade. I focus the review of Spanish learner corpus research on investigations that have examined grammar (e.g., fluency, grammatical gender), vocabulary (e.g., lexical diversity), and pragmatics (e.g., discourse markers), and I highlight the contributions that this body of work has made to the understanding of the use and development of additional-language Spanish. I also discuss the pedagogical applications that this line of inquiry may have. I conclude by identifying specific avenues for future work pertaining to research on additional-language learning and the development of new corpora.
RESUMEN
Este artículo ofrece una descripción general de las investigaciones en la última década sobre hablantes de español como lengua adicional (p. ej., como segunda lengua o lengua de herencia) que se han llevado a cabo utilizando corpus de aprendices. Más concretamente, se pone el foco de atención en la revisión de la investigación de corpus de aprendices de español que examina la gramática (p. ej., la fluidez, el género gramatical), el vocabulario (p. ej., la diversidad léxica) y la pragmática (p. ej., los marcadores discursivos), destacando estas contribuciones al uso y desarrollo del español como lengua adicional. También se abordan las implicaciones pedagógicas más notables. El artículo concluye con la identificación de vías específicas para el trabajo futuro relacionado con la investigación sobre el aprendizaje de idiomas adicionales y el desarrollo de nuevos corpus.
Notes
1 I use this term to refer to second-language and heritage-language learning.
2 See Miaschi et al. (Citation2020) for an example of a corpus-based study that uses Natural Language Processing tools to examine writing in additional-language Spanish.
3 I also limit the present article to empirical studies that have analyzed corpus data only. Investigations, however, that examine both corpus and either experimental data or other types of non-publicly available data exist (e.g., Tracy-Ventura and Myles Citation2015; Tracy-Ventura Citation2017; Sampedro Mella Citation2021). Additionally, there are numerous studies that have analyzed Spanish alongside other target languages (e.g., Stengers et al. Citation2011; Erman et al. Citation2015; Huensch and Tracy-Ventura Citation2017a; Huensch et al. Citation2019; Edmonds and Gudmestad Citation2021; McManus, Mitchell and Tracy-Ventura Citation2021; Tracy-Ventura, Huensch, and Mitchell Citation2021), but the current article is limited to those that focus on Spanish only.
4 Achievements and accomplishments are telic, whereas activities and states are unbounded.
5 See Gudmestad, Edmonds, and Metzger (Citation2019) and Gudmestad (Citation2020) for other investigations on grammatical gender marking in Spanish that consist of analyses of LANGSNAP data.
6 This corpus also contains data on learners of French and native speakers of French and Spanish.
7 The native-speaker or monolingual bias “results from the assumption that monolingualism is the default for human communication and from valuing nativeness as a superior form of language competence and the most legitimate relationship between a language and its users” (Ortega Citation2014, 32).
9 See Fernández-Mira et al. (Citation2021) for another corpus-based study on lexical diversity in second-language Spanish.
Miaschi, A., S. Davidson, D. Brunato, F. Dell’Orletta, K. Sagae, C. H. Sánchez-Gutiérrez, and G. Venturi. 2020. “Evolution of Written Language Competence in L2 Spanish Learners.” In Proceedings of the 15th Workshop on Innovative Use of NLP for Building Educational Applications, eds. J. Burstein, E. Kochmar, C. Leacock, N. Madnani, I. Pilán, H. Yannakoudakis, and T. Zesch, 92-101. Seattle: Association for Computational Linguistics. Tracy-Ventura, N. and F. Myles. 2015. “The Importance of Task Variability in the Design of Learner Corpora for SLA Research.” International Journal of Learner Corpus Research 1: 58-95. Tracy-Ventura, N. 2017. “Combining Corpora and Experimental Data to Investigate Language Learning During Residence Abroad: A Study of Lexical Sophistication.” System 71: 35-45. Sampedro Mella, M. 2021. “De la práctica a la teoría: Análisis del acto de solicitud en el corpus de aprendices de español (CAES) y propuesta para una mejora didáctica.” Revista de Lingüística Teórica y Aplicada 59: 63-89. Stengers, H., F. Boers, A. Housen, and J. Eyckmans. 2011. “Formulaic Sequences and L2 Oral Proficiency: Does the Type of Target Language Influence the Association?” IRAL 49: 321-343. Erman, B., A. Denke, L. Fant, and F. Forsberg. 2015. “Nativelike Expression in the Speech of Long-residency L2 Users: A Study of Multiword Structures in L2 English, French and Spanish.” International Journal of Applied Linguistics 25: 160-182. Huensch, A. and N. Tracy-Ventura. 2017a. “L2 Utterance Fluency Development Before, During, and After Residence Abroad: A Multidimensional Investigation.” The Modern Language Journal 101: 275-293. Huensch, A., N. Tracy-Ventura, J. Bridges, and J. A. Cuesta Medina. 2019. “Variables Affecting the Maintenance of L2 Proficiency and Fluency Four Years Post-Study Abroad.” Study Abroad Research in Second Language Acquisition and International Education 4 (1): 96-125. Edmonds, A. and A. Gudmestad. 2021. “The Role of Transparency in Grammatical-gender Marking Among Stay-Abroad Learners of Spanish and French.” In Study Abroad and the Second Language Learner: Expectations, Experiences and Development, ed. M. Howard, 233-253. London: Bloomsbury. McManus, K., R. Mitchell, and N. Tracy-Ventura. 2021. “A Longitudinal Study of Advanced Learners’ Linguistic Development Before, During, and After Study Abroad.” Applied Linguistics 42: 136-163. Tracy-Ventura, N., A. Huensch, and R. Mitchell. 2021. “Understanding the Long-term Evolution of L2 Lexical Diversity: The Contribution of a Longitudinal Learner Corpus.” In Learner Corpus Research Meets Second Language Acquisition, eds. B. Le Bruyn and M. Paquot, 148-171. Cambridge: Cambridge University Press. Gudmestad, A., A. Edmonds, and T. Metzger. 2019. “Using Variationism and Learner Corpus Research to Investigate Grammatical Gender Marking in Additional-language Spanish.” Language Learning 69 (4): 911-949. Gudmestad, A. 2020. “On the relationship between epistemology and methodology: A reanalysis of grammatical gender in additional-language Spanish.” In Interpreting language-learning data (Eurosla Studies 4), eds. A. Edmonds, P. Leclercq, and A. Gudmestad, 111-137. Berlin: Language Science Press. Ortega, L. 2014. “Ways Forward for a Bi/Multilingual Turn in SLA.” In The Multilingual Turn: Implications for SLA, TESOL, and Bilingual Education, ed. S. May, 32-53. London and New York: Routledge. Gudmestad, A. and A. Edmonds. Forthcoming. “The Variable Use of First-person-singular Subject Forms During an Academic Year Abroad.” Study Abroad and the Second Language Acquisition of Sociolinguistic Variation in Spanish, eds. S. Zahler, A. Y. Long, and B. Linford. Amsterdam: John Benjamins. Lozano, C. 2016. “Pragmatic Principles in Anaphora Resolution at the Syntax-Discourse Interface: Advanced English Learners of Spanish in the CEDEL2 corpus.” In Spanish Learner Corpus Research: Current Trends and Future Perspectives, ed. M. Alonso-Ramos, 235-265. Amsterdam: John Benjamins. Fernández-Mira, P., E. Morgan, S. Davidson, A. Yamada, A. Carando, K. Sagae, and C. H. Sánchez-Gutiérrez. 2021. “Lexical Diversity in an L2 Spanish Learner Corpus: The Effect of Topic-Related Variables.” International Journal of Learner Corpus Research 7: 230-258.