200
Views
0
CrossRef citations to date
0
Altmetric
Articles

Designing, compiling and interrogating corpora in L2 Spanish acquisition research

Diseño, compilación y cotejo de corpus en la investigación sobre la adquisición del español LE/L2

&
Pages 190-206 | Received 15 Jun 2022, Accepted 15 Nov 2022, Published online: 09 Jan 2023
 

ABSTRACT

Despite the burgeoning field of Spanish second language acquisition (SLA) research, large Spanish learner corpora (LC) are not common practice yet. We present a general yet practical introduction to the multiple decisions Spanish as a second language (L2) researchers should consider before creating their own LC. We focus on (i) two freely available Spanish LC (CEDEL2 and COWS-L2H), (ii) their general design principles, (iii) crucial variables to collect (learner and task variables), (iv) ways of collecting and compiling LC data, and (v) the final product (the corpus interface). We explore different ways of interrogating the two corpora, illustrating them with specific (morpho)syntactic and lexical examples from L2 Spanish, as well as potential curricular and teaching applications of LC. We conclude with a recommendation for the triangulation of LC data with experimental research and a summary of future directions that the field of LC research may take. Our ultimate aim is to equip researchers with the basic theoretical and methodological tools to design, build and collect their own LC.

RESUMEN

A pesar del reciente auge del campo de la investigación de la adquisición de español como segunda lengua (L2), el uso de corpus de aprendices (CA) sigue sin ser una práctica habitual. En este artículo presentamos, de manera general a la vez que práctica, las múltiples decisiones a las que se enfrentan los investigadores de español L2 a la hora de crear su propio corpus. Nos centramos en (i) dos CA de español de acceso gratuito (CEDEL2 and COWS-L2H), (ii) sus principios de diseño, (iii) las variables relativas a los aprendices y a las tareas, (iv) maneras de recoger y compilar los datos y (v) el producto final (interfaces de búsqueda). Exploramos diferentes maneras de interrogar los corpus, ilustrándolas con ejemplos lingüísticos, y describimos posibles usos de esos datos tanto en la investigación como en la enseñanza. Concluimos con una recomendación de triangular datos de CA y experimentos y un resumen de los próximos pasos en el campo de la investigación de CA. Nuestra finalidad es equipar a los investigadores con herramientas básicas para compilar exitosamente su propio CA.

Notes

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 309.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.