521
Views
7
CrossRef citations to date
0
Altmetric
Articles

Learner corpora as a research tool for the investigation of lexical competence in L2 Spanish

Pages 180-193 | Received 08 Jul 2015, Accepted 02 Oct 2015, Published online: 21 Dec 2015
 

Abstract

This article presents a state-of-the-art discussion of second language (L2) Spanish corpus-based research on lexical competence. While L2 Spanish learner corpus research (LCR) is still in its infancy, we will review the major findings of relevant studies on the production of several lexical aspects: copula choice with ser/estar; overt/null pronoun distribution; collocations and lexico-syntactic verbal competence. Due to the highly contextualised nature of learner corpus data, many of these studies show that learners do not always behave differently from natives in terms of frequency of use, though they may differ in terms of discursive and pragmatic uses. The article ends with some theoretical and methodological caveats about L2 Spanish learner corpus research. An argument is made for the need to conduct L2 corpus-based research which (1) is theoretically motivated and explanatory (as opposed to descriptive and pedagogical), (2) uses fine-grained annotation (as opposed to coarse-grained, general tagsets), (3) exploits learner corpora that are properly designed and where learner variables are properly controlled for.

Este artículo presenta una revisión crítica de los estudios de corpus de español como segunda lengua (L2) sobre la competencia léxica. Si bien la investigación de la adquisición del español L2 basada en corpus está aún en su infancia, se revisarán los hallazgos principales de algunos estudios clave sobre el uso de verbos copulativos (ser/estar), la distribución de pronombres plenos y nulos, las colocaciones y otros aspectos léxicos. Dado que los datos de corpus proceden de discursos naturales, la mayoría de estos estudios demuestran que los aprendices se comportan de forma similar a los nativos con relación a la frecuencia de uso, aunque difieren con relación a los usos pragmáticos y discursivos. El artículo concluye con una discusión de las limitaciones teóricas y metodológicas de dichos estudios y con recomendaciones para que los futuros estudios de español L2 basados en corpus (1) estén teóricamente motivados y sean explicativos (en vez de descriptivos y pedagógicos), (2) usen sistemas de anotación más sofisticados (en vez de sistemas genéricos) y (3) exploten corpus de aprendices que estén bien diseñados y que controlen las variables del aprendiz.

Disclosure statement

No potential conflict of interest was reported by the author.

Notes

1. In the following examples, the ‘#’ symbol indicates that the pronominal coreference is pragmatically infelicitous (either redundant or ambiguous), though the sentence is syntactically correct.

2. The authors define sophistication as “the number of low-frequency words that are appropriate to the topic and style of a particular text” (p. 564).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.