3,484
Views
3
CrossRef citations to date
0
Altmetric
REGULAR ARTICLES

The transposed-word effect revisited: the role of syntax in word position coding

ORCID Icon, ORCID Icon & ORCID Icon
Pages 668-673 | Received 20 Apr 2020, Accepted 19 Jan 2021, Published online: 02 Feb 2021

ABSTRACT

Skilled readers may misinterpret “you that read wrong” for “you read that wrong”: a transposed-word effect. This relatively novel finding, which supports parallel word processing during sentence reading, is attributed to a combination of noisy bottom-up word position coding and top-down syntactic constraints. The present study focussed on the contribution of syntactic constraints in driving transposed-word effects. In a speeded grammatical decision experiment, two types of ungrammatical transposed-word sequences were compared, namely a transposition either across a syntactic phrase (“the have girls gone home”) or within a syntactic phrase (“the girls gone have home”). We found longer response times and lower accuracy rates for within-phrase transpositions than across-phrase transpositions, demonstrating a direct influence of syntactic structures on the transposed-word effect. We conclude that the assignment of words to positions in a sentence is guided by top-down syntactic constraints.

Introduction

Skilled adult readers can read more than 200 words per minute (Brysbaert, Citation2019; Rayner et al., Citation2016). Such fast reading comprehension not only consists of correct identification of individual words, but also requires successful encoding of word order within a sentence. However, word position coding in reading is not always error-free. As demonstrated in a recent speeded grammatical decision study (Mirault et al., Citation2018), participants were slower and more error-prone in judging a transposed-word sequence (e.g. “you that read wrong”) as ungrammatical compared with a control sequence (e.g. “you that read worry”). This novel effect has been consistently found in follow-up investigations (Pegado & Grainger, Citation2019, Citation2020; Snell & Grainger, Citation2019b; Wen et al., Citation2021). As explained in previous studies, the transposed-word effect is assumed to be driven by a combination of noisy bottom-up encoding of word order and the operation of top-down sentence-level syntactic constraints. This combination of bottom-up and top-down processes drives the language processor to interpret a transposed-word sequence like “you that read wrong” as “you read that wrong”. The transposed-word effect is consistent with the assumption that the language processor builds imprecise representations of the input (for a review on shallow language processing, see Christianson, Citation2016), and importantly it contributes to the theoretical debate over the serial versus parallel processing in reading.

One of the most prominent accounts of reading behaviour, the EZ-Reader model (Reichle et al., Citation1998) endorses a strictly one-word-at-a-time serial processing assumption. According to the serial processing view, sequential word order must be encoded incrementally to achieve successful reading comprehension, whereas failure of word position coding is inevitable if the language processor operates in a parallel fashion (Reichle et al., Citation2009). Obviously, the transposed-word effect provides clear evidence for incorrect encoding of word position, which fits better with the parallel processing account (see Pegado & Grainger, Citation2020; Snell & Grainger, Citation2019a, for more discussion). In particular, the transposed-word effect can be perfectly accounted for by Snell et al. (Citation2017, Citation2018) model of parallel orthographic processing and reading. As proposed in their model, when multiple words are processed in parallel, the association between word identities and their locations remains uncertain. This uncertainty will not end up with confusions because an elementary sentence-level representation is quickly generated which then guides the assignment of word identities onto spatiotopic locations along a line of text. For example, having identified word n as an article, the syntactic processor will expect that word n +1 is more likely to be a noun than a verb. Therefore, if both a noun and a verb are processed in parallel, the noun will be assigned to position n +1. Due to such top-down syntactic constraints on allocating word identities to plausible positions, a transposed-word sequence can be perceived as being grammatically correct.

Further evidence in line with Snell and colleagues’ proposal of interactive processing between word identities and sentence-level structures is the sentence superiority effect observed with the Rapid Parallel Visual Presentation (RPVP) procedure (Declerck et al., Citation2020; Snell & Grainger, Citation2017; Wen et al., Citation2019, Citation2021). The first demonstration of a sentence superiority effect using RPVP came from Snell and Grainger (Citation2017). In this behavioural study, a sequence of four words was simultaneously presented for a brief duration (200 ms) and participants were asked to identify one post-cued word from the sequence. The post-cued target words (e.g. “can”) were easier to identify when embedded in a grammatical sequence (e.g. “the man can run”) than embedded in a corresponding ungrammatical scrambled sequence (e.g. “run the can man”). The sentence superiority effect is interpreted as a reflection of parallel word processing which generates an elementary sentence-level representation in grammatical sequences, and this syntactic representation then influences on-going word processing via feedback connections. This interpretation has been further supported by two electrophysiological studies using either the same word-in-sequence identification task (Wen et al., Citation2019) or a grammatical decision task (Wen et al., Citation2021) combined with the RPVP procedure, both of which revealed an N400 reduction in grammatical sequences relative to scrambled sequences. This N400 reduction is assumed to reflect facilitatory feedback to word identities from sentence-level representations which are present in grammatical sequences due to parallel processing. Taken together, the interactive mechanisms in a parallel processing system outlined by Snell and colleagues provide a valid explanation for recent experimental findings in the field.

The present study seeks further evidence in favour of Snell and colleagues’ account of the transposed-word effect. If syntactic constraints are a major contributor to the transposed-word effect, we reasoned that the syntactic distance between the transposed words should modulate the effect. More specifically, syntactic constraints would be stronger for two adjacent words within a syntactic phrase (“have gone”) than across a syntactic phrase (“girls have”) in a sentence (“the girls have gone home” composed of a noun phrase “the girls” and a verb phrase “have gone home”). Therefore, a within-phrase transposition (“the girls gone have home”) should be harder to classify as being ungrammatical relative to an across-phrase transposition (“the have girls gone home”). Following this logic, the present study set out to test the transposed-word effect within or across a syntactic phrase.

Methods

Participants

Forty native French speakers (31 females; mean age = 24.15 years, SD = 3.66) received monetary compensation or course credit for their participation. All participants reported having normal or corrected-to-normal vision, and no history of language impairment. Their average LexTALE_Fr vocabulary score (Brysbaert, Citation2013) was 90.05 (SD = 4.64, range: 80.4–98.2). Data from four additional participants were excluded from the analyses because of low LexTALE_Fr vocabulary scores (< 80). Based on a priori power analysis using the data of Mirault et al. (Citation2018), we achieved 100% power with 40 participants (Brysbaert & Stevens, Citation2018; Green & MacLeod, Citation2016).

Materials and design

We first constructed 80 grammatically correct sentences in French in order to create the critical ungrammatical stimuli. Each of these sentences consisted of five words with an average word length of 5.15 letters (SD = 2.34). The average word frequency in Zipf values was 5.79, SD = 1.09 (Ferrand et al., Citation2010; van Heuven et al., Citation2014). We used the Stanford Parser (Green et al., Citation2011) to validate the syntactic structure of each sentence in order to define whether two adjacent words constitute a phrase. In half of the 80 sentences (e.g. The girls have gone home), a transposition of the words at positions 2 and 3 generated across-phrase transposed sequences (e.g. The have girls gone home), whereas a transposition of the words at positions 3 and 4 generated within-phrase transposed sequences (e.g. The girls gone have home). And this was reversed for the other half of the sentences to counterbalance (e.g. Sometimes the man runs fast). Therefore, for each sentence, two ungrammatical versions were generated by either transposing words at positions 2 and 3 or positions 3 and 4. The average word frequency of the transposed words was 6.0 for within-phrase transpositions and 5.69 for across-phrase transpositions in Zipf values (Ferrand et al., Citation2010; van Heuven et al., Citation2014). Two counterbalanced lists were created, and participants were randomly assigned to one of the lists. In order to have an equal number of grammatical and ungrammatical sequences, another 80 grammatically correct five-word sequences were included as fillers (e.g. These two guys are tall). The words in the filler sequences matched the words used in the critical stimuli in word length (average word length: 5.00 letters, SD = 1.96) and frequency (average word frequency in Zipf values: 5.76, SD = 1.12).

Procedure

The study was approved by the “Comité de Protection des Personnes SUD-EST IV” (No. 17/051). All participants provided their written informed consent before the experiment started. Participants were tested individually in a sound-attenuated experimental room. The stimuli were presented on a monitor using OpenSesame (Mathôt et al., Citation2012). Each participant received a unique random trial order. Each trial began with a central fixation across, presented for 400 ms followed by a 200 ms blank screen. Next, a sequence of five words was simultaneously presented. Participants were asked to decide whether the presented sequence was grammatically correct nor not by pressing buttons on a gamepad connected to the computer. They were instructed to perform the task as quickly and as accurately as possible. The word sequence remained on the screen up to a maximum of 3000 ms or until the participants responded (Snell & Grainger, Citation2019b). Feedback was then provided with a green (correct) or red (incorrect) dot presented for 700 ms. The inter-trial interval was set at 1000 ms. Prior to the experiment, 16 practice trials were used to familiarise the participants with the procedure.

Data analysis

Reaction times of incorrect responses were removed (13.75%), and values beyond 3.5 standard deviations from the mean of each condition for each participant were discarded as outliers (0.29%). To reduce the skewness in the distribution, reaction times were log transformed. Using the lme4 (Bates et al., Citation2015) and lmerTest (Kuznetsova et al., Citation2017) packages in R (Version 3.6.2; R Core Team, Citation2019), the response times were analysed with a linear mixed-effects model, and the accuracy data were analysed with a logistic mixed-effects model (Jaeger, Citation2008). Participants and items were included as random effects (Baayen et al., Citation2008), and by-participant and by-item random slopes were also included (Barr et al., Citation2013).

Results

Participants were slower in judging the within-phrase transposed sequences as ungrammatical relative to across-phrase transposed sequences, mean RTs: 1400 ms vs. 1324 ms respectively, t = 3.445, p < .001 (see ). Similarly, participants were more likely to mistake the within-phrase transposed sequences as being grammatical than the between-phrase transposed sequences, mean accuracy rates: 80.7% vs. 91.8% respectively, z = −4.10, p < .001 (see ).

Table 1. Results of mixed-effects modelling on RT data.

Table 2. Results of logistic mixed-effects modelling on accuracy data.

A post-hoc analysis on the RT data was conducted to address the potential concern that an ungrammaticality may be detected earlier in across- than in within-transposed sequences if words are processed sequentially in a left-to-right fashion. For across-phrase transposed sequences, the critical point of ungrammaticality occurred at position 2 and position 3 for transpositions at positions 2–3 and positions 3–4 respectively. For the within-phrase condition, the critical point of ungrammaticality occurred at position 3 and position 4 for transpositions at positions 2–3 and 3–4 respectively. The critical point of ungrammaticality was then entered as a fixed factor in the mixed-effect modelling analysis. The difference between two types of transpositions remained significant, b = 0.11239, SE = 0.04583, t = 2.452, p = 0.01640, suggesting that the observed difference between across-phrase and within-phrase transposed conditions is not driven by the critical point of ungrammaticality.

Discussion

The present study examined the impact of syntactic structure on word position coding by comparing two types of transposed-word sequences in a grammatical decision task. As predicted, we found that deciding a word sequence as ungrammatical was harder when the transposition occurred within a syntactic phrase compared with transpositions that occurred across a syntactic phrase. This, we suggest, is due to greater top-down syntactic influences on re-ordering transposed words within a syntactic phrase where two adjacent words are more closely bound.

As noted in the introduction, serial and parallel processing views make contrasting predictions concerning word order errors during reading. Proponents of serial processing reason that encoding words sequentially is a necessary prerequisite for encoding word order within a sentence (Reichle et al., Citation2009). Although words within a sentence can be fixated out of order under the assumption of serial processing, it is generally agreed that parallel processing is more susceptible to incorrect word ordering. It is apparent that the transposed-word effect is a clear manifestation of incorrect word ordering, thus providing support for the parallel processing view (Mirault et al., Citation2018; Pegado & Grainger, Citation2019, Citation2020; Snell & Grainger, Citation2019b; Wen et al., Citation2021). According to Snell et al.'s (Citation2017, Citation2018) model of parallel orthographic processing and reading, there are two mechanisms that contribute to transposed-word effects. One is the noisy association of word identities to spatiotopic locations along a line of text, and the other concerns the top-down constraints imposed by sentence-level syntactic structures on word order encoding. The top-down contribution posited in this parallel reading framework is unequivocally supported by the current study which reveals top-down syntactic processes influencing word position coding.

Although the results of the present study clearly respond to our specific research question, one potential issue is that we did not include a baseline control condition. Following the same logic as Snell and Grainger (Citation2019b), we considered that it was not necessary to replicate the transposed-word vs. control manipulation as in Mirault et al. (Citation2018) because such a replication would not provide additional leverage with respect to interpreting the current findings. Moreover, the choice of control condition is not straightforward. Following the logic of transposed-letter effects (e.g. Perea & Lupker, Citation2004), the two transposed words should be replaced by two different words that cannot be re-arranged to form a correct sentence (e.g. “the girls like were home” as a control for “the girls gone have home”). This procedure was adopted by Pegado and Grainger (Citation2019; Citation2020) with the same-different matching task. Pegado and Grainger (Citation2020) also found robust transposed-word effects when a single word was substituted in the control condition. This was also the case in the Mirault et al. (Citation2018) study, where the control condition that was created by changing the final word in the transposed-word sequences with a different word (e.g. “the white was cat big” / “the white was cat slowly”) and using pairs of sentences in order to match words in the two conditions (e.g. “the black dog ran slowly” paired with “the white cat was big” to generate the transposed-word and control sequences matched to the previous example: “the black ran dog slowly” / “the black ran dog big”). With this particular control procedure, purely lexical-level influences between the transposed-word and control conditions were minimised. To conclude on this complex issue, we surmise that there are two easy options for future research on transposed-word effects obtained with the grammatical decision task (other options are of course open for investigations using other tasks, such a priming version of the grammatical decision task). One is to investigate how the degree of ungrammaticality of the control condition impacts on the size of transposed-word effects. The other is to investigate how different types of transposition impact on the effects. It is the second option that was adopted in the present work.

Notwithstanding the existing research on word transpositions in favour of the parallel processing view, there are some caveats to be pointed out. First, the sentences used in the present study as well as previous studies on transposed-word effects were relatively short and simple. We note that using longer and more complex sentences (e.g. garden path sentences) would be an interesting avenue for future research. It should be also acknowledged that the existing studies on transposed-word effects have not manipulated sentential constraint, which would also be an interesting topic for future investigations. Furthermore, the present study used an artificial experimental task, and reading to make grammatical decisions might be different from reading for comprehension. However, we would point to the potential of the grammatical decision task as a means to investigate sentence reading in analogy with the widely-used lexical decision task applied in investigations of single word reading. Moreover, the transposed-word effect is not specific to the dependent measures obtained in the grammatical decision task. Mirault et al. (Citation2020) reported transposed-word effects in a measure of total reading time obtained with eye-movement recordings, and Chang et al. (Citation2020) found a transposed-character effect in Chinese in a natural sentence reading experiment with eye-movement recordings. Finally, as already noted above, Pegado and Grainger (Citation2020) found transposed-word effects in a same-different matching task in which participants were presented with two sequences of words and simply had to judge whether the two sequences were the same or not. Different judgments were found to be harder to make when the difference involved a word transposition compared with a word substitution.

In sum, the current results demonstrate the impact of syntactic constraints on word position coding and thus provide clear evidence for the parallel and interactive mechanisms underlying reading as postulated by Snell et al. (Citation2017, Citation2018). We conclude that top-down syntactic constraints guide the parallel processing of words and the encoding of word information during sentence reading. Future research is necessary to test whether current findings are generalisable to various types of syntactic structures and different experimental paradigms.

Supplemental material

Supplemental Appendix

Download MS Word (30.3 KB)

Acknowledgments

This research was supported by ERC grant ADG 742141. We thank Charlotte Leflaëc and Agnès Guerre-Genton for their help with stimuli creation and data collection.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by European Research Council [grant number ADG 742141].

References