80
Views
0
CrossRef citations to date
0
Altmetric
Commentaries

Neuroscience as a window into discourse processing: the influence of Walter Kintsch

ORCID Icon

ABSTRACT

Neuroscientific methods have greatly enhanced our understanding of cognition and language processing. However, their results are not always in line with predictions from models of discourse comprehension. In this essay I reflect on the impact Walter Kintsch’s work has had on neuroscientific investigations and on how models of discourse comprehension need to accommodate the blurring of distinctions that were previously taken for granted, and additionally, I sketch out a few of the recent developments in the field.

Neuroimaging as a window into discourse processing

When I started my graduate studies at the University of Colorado in Boulder in 1988, neuroscientists were only starting to develop methods to observe brain functions online. Neuropsychology and neurolinguistics were research endeavors medical departments were concerned with but that cognitive science could do well without. A few years later, toward the end of my doctoral studies, the field had greatly expanded; I even included one (!) reference to an ERP-study (event-related potentials) in my dissertation in 1994. I cannot remember if my advisor, Walter Kintsch, actually commented on that short paragraph at all—neuroscience was just not part of our toolbox yet. In this short period, positron emission tomography (PET) and functional magnetic resonance imaging (fMRI) had also come to fruition. To my knowledge, the very first—and still amazingly accurate—study on the neuroanatomy of text comprehension was published in 1993 (Mazoyer et al., Citation1993), but it was not yet obvious that neuroimaging would soon become widely available for the study of language processing.

The search for brain regions involved in text comprehension

The first ideas on the link between brain function and text comprehension came from the observation that some neurological patients without aphasic language deficits had considerable problems with communication at the text level. Symptoms such as incoherent language production, problems with inferencing, and difficulties with social and pragmatic communication had been described, often for patients with lesions in the right hemisphere. The right hemisphere hypothesis, thus, stated that word- and sentence-level language processing is mostly a left-sided affair, while including context and nonliteral language also involves both right and left hemispheres of the brain (Jung-Beeman, Citation2005). Neuroimaging studies did not readily confirm this idea, though (cf. Ferstl, Citation2007): Bilateral activation was observed even on the word level or sentence level (e.g., Mazoyer et al., Citation1993), and text comprehension did not always engage the right hemisphere. More-fine-grained empirical dissociations of subprocesses were needed. Obviously, the conceptual frameworks provided by Walter (Kintsch, Citation1988, Citation1998; Kintsch & van Dijk, Citation1978) were a good starting point.

Studies on cohesion and its consequences for coherence (Ferstl & von Cramon, Citation2001; Robertson et al., Citation2000), on inferencing (e.g., Ferstl & von Cramon, Citation2002; Friese et al., Citation2008; Kuperberg et al., Citation2006; Mason & Just, Citation2004; Virtue et al., Citation2008), and on situation model processing (Ferstl et al., Citation2005; Speer et al., Citation2007; Whitney et al., Citation2009) were conducted. The results converged on the key players: Besides the perisylvian language cortex, the activation patterns suggested an important role for the anterior temporal lobes, the left inferior frontal gyrus, and two midline regions—namely, the medial prefrontal cortex and the posterior cingulate cortex. “Fine,” Walter would have said, “Keep up the good work!” However, detailed inspection showed that prediction was close to impossible; which of these regions would show up in which fine-grained comparison remained a puzzle. The costs of fMRI prohibited the rigorous experimentation known from behavioral studies, varying any and all possible factors, such as task requirements, types of materials, imaging method, inference type, or participant characteristics. Attempts at such a program remained singular spotlights.

Nevertheless, I suggested some rather plausible mappings of Kintschian subprocesses of comprehension to brain regions (Ferstl, Citation2007; Ferstl et al., Citation2008). In particular, the anterior temporal lobes seemed to become engaged whenever words are combined to idea units or propositions (cf. Pylkkänen, Citation2020, for a recent account based on the linguistic concept composition). Interestingly, this neuroscientific finding might be interpreted as evidence for the textbase representation, in contrast to two-stage models of text comprehension that were partly based on neuroscientific observations of direct, embodied resonance (cf. Zwaan, Citation2004). Easy bridging inferences require processing of semantic information, as in the left inferior frontal lobe, and more strategic, explicit inferences involve the frontomedial cortex. However, other researchers developed quite different but, given the sparse data base, equally plausible interpretations. For instance, Mason and Just (Citation2006) agreed on the function of the IFG for inferencing but saw the anterior temporal lobe (aTL) as important for text integration (rather than construction) and introduced a protagonist’s perspective interpreter for explaining the unexpected importance of the medial prefrontal cortex during narrative comprehension. Hagoort (Citation2013) postulated unification as the process of “deriving new and complex meaning from the lexical building blocks” (p. 1)—a definition reminiscent of propositionalization or composition—as the function of the left IFG rather than the aTL. Finally, Egidi and Caramazza (Citation2016) studied what they named accumulation and evaluation using an inconsistency paradigm (cf. Albrecht & O’Brien, Citation1993) very similar to the ones used in studies informed by Kintsch’s theory (cf. Ferstl et al., Citation2005; Helder et al., Citation2017).

Going beyond cognition

While the search for regions implementing Kintsch’s theory did not seem to be too successful, brain imaging confirmed what he taught his students: Language comprehension is not a modular, specialized function, but it is embedded in the cognitive system in a holistic way. Text comprehension without cognition—in particular, working memory, background knowledge, attention, and reading goals—is inconceivable (Kintsch, Citation1998). Consequently, the results from imaging studies yielded activations not only in what had been termed the language network—that is, Broca’s and Wernicke’s areas (cf. Fedorenko & Thompson-Schill, Citation2014; Lipkin et al., Citation2002)—but in the entire brain. Depending on the task and on the particular comparison, regions known to play a role in these and related cognitive processes were shown to be involved as well.

More importantly, though, our participants’ brains surprised us every now and again, as I have described previously (Ferstl, Citation2015). One region whose purpose was not readily mapped to one of the text comprehension subprocesses was the aforementioned medial prefrontal cortex. Ventromedial activations, usually related to reward and emotions in social interactions, were reported by Maguire et al. (Citation1999), while the dorsomedial activations in inference studies (cf. Ferstl, Citation2015) had been implicated in the so-called theory-of-mind processes (Ferstl & von Cramon, Citation2002; Mar, Citation2011), or the processing of self-relevant information. Such noncognitive domains had been targeted in few studies in connection with text comprehension—for example, to elucidate the role of motivation or mood for successful comprehension or the representation of emotion—but they had not been explicitly included in theories of comprehension yet. The resulting paradigm shift from a purely cognitive theory to a more holistic view of comprehension as an integral part of and means for social communication was the most important consequence of neuroimaging studies of discourse processing.

Recent developments

The above summary of the neuroimaging literature on text comprehension has focused on the decade between 2000 and 2010. This period was the heyday of neuroimaging. The method was exciting, and each experiment yielded rich data. We gained truly novel insights that we happily communicated at specialized events, such as a conference at the Hanse-Institute for Advanced Studies in Delmenhorst in 2003 (organized by Chuck Perfetti and Franz Schmalhofer; Schmalhofer & Perfetti, Citation2007) and a week-long workshop at the Lorentz-Institute in Leiden (organized by Paul van den Broek, Jos van Berkum, and myself). In the last decade, however, rather than seeing an explosion of the field, neuroimaging has moved away from basic text comprehension research, a fate shared by other topics in classical experimental psychology. The replication crisis (cf. Zwaan et al., Citation2018), which hit neuroscience particularly hard (Nieuwland et al., Citation2018; Poldrack et al., Citation2017), and the shift from localization of specific brain regions to a network approach has made it less promising to conduct fine-grained, psycholinguistic experiments in the scanner. On the other hand, recent technological developments enable researchers to employ big-data approaches combined with bottom-up analyses to uncover patterns of brain function without the need for model-based hypothesis testing. This attractive and exciting paradigm shift seems promising even if it comes at the expense of text linguistic theoretical specificity.

Walter Kintsch’s lasting impact

Walter Kintsch would have embraced these trends, I am sure, despite recent doubts about the replicability and stringency of systemic neuroimaging research. When I once complained that his CI-model had too many degrees of freedom and could explain pretty much anything, Walter shrugged it off and told me he was more interested in the proof of concept rather than in falsifiability. He was always excited about novel possibilities and insights and focused on the big picture as much as on methodological issues. In this sense, his work was always ahead of its time. Although ChatGPT itself is not aware of its roots (see the funny query by Janice Keenan, this volume), Walter Kintsch’s work on latent semantic analysis (e.g., Foltz et al., Citation1998) helped to lay the groundwork for large language models (LLMs). He always went forward in his thinking. When, as a first-year student, I struggled with how the construction-integration model (Kintsch, Citation1988) fit with the distinction between micro- and macrostructure (Kintsch & van Dijk, Citation1978) he gave some quick response, but I could tell he found the question unproductive and a bit backward tending. When challenged in a seminar discussion about a detail in his work, he just shrugged it off: “Scientific theories merely are a proposal, a suggestion, based on what we know at a given point in time. It is the job of the next generation of researchers to take them apart and to come up with something better.” Neuroscience was a research field Walter happily left to his students and colleagues, but his thinking will live on in the work of the future generations.

Disclosure statement

No potential conflict of interest was reported by the author.

References

  • Albrecht, J. E., & O’Brien, E. J. (1993). Updating a mental model: Maintaining both local and global coherence. Journal of Experimental Psychology: Learning, Memory, and Cognition, 19(5), 1061–1070. https://doi.org/10.1037/0278-7393.19.5.1061
  • Egidi, G., & Caramazza, A. (2016). Integration processes compared: Cortical differences for consistency evaluation and passive comprehension in local and global coherence. Journal of Cognitive Neuroscience, 18(10), 1568–1583. https://doi.org/10.1162/jocn_a_00982
  • Fedorenko, E., & Thompson-Schill, S. L. (2014). Reworking the language network. Trends in Cognitive Science, 18(3), 120–126. https://doi.org/10.1016/j.tics.2013.12.006
  • Ferstl, E. C. (2007). The functional neuroanatomy of text comprehension: What’s the story so far? In F. Schmalhofer & C. Perfetti (Eds.), Higher level language processes in the brain: Inference and comprehension processes (pp. 53–102). Erlbaum.
  • Ferstl, E. C. (2015). The neuroanatomy of inference processes. In E. J. O’Brien, A. E. Cook, & R. F. Lorch Jr. (Eds.), Inferences during reading (pp. 230–259). Cambridge University Press.
  • Ferstl, E. C., Neumann, J., Bogler, C., & Von Cramon, D. Y. (2008). The extended language network: A meta-analysis of neuroimaging studies on text comprehension. Human Brain Mapping, 29(5), 581–593. https://doi.org/10.1002/hbm.20422
  • Ferstl, E. C., Rinck, M., & von Cramon, D. Y. (2005). Emotional and temporal aspects of situation model processing during text comprehension: An event-related fMRI study. Journal of Cognitive Neuroscience, 17(5), 724–739. https://doi.org/10.1162/0898929053747658
  • Ferstl, E. C., & von Cramon, D. Y. (2001). The role of coherence and cohesion in text comprehension: An event-related fMRI study. Cognitive Brain Research, 11(3), 325–340. https://doi.org/10.1016/S0926-6410(01)00007-6
  • Ferstl, E. C., & von Cramon, D. Y. (2002). What does the frontomedian cortex contribute to language processing: Coherence or theory of mind? Neuroimage, 17(3), 1599–1612. https://doi.org/10.1006/nimg.2002.1247
  • Foltz, P. W., Kintsch, W., & Landauer, T. K. (1998). The measurement of textual coherence with latent semantic analysis. Discourse Processes, 25(2–3), 285–307. https://doi.org/10.1080/01638539809545029
  • Friese, U., Rutschmann, R., Raabe, M., & Schmalhofer, F. (2008). Neural correlates of inference processes in text comprehension: An event-related functional magnetic resonance imaging study. Journal of Cognitive Neuroscience, 20(11), 2110–2124. https://doi.org/10.1162/jocn.2008.20141
  • Hagoort, P. (2013). MUC (Memory, Unification, Control) and beyond. Frontiers in Psychology, 4, 416. https://doi.org/10.3389/fpsyg.2013.00416
  • Helder, A., van den Broek, P., Karlsson, J., & van Leijenhorst, L. (2017). Neural correlates o coherence-break detection during reading of narratives. Scientific Studies of Reading, 21(6), 463–479. https://doi.org/10.1080/10888438.2017.1332065
  • Jung-Beeman, M. (2005). Bilateral brain processes for comprehending natural language. Trends in Cognitive Sciences, 9(11), 512–518. https://doi.org/10.1016/j.tics.2005.09.009
  • Kintsch, W. (1988). The role of knowledge in discourse comprehension: A construction-integration model. Psychological Review, 95(2), 163–182. https://doi.org/10.1037/0033-295X.95.2.163
  • Kintsch, W. (1998). Comprehension: A paradigm for cognition. Cambridge University Press.
  • Kintsch, W., & van Dijk, T. A. (1978). Toward a model of text comprehension and production. Psychological Review, 85(5), 363–394. https://doi.org/10.1037/0033-295X.85.5.363
  • Kuperberg, G. R., Lakshmanan, B. M., Caplan, D. N., & Holcomb, P. J. (2006). Making sense of discourse: An fMRI study of causal inferencing across sentences. Neuroimage, 33(1), 343–361. https://doi.org/10.1016/j.neuroimage.2006.06.001
  • Lipkin, B., Tuckute, G., Affourtit, J., Small, H., Mineroff, Z., Kean, H., Jouravlev, O., Rakocevic, L., Pritchett, B., Siegelman, M., & Hoeflin, C. (2002). Probabilistic atlas for the language network based on precision fMRI data from >800 individuals. Scientific Data, 9(1), 529. https://doi.org/10.1038/s41597-022-01645-3
  • Maguire, E. A., Frith, C. D., & Morris, R. G. (1999). The functional neuroanatomy of comprehension and memory: The importance of prior knowledge. Brain, 122(10), 1839–1850. https://doi.org/10.1093/brain/122.10.1839
  • Mar, R. A. (2011). The neural bases of social cognition and story comprehension. Annual Review of Psychology, 62(1), 103–134. https://doi.org/10.1146/annurev-psych-120709-145406
  • Mason, R. A., & Just, M. A. (2004). How the brain processes causal inferences in text. Psychological Science, 15(1), 1–7. https://doi.org/10.1111/j.0963-7214.2004.01501001.x
  • Mason, R. A., & Just, M. A. (2006). Neuroimaging contributions to the understanding of discourse processes. In M. J. Traxler & M. A. Gernsbacher (Eds.), Handbook of psycholinguistics (2nd ed., pp. 765–799). Elsevier.
  • Mazoyer, B. M., Tzourio, N., Frak, V., Syrota, A., Murayama, N., Levrier, O., Salamon, G., Dehaene, S., Cohen, L., & Mehler, J. (1993). The cortical representation of speech. Journal of Cognitive Neuroscience, 5(4), 467–479. https://doi.org/10.1162/jocn.1993.5.4.467
  • Nieuwland, M. S., Politzer-Ahles, S., Heyselaar, E., Segaert, K., Darley, E., Kazanina, N., Von Grebmer Zu Wolfsthurn, S., Bartolozzi, F., Kogan, V., Ito, A., Mézière, D., Barr, D. J., Rousselet, G. A., Ferguson, H. J., Busch-Moreno, S., Fu, X., Tuomainen, J., Kulakova, E., Husband, E. M., … Huettig, F. (2018). Large-scale replication study reveals a limit on probabilistic prediction in language comprehension. eLife, 7, e33468. https://doi.org/10.7554/eLife.33468
  • Poldrack, R. A., Baker, C. I., Durnez, J., Gorgolewski, K. J., Matthews, P. M., Munafò, M. R., Nichols, T. E., Poline, J.-B., Vul, E., & Yarkoni, T. (2017). Scanning the horizon: Towards transparent and reproducible neuroimaging research. Nature Reviews, Neuroscience, 18(2), 115–126. https://doi.org/10.1038/nrn.2016.167
  • Pylkkänen, L. (2020). Neural basis of basic composition: What we have learned from the red–boat studies and their extensions. Philosophical Transactions of the Royal Society B, 375(1791), 20190299. http://doi.org/10.1098/rstb.2019.0299
  • Robertson, D. A., Gernsbacher, M. A., Guidotti, S. J., Robertson, R. R., Irwin, W., Mock, B. J., & Campana, M. E. (2000). Functional neuroanatomy of the cognitive process of mapping during discourse comprehension. Psychological Science, 11(3), 255–260. https://doi.org/10.1111/1467-9280.00251
  • Schmalhofer, F., & Perfetti, C. A. (Eds.). (2007). Higher level language processes in the Brain: Inference and comprehension processes. Erlbaum.
  • Speer, N. K., Zacks, J. M., & Reynolds, J. R. (2007). Human brain activity time-locked to narrative event boundaries. Psychological Science, 18(5), 449–455. https://doi.org/10.1111/j.1467-9280.2007.01920.x
  • Virtue, S., Parrish, T., & Jung-Beeman, M. (2008). Inferences during story comprehension: Cortical recruitment affected by predictability of events and working memory capacity. Journal of Cognitive Neuroscience, 20(12), 2274–2284. https://doi.org/10.1162/jocn.2008.20160
  • Whitney, C., Huber, W., Klann, J., Weis, S., Krach, S., & Kircher, T. (2009). Neural correlates of narrative shifts during auditory story comprehension. NeuroImage, 47(1), 360–366. https://doi.org/10.1016/j.neuroimage.2009.04.037
  • Zwaan, R. A. (2004). The immersed experiencer: Toward an embodied theory of language comprehension. Psychology of Learning and Motivation: Advances in Research and Theory, 44, 35–62.
  • Zwaan, R. A., Etz, A., Lucas, R. E., & Donnellan, M. B. (2018). Making replication mainstream. Behavioral and Brain Sciences, 41, e120. https://doi.org/10.1017/S0140525X17001972