1,673
Views
1
CrossRef citations to date
0
Altmetric
Brief Report

The effects of episode spacing on adult's reports of a repeated event

, , , &
Pages 879-889 | Received 22 Dec 2022, Accepted 29 Mar 2023, Published online: 19 Apr 2023

ABSTRACT

Witnesses’ reports of repeated events have been the focus of much research; however, the spacing interval between each episode of the event has differed greatly. The aim of the current study was to determine whether spacing interval affects participants’ memory reports. Adults (N = 217) watched one (n = 52) or four videos depicting workplace bullying. The repeated event participants watched the four videos all in one day (n = 55), one per day over four consecutive days (n = 60), or one every three days over 12 days (n = 50). One week after the last (or only) video, participants reported on that video and answered some reflective questions about the procedure. Repeated-event participants also reported on what usually happens across the videos. Single-event participants reported proportionally more accurate information about the target video than repeated-event participants, and spacing interval did not affect repeated event participants’ accuracy. However, accuracy scores were close to ceiling while errors rates were at floor levels, preventing us from drawing strong conclusions. We found some evidence that episode spacing affected participants’ perceptions of their memory performance. Overall, spacing may have a minimal effect on adults’ memory for repeated events, but further research is required.

Research investigating children’s and adults’ memories for episodes of repeated events has important implications for investigative interviewing. Witnesses are required to report on individual episodes of repeated events like sexual abuse, family violence, and workplace bullying with as much detail and accuracy as possible (Fair Work Act, Citation2009; PPP v. R, Citation2010; R v. Baker, Citation2002). Results show that when participants report on a target episode (commonly the last episode), they are proficient at reporting details that did not change across episodes, but they often confuse details that varied across episodes (see Dilevski et al., Citation2020; Woiwod et al., Citation2019 for reviews). To date, these studies have adopted vastly different spacing intervals between the presentation of episodes, from experiencing all episodes within 1 h (MacLean et al., Citation2018) to experiencing them across four weeks (Deck & Paterson, Citation2020). Deck and Paterson (Citation2020) argued that short spacing prevents the generalisability of results, as repeated event episodes in a forensic context are unlikely to wholly occur within a short period of time. However, little consideration has been given to the effects that the spacing interval between episodes may have on participants’ reports.

Only one study to date has systematically manipulated episode spacing intervals. Price and colleagues (Citation2006) examined 7–8-year-old children’s suggestibility for an episode of a repeated event. Children experienced structured play sessions consisting of eight memory items (e.g., receiving a sticker) that had different instantiations in each episode (e.g., children received a car sticker, an aeroplane sticker, a truck sticker, and a motorcycle sticker across the episodes). Some children experienced only a single episode, while others experienced four episodes that were massed into one session, spaced across four days, or spaced across 10 days. In a later biasing interview, instantiations that were not experienced—but were consistent with the experienced details—were suggested to have occurred in a target episode (e.g., the interviewer told children they received a scooter sticker). A different interviewer then elicited children’s reports of the target episode. Single-event children provided more correct instantiations than children in the three repeated event conditions, which did not differ from each other. In the repeated event conditions, children reported more instantiations from a non-target episode (internal intrusions, e.g., reporting the truck sticker when the motorcycle sticker was received in the target episode) when the episodes were massed into one session compared to when they were spaced over four or 10 days. Further, children who experienced the episodes spaced over four and 10 days reported more suggested instantiations than those in the single condition when specifically asked about each memory item. Price and colleagues’ (Citation2006) findings suggest that spacing episodes across multiple days makes children more suggestible to interviewer-provided details, but less prone to internal intrusions (compared to when all episodes are in one session).

To date, the effects of spacing interval have not been explored in adults. We might expect differences between children’s and adults’ memories because adults’ memory systems are more developed than children’s (Schneider, Citation2015). For example, with repeated experience adults encode a detailed and complex generic representation in memory of what usually happens at the event (i.e., a script; Schank & Abelson, Citation1977). While children also encode scripts, they are slower to create them and their scripts are more rudimentary than adults’ (Hudson et al., Citation1992; Hudson & Mayhew, Citation2009). In addition, adults are better than children at evaluating the source of a recalled detail; for example, deciding if a particular instantiation occurred in the first or second episode (i.e., a source-monitoring decision; Lindsay et al., Citation1991; Ryan, Citation2009). Poor source-monitoring decisions lead to internal intrusion errors, which children make more often than adults (Lindsay et al., Citation1991; Ryan, Citation2009). Adults are also more proficient at creating verbatim memory traces that contain the happenings of a particular episode of a repeated event and gist memory traces for the general meaning of events (e.g., the stickers had transport-related pictures; Brainerd & Reyna, Citation2004; Brainerd et al., Citation2008; Odegard et al., Citation2009). Overall, adults’ more developed source monitoring abilities and verbatim traces may mean that spacing has less effect on their reports of internal intrusions and accurate details (compared to children), but adults’ proficiency to extract gist makes them more susceptible to errors involving information consistent with the general meaning of the event (like reporting a non-experienced form of transport; see Brainerd et al., Citation2008 for review).

Whilst no study has systematically manipulated the spacing interval of repeated event episodes with an adult sample, plentiful research has explored the effects of spacing interval on adults’ ability to learn cognitive stimuli like word pairs and foreign vocabulary (e.g., Cepeda et al., Citation2006; Delaney et al., Citation2017). Results show that adult learning improves as the spacing intervals between repeated stimuli increases, particularly when compared to the presentation of stimuli together in one massed session (the spacing effect; Ebbinghaus, Citation1885; see Smith & Scarf, Citation2017, for a review). Two types of theories explain this effect. Encoding variability theories propose that increased spacing intervals between stimuli causes contextual differences during encoding (such as different testing conditions and locations), whereas stimuli presented together in one massed session overlap in contextual features and are encoded similarly to each other. More contextual variety at encoding increases the number of retrieval pathways, which—in turn—increase the likelihood of successful retrieval (Crowder, Citation1976; Melton, Citation1970). Deficient processing theories propose that stimuli receive less processing when presented in one massed session because attention is divided between them in working memory, and the memory traces for repeated stimuli may quickly decay. Presenting stimuli at larger spacing intervals increases the attention—and therefore quality of processing—for each individual stimulus, which enhances consolidation of the stimuli in long-term memory to facilitate later retrieval (Dellarosa & Bourne, Citation1985; Koval, Citation2019).

The spacing effect literature has focused on cognitive stimuli like word pairs, whereas the repeated event literature has focused on personally experienced events, such as activity sessions. Compared to cognitive stimuli, experienced events are more complex and engaging, are intermingled with personal interpretations and connections in memory, and are supported by a script containing typically occurring details and temporal order of what usually happens (Conway, Citation2005; Hudson et al., Citation1992). Further, memory for experienced events becomes more general over time (e.g., remembering the gist of events, or what usually occurs; Brainerd & Reyna, Citation2004; Myles-Worsley et al., Citation1986; Sekeres et al., Citation2016). Overall, these differences may mean that spacing effects are less pronounced for experienced events.

Whilst the spacing of experienced repeated event episodes has not been considered with adult participants previously, a body of research has examined how adults naturally segment an experienced event into smaller units. Event segmentation theory proposes that humans continuously parse live events into sub-units, typically establishing boundaries between units at points of change in the event (such as a scene change; Zacks et al., Citation2007). Research on event segmentation has reliably found that events are better recalled if they are segmented rather than considered as one continuous event (Flores et al., Citation2017), and that spacing the units by inserting breaks or pauses between them can further improve memory for the event (Boltz, Citation1992; Schwan et al., Citation2000). Thus, we might expect larger spacing intervals between repeated event episodes to improve memory for adult participants.

The current study examined the effects of episode spacing on adults’ memory reports of a repeated event. Adult participants watched one (single) or four (repeated) videos showing workplace bullying incidents. The repeated videos were shown all in one day (massed), spaced across four days (spaced-short), or spaced across 12 days (spaced-long). One week after the final (or only) video, participants recalled what happened in the target (last or only) episode. Those in the repeated event conditions also recalled what usually happened across the series. Given that best-practice interviewing should avoid suggestive questions (e.g., Risan et al., Citation2020), we elicited adults’ memory reports using only non-suggestive prompts (unlike Price and colleagues’ (Citation2006) study with children). Finally, since witnesses’ perceptions of their memory reports influence jurors’ assessments of witnesses’ testimony (Tenney et al., Citation2007), our participants were asked reflective questions about their experience of reporting on the event, including rating their perceived accuracy of their report.

We made three predictions. Based on Price et al.’s (Citation2006) experiments with children, we first predicted that single-event participants should provide a higher proportion of accurate details from the target episode than repeated-event participants. Second, we predicted that participants in the massed condition should have a higher proportion of internal intrusions than participants in the two spaced conditions. We were also interested in reports of non-experienced details, because adults have previously been shown to be more susceptible to errors that are consistent with the gist of experienced word lists after experiencing spaced rather than massed lists (Dubuisson et al., Citation2012). Since spacing the episodes should increase participants’ abilities to extract their gist, our third prediction was that participants in the two spaced conditions should have a higher proportion of external intrusion errors (details that did not occur but are consistent with the gist of the videos) than participants in the massed condition. We examined participants’ reports about what usually happened because we were interested to see if spacing condition affected general memory reports, and we had participants reflect on their experience of recalling the event to investigate their insights into their own memory accuracy. We did not make formal hypotheses for these variables.

Method

Participants

An a-priori power analysis showed that 176 participants were required to detect medium effect sizes (ηp2 = .06) with power of .80. A medium effect size was chosen to reflect the medium-large effects detected for spacing intervals in previous work with children (e.g., Price et al., Citation2006). We over-recruited due to expected drop-out: 230 participants were recruited through Prolific. However, 13 participants reported that they did not remember watching the videos at all and were excluded from the sample. The final sample consisted of 217 participants (18–62 years, M = 31.84, SD = 10.80; nfemale = 122, nmale= 94, nmissing = 1).

Participants experienced a single event (n = 52) or a repeated event with four episodes presented in one massed session (massed; n = 55), spaced over 4 days (spaced–short; n = 60), or spaced over 12 days (spaced–long; n = 50). There were no age or gender differences across the four conditions, ps ≥ .56. Assignment of participants to event conditions was not random, as each condition was advertised one-at-a-time. Participants signed up to a condition, and when one condition was full the next condition was advertised, etc. Once participants signed up to a condition, they were excluded from seeing advertisements for future conditions. This procedure allowed the study advertisements to accurately reflect expected time involvement and reimbursement, which differed across conditions.

Materials

Videos

Participants watched either one or four workplace bullying videos (see also Dilevski et al., Citationin press). Videos were 1–2 min long and focused on an incident of bullying in an office setting between the same two workers (the perpetrator, Rick, and the victim, Dan). Some details varied across the videos while others remained stable (see for a summary). For a description of the narrative for each video, see the coding manual available at: https://osf.io/6e2dc/?view_only=592607266537414ebe911e5d53d7620c . Repeated-event participants watched the four videos in the same orderFootnote1, and the last video was the target video. Participants in the single event condition only watched the target video without viewing the three preceding videos.

Table 1. Key details that varied and remained stable across the video series.

Memory report

Participants answered questions online to provide their memory reports (full questionnaire available online). For repeated-event participants, the questionnaire contained three key tasks: 1) a free recall report of the target video, 2) a free recall report of what usually happens across the video series (i.e., their script), and 3) reflective questions about their perceptions of recalling the videos. Single-event participants completed tasks 1 and 3 only. The tasks are described in more detail below.

Reporting the target video

Participants described everything that happened in the target video in a text box. They were told to describe everything from the beginning to the end of the target video in as much detail as possible, and not to guess what happened but to write everything they remembered. For repeated-event participants, instructions clearly identified “the last video” as the target.

Reporting what usually happens

Repeated-event participants described everything that usually happens in the videos in a text box (i.e., they described their event script). They were told to describe everything in as much detail as possible, and not to guess what happened but to write everything they remembered. Instructions specified that this question pertained to all the videos.

Reflective questions

Participants reflected on their experience of recalling the videos. First, participants rated how (1) accurately they believed that they recalled the target video, and (2) how difficult it was to remember the target video, from 1 (Not at all) to 6 (Extremely). Second, repeated-event participants completed the same ratings for their reports of what usually happens. Third, all participants were asked a qualitative question: “Is there anything else that would have helped you remember the video better?” and provided free-text space to respond.

Procedure

Ethics approval was granted by Deakin University. The study was advertised on Prolific to subscribers living in Australia and aged 18–65 years. To ensure that participants did not attempt to memorise or make records about the video/s, the study was advertised as an examination of people’s emotional responses to watching workplace bullying interactions. Participants provided informed consent and demographic information (e.g., age, gender). The single event condition watched the target video, then the session concluded for the day. The massed condition watched all four videos, then the session concluded for the day. The two spaced conditions watched the first video in the workplace bullying series, then the session concluded for the day. The spaced–short condition then watched the remaining videos at a rate of one-per-day for the next three days. The spaced–long condition watched the remaining videos on every third day (i.e., they watched the second video three days after the first video, then watched the third video three days after that, etc.). All participants (including the single event) always completed a filler task for 1 min after each video to avoid them rehearsing any details. In the filler task, participants were presented with pairs of numbers and had to judge whether the numbers were the same.

One week after watching the final (or only) video, participants completed the online questionnaire. A one-week delay was selected to replicate the delay in Price and colleagues’ (Citation2006, Experiment 2) study. Participants were paid £10.00/hr for participating, which took an average of 20.32 min (single condition) and 38.13 min (repeated conditions).

Coding

Memory reports

There was a wide range in the length of participants’ reports about the target episode (range = 0–219 words; M = 65.88, SD = 37.34) and what usually happens (range = 7–526 words; M = 74.75, SD = 71.65). There was no difference in word counts for either prompt across spacing conditions, Fs < .96, ps > .41. Participants who provided responses of 20 words or fewer in their reports on the target episode (n = 17) or reports on what usually happens (n = 9) were excluded from analysis as their responses were too insufficient to code (e.g., P154: “I cannot remember the details, but I know it was about the man being bullied and harassed”; P176: “Staff member being bullied for no reason”). If a participant provided a sufficient response (> 20 words) to one prompt but not the other, their sufficient response was coded and analysed but their insufficient response was not. The coding schemes for responses to both prompts considered each individual clause in the response, so that everything participants reported was coded (rather than being limited to a pre-determined list of memory items; Gross & Hayne, Citation1998; Patterson & Pipe, Citation2009). A clause was defined as a group of words that included a subject and an implicit or explicit verb (e.g., “a mean boss picking on one of his employees”, “the HR person dismissed the victim”). The categorisation and scoring of clauses differed for reports of the target episode and what usually happens.

Target video reports

In addition to the 17 excluded participants noted above, five participants did not respond to the prompt about the target video. The remaining 195 participants provided an average of 9.85 clauses (SD = 4.45). Coders categorised each clause as accurate (it accurately described the target video), an internal intrusion (it contained information from a non-target video), or an external intrusion (it contained information that did not occur in any video). Of note, all external intrusions were consistent with the details presented in the target video (e.g., “the bully threatened to fire the victim” never occurred but the bully did threaten the victim with a poor reference in the target video). Because clauses are small units, typically a few words, an individual clause never fell into more than one category. Participants’ scores were converted into proportions of the total number of reported clauses (see also Danby et al., Citation2022; Sharman et al., Citation2022). For example, if a participant provided 10 accurate clauses, one internal intrusion, and two external intrusions, they received proportional scores of .77 (10/13), .08 (1/13), and .15 (2/13) respectively.

Eleven participants—all from the repeated event conditions—provided a report of a video that was not the target video (they all provided a coherent and clear report of the third video). This occurred evenly across the repeated event conditions (ns: massed = 2/50, spaced–short = 3/55, spaced–long = 6/42), zs < 1.74, ps > .08. For these participants, the accuracy of their clauses was marked against the video they recalled (i.e., a clause was categorised as accurate if it described a detail from the reported non-target video and an internal intrusion if it described a detail from any other video; see Sharman et al.’s (Citation2022) scoring procedures). This procedure meant that participants were not overly penalised for confusing which video was the last one, reflecting that they had likely bound the details of one video together but just confused which was the last (Lee et al., Citation2015; Mammarella & Fairfield, Citation2008).

Reports of what usually happens

The 156 repeated-event participants with sufficient reports provided an average of 8.44 clauses (SD = 6.97), which either (1) generally described what typically occurred, or (2) described each video individually. Thus, coders categorised each clause as accurate general (it accurately described what usually happened across the video series; e.g., “the bully was mean to the victim”), accurate specific (it accurately described one but not all videos; e.g., “the bully confronted him in the bathroom one time”), or external intrusion (it contained information that did not occur in any videos). Again, all external intrusions were consistent with the details presented in at least one of the videos (no external intrusions were inconsistent with the videos), and again no clauses fell into more than one category. Participants’ scores were converted into proportions of the total number of clauses provided.

Reflective questions

Likert scales

Participants’ ratings of their perceived accuracy and difficulty when reporting the target video and (for repeated-event participants only) what usually happened were analysed even if they had provided insufficient responses to the free recall questions above. This was because we did not wish to exclude participants who may have found the written memory reports difficult. However, the five participants who did not provide a response about the target video at all were excluded from analyses of Likert data because they could not logically rate their own accuracy to a question they had not answered. Thus, ratings for 212 participants were analysed.

Qualitative question

Of the 217 participants, 66 did not provide a response of anything that would have helped them remember the target episode better or they reported that there was nothing that would help. A further nine participants did not answer the question properly (i.e., they provided a detail they did/did not remember well or a reason why they think they remembered the videos well/poorly) and were excluded from the coding. The remaining 142 participants’ responses fell into four categories: 1) behaviours or strategies designed to help memory or reporting of details (memory support), 2) changes to the video content or presentation (video changes), 3) changes to the timing of the spacing interval between the videos (spacing), and 4) changes to the delay between the last/only video and the online questionnaire (delay). See for example responses and descriptions.

Table 2. Qualitative question response categories.

Interrater reliability

Coders were blind to participant conditions and hypotheses. One researcher was responsible for coding participants’ reports of the target video and their responses to the qualitative question. A second researcher coded 15% of responses about the target video, and 35% of responses to the qualitative question for interrater reliability, which showed strong similarity in the categorisation of clauses (kappa = .81) and of qualitative responses (kappa = .95). Two coders together coded participants’ reports of their event scripts; they double-coded 30% of the sample with good reliability (kappa = .76)

Results

Our dataset is available at: https://osf.io/6e2dc/?view_only=592607266537414ebe911e5d53d7620c.

Target video reports

We examined the effects of spacing condition (single, massed, spaced-short, spaced-long) on participants’ proportional accuracy, internal and external intrusion scores. Proportional scores were nonnormal and non-homogenous, so an adjusted Welch’s F test was used (see Blanca et al., Citation2017; Delacre et al., Citation2019; Schmider et al., Citation2010). The main effect of condition on accuracy was significant, F(3, 101) = 3.59, p = .02, ω2 = .07. Planned contrasts showed that single-event participants had a higher proportion accuracy than repeated-event participants, t(141) = 3.10, p = .002, d = .52. Bonferroni-corrected post-hoc tests showed that the three repeated event conditions did not significantly differ from each otherFootnote2, ps > .08. provides group means.

Table 3. Proportional scores for reports of the target video and what usually happens in the videos.

Only 10 participants made internal intrusions (5.1%); they were all in repeated event conditions. Specifically, five participants made one intrusion, three made two intrusions, one made three intrusions, and one participantFootnote3 made 11 intrusions. Due to the infrequent internal intrusions, an F-test was not appropriate. Instead, we dichotomously coded whether participants made any internal intrusions (see for distribution) and conducted a 3 (condition: massed, spaced-short, spaced-long) x 2 (intrusion: yes, no) contingency table. Due to small expected cell counts, a Fisher-Freeman-Halton exact test was used (Conover, Citation1980). The test was not significant, p = .11.

External intrusions were made by 54 of the 195 participants (27.7%). Specifically, 42 participants reported one external intrusion, nine reported two, and three reported three intrusions. The F-test examining the effects of spacing condition on external intrusions was not significant, F(3, 105.13) = 0.59, p = .62.

Reports of what usually happens

Three one-way ANOVAs were conducted to examine the effects of spacing condition (massed, spaced-short, spaced-long) on the proportional accurate general, accurate specific, and external intrusion scores. None of the ANOVAs were significant, Fs < .59, ps > .55 (see for means).

Reflective questions

Likert ratings

We conducted two one-way ANOVAs to explore whether event condition (including the single event) affected participants’ perceived accuracy and difficulty ratings about reporting the target video. The ANOVAs were non-significant, Fs ≤ 2.03, ps ≥ .11, η2 s ≤ .03.

Next, we compared repeated-event participants’ perceived accuracy and difficulty for reports of the target episode versus reports of what usually happens. We conducted two 3 (condition: massed, spaced-short, spaced-long) x 2 (prompt rated: target episode, event script) mixed ANOVAs. See for group means. The ANOVA for perceived accuracy showed a significant between-subjects effect of condition, F(2, 157) = 3.12, p = .047, ηp2= .04. Bonferroni-corrected post-hoc comparisons showed no significant difference between conditions (ps > .05), but the difference closest to significance was the massed condition perceiving themselves more accurate than the spaced-long condition, p = .065. There was a significant within-subjects effect of prompt, F(1, 157) = 25.45, p < .001, ηp2= .14. Pairwise comparisons with Bonferroni corrections showed that participants perceived themselves to be more accurate reporting on what usually happens than reporting on the target video, p < .001. The interaction was not significant, F(2, 157) = 0.11, p = .89, ηp2= .001.

Table 4. Likert ratings of perceived accuracy and question difficulty.

The ANOVA for perceived difficulty had no significant between-subjects effect of condition, F(2, 157) = 2.95, p = .06, ηp2= .04. However, there was a within-subjects effect of prompt, F(1, 157) = 29.31, p < .001, ηp2= .16. Pairwise comparisons with Bonferroni corrections showed that participants rated the target episode prompt more difficult to answer than what usually happens, p < .001. The interaction was not significant, F(2, 157) = 0.43, p = .65, ηp2= .005.

Qualitative question

Of the 142 participants who provided a response about what would help them to remember the videos better, most commented on memory support (n = 66) or video changes (n = 53); fewer commented on the spacing interval (n = 14) or delay to recalling the video (n = 9). Strategies for memory support included ideas such as re-watching the videos, taking notes while watching them, or being warned before watching them that there would be a memory test. Video change responses suggested adding subtitles, adding numbers or labels to videos or characters, or adding more unique details to each video. All spacing responses indicated that an interval between each video was perceived to be detrimental to memory recall of the target video (i.e., a shorter interval would make recall easier); only participants in the two spaced event conditions made these responses (spaced-short n = 5, spaced-long n = 9). All delay responses indicated that reducing the delay would improve recall.

A 4 (event condition) x 4 (response) contingency table explored response differences across event conditions. Due to small expected cell counts, a Fisher-Freeman-Halton exact test was used. Responses were significantly related to condition, p < .001. Spacing responses were made by the spaced-long condition more often than expected, and by the single and massed conditions less often. Compared to expected values, memory support responses were more often made by the single condition, video change responses were more often made by the massed condition, and delay responses were more often made by the spaced-short condition. shows response distributions.

Table 5. Distribution of qualitative question responses (n = 142).

Discussion

The current study examined whether the interval between episodes of a repeated event affected adults’ memory reports. Consistent with our first hypothesis, we found that repeated-event participants were less accurate in reporting on the target video than single-event participants. This finding is consistent with Price et al,’s (Citation2006) study of children as well as the repeated event literature more generally (see Dilevski et al., Citation2020; Woiwod et al., Citation2019 for reviews). When accuracy is defined narrowly (i.e., as the ability to attribute details to the correct episode; Price et al., Citation2006; Woiwod et al., Citation2019), single event witnesses tend to be more accurate than repeated event witnesses.

Like Price et al. (Citation2006), the current study also found no differences in accurate details reported between the repeated event conditions (i.e., across the massed and two spaced conditions). These findings contrast the spacing effect literature, which has shown that learning improves when the intervals between repetitions increase (Ebbinghaus, Citation1885; Smith & Scarf, Citation2017). It is possible that, because events are more complex and integrated than cognitive stimuli (Conway, Citation2005; Hudson et al., Citation1992), spacing does not further improve retrieval. Price and colleagues (Citation2006) found reduced spacing effects after a one-week delay (Experiment 2) compared to a one-day delay (Experiment 1). Memory for experienced events becomes more general over time (Brainerd & Reyna, Citation2004; Sekeres et al., Citation2016). Participants in the current study experienced a one-week delay before providing their memory reports, which may have increased their reliance on general memory and reduced any effects of episode spacing. Consistent with this explanation, we also found no effect of spacing on repeated-event participants’ reports about “what usually happens”, which taps into their scripts of the general event structure and temporal order. It is possible that a shorter delay (less than one week) might have resulted in more effects of spacing condition in the current study.

Contrary to our second and third hypotheses, we did not find any evidence that spacing affected rates of memory errors when reporting on a target episode: both internal and external intrusions were comparable across spacing conditions. While Price et al. (Citation2006) found an effect of episode spacing on internal intrusions, this study sampled children, who are still developing their source-monitoring abilities so are more prone to internal intrusion errors than adults (Lindsay et al., Citation1991; Ryan, Citation2009). Indeed, our adult participants reported very few internal intrusions regardless of spacing condition. Consequently, a floor effect may explain why event spacing did not influence internal intrusion reports.

External intrusions were the most common type of error our sample made about the target episode (though reported errors were still quite low). Of note, all external intrusions about the target episode were consistent with what occurred in the target video. Reporting non-presented details that are consistent with presented content has commonly been found in memory studies (e.g., reporting seeing details beyond the boundaries of a presented photograph in Munger & Multhaup, Citation2016; reporting seeing unpresented segments of a video showing someone making a sandwich in Gerrie et al., Citation2006). These errors occur when participants generate a non-presented detail internally (such as imagining it or drawing on their prior knowledge) and erroneously attribute it to the observed stimuli. Accurately monitoring what details were witnessed versus generated internally becomes more difficult when there are delays after an event (Sussman, Citation2001). Reporting on the target video after a one-week delay might have influenced our participants’ commission of external intrusions more so than spacing interval. Even with immediate recall, the spacing effect has previously been shown to be small for the commission of errors that are consistent with the theme of presented stimuli (relative to its size for reporting accurate details, Dubuisson et al., Citation2012). It is also possible that external intrusions may have been too low in the current study to detect a spacing effect, especially if the spacing effect is small for stimuli-consistent errors.

Interestingly, while spacing had no overall effect on repeated event participants’ memory reports, it influenced their perceived accuracy. That is, there was a significant effect of spacing condition on participants’ perceived accuracy ratings. While the post-hoc tests were nonsignificant, the effect seemed to be driven by the massed condition rating themselves as more accurate than the spaced-long condition. Previous studies have also shown that learners perceive themselves as better at performing skills learned after one massed learning session, rather than spacing learning over several sessions (Dunning et al., Citation2004; Simon & Bjork, Citation2001). Similarly, 14 participants in the spaced-short and spaced-long conditions believed that reducing spacing (e.g., presenting all videos in a massed session) would improve their memory performance. Dunning et al. (Citation2004) asserted that adults often mistake learning a skill quickly for competence performing it, leading to increased beliefs of performance in massed conditions (without necessarily demonstrating increased performance).

Repeated-event participants further judged themselves to be more accurate when reporting what usually happened than when reporting the target video. This judgment is consistent with script theory, which states that an event script supports retrieval of the typically occurring details from a repeated event (Hudson et al., Citation1992; Schank & Abelson, Citation1977). Previously, Danby et al. (Citation2022) surveyed laypeople about their understanding of event memory, finding that lay adults expected the general or consistent details from a repeated event to be better remembered than changing ones. Similarly, our results for repeated event participants suggest that they felt they performed more accurately when recalling the general details rather than details unique to the target episode. The single event condition’s perceived accuracy could only be considered for reports of the target video (not reports of what usually happened); their perceived accuracy did not significantly differ from the repeated event conditions. In Danby et al.’s survey of lay adults, participants expected repeated events to be recalled worse than single events. However, people generally have inflated expectations for their own performance on cognitive tasks (Kruger & Dunning, Citation1998; Metcalfe, Citation1998), so we may have not detected a difference between single and repeated event conditions’ perceived accuracy because our participants were rating their own memory (rather than rating memory for a fictional a vignette in Danby et al., Citation2022). The ability to accurately judge ones’ own memory performance has important implications when it comes to how eyewitnesses are perceived. Research has shown that confidence-accuracy calibration impacts credibility assessment made by legal decision makers, with witnesses with better calibration being judged more credible (Tenney et al., Citation2007).

This study has limitations. First, we collected memory reports online using free text boxes. This meant that we could not probe participants with follow-up questions, so some responses were very short and had to be excluded from the analyses. It is also possible that the high accuracy scores in the current study reflect participants’ provision of the details that they were most confident about, and there was no subsequent prompting for more detail. Replication of our study with face-to-face interviews that include follow-up prompts is needed to address this limitation. Further, the videos were quite different from each other, even though they all depicted workplace bullying between the same employees. For example, they differed in the locations in which the events took place, and the type of bullying employed (verbal, physical). While ecologically valid, these differences may have made the episodes less likely to be confused with one another and thus may have resulted in lower rates of internal intrusions across all conditions. Future research could utilise videos that are highly structured and similar to one another to test this notion. Last, participant assignment was non-random so that recruitment advertising and payment could accurately reflect participant requirements for each condition. It is possible that participants willing to commit to longer conditions may have increased motivation compared to shorter conditions. Future research would benefit from random assignment to conditions.

Despite these limitations, this study makes an important contribution to the literature. The findings suggest that the spacing effect may not be evident for adult recollections of a repeated event when tested after a one-week delay, at least in response to a free recall question and when episodes are very different from each other. This has important methodological implications because it suggests that stimuli for repeated event studies could potentially be shown to participants in one session, rather than across the course of multiple weeks, without any effect on memory. Consequently, research examining repeated event memory may be logistically easier to conduct in the future. However, given this is the first time spacing has been manipulated within an adult sample, replication with a range of events (such as live events rather than videos) and retention intervals (such as delays longer or shorter than one week) is needed. Overall, our findings provide a starting point for future investigations of the effect of spacing of experienced events on adults’ memory performance.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

1 Video order was not counterbalanced in the current study since Dilevski and colleagues (Citationin press) found no effect of counterbalance order on adult participants’ memory reports.

2 We further examined the lack of statistically significant differences in participants’ memory reports across spacing conditions using Bayesian one-way ANOVAs conducted in JASP (version 0.16.2). Bayes factors were examined and compared to conventions suggested by Lee and Wagenmakers (Citation2013) and Jeffreys (Citation1961) for interpretation. For reports of the target video, we found anecdotal evidence for the null hypothesis of no difference across the repeated event conditions’ accuracy scores (BF10 = 0.59) and strong evidence for the null hypothesis across all conditions’ external intrusion scores (BF10 = .05). For reports of what usually happens, we found strong evidence for the null hypothesis of no difference across all conditions for accurate-general and accurate-specific scores (BF10  = .09 and BF10  = .08 respectively), and moderate evidence for the null hypothesis on external intrusion scores (BF10 = .11).

3 This participant was in a repeated event condition and clearly identified that they were reporting on the last video only but reported happenings from all four videos in depth. Accordingly, details that were from the last video were coded as accurate, but details from the other three videos were coded as internal intrusions, leading to a large internal intrusion score. Analyses were run including and excluding this participant, and the pattern of results was the same either way. Accordingly, we present results including this participant.

References

  • Blanca, M. J., Alarcon, R., Arnau, J., Bono, R., & Bendayan, R. (2017). Non-normal data: Is ANOVA still a valid option? Psicothema, 29(4), 552–557. https://doi.org/10.7334/psicothema2016.383
  • Boltz, M. (1992). Temporal accent structure and the remembering of filmed narratives. Journal of Experimental Psychology: Human Perception and Performance, 18(1), 90. https://doi.org/10.1037/0096-1523.18.1.90
  • Brainerd, C. J., & Reyna, V. F. (2004). Fuzzy-trace theory and memory development. Developmental Review, 24(4), 396–439. https://doi.org/10.1016/j.dr.2004.08.005
  • Brainerd, C. J., Reyna, V. F., & Ceci, S. J. (2008). Developmental reversals in false memory: A review of data and theory. Psychological Bulletin, 134(3), 343. https://doi.org/10.1037/0033-2909.134.3.343
  • Cepeda, N. J., Pashler, H., Vul, E., Wixted, J. T., & Rohrer, D. (2006). Distributed practice in verbal recall tasks: A review and quantitative synthesis. Psychological Bulletin, 132(3), 354–380. https://doi.org/10.1037/0033-2909.132.3.354
  • Conover, W. J. (1980). Practical nonparametric statistics (2nd ed.). John Willey & Sons.
  • Conway, M. A. (2005). Memory and the self. Journal of Memory and Language, 53(4), 594–628. https://doi.org/10.1016/j.jml.2005.08.005
  • Crowder, R. G. (1976). Principles of learning and memory. Erlbaum.
  • Danby, M., Sharman, S., & Claringbold, G. (2022). The effects of recalling generic versus episodic information first on adults’ reports of a repeated event. Applied Cognitive Psychology, 36(2), 460–467. https://doi.org/10.1002/acp.3924
  • Deck, S., & Paterson, H. (2020). Adults also have difficulty recalling one instance of a repeated event. Applied Cognitive Psychology, 35(1), 286–292. https://doi.org/10.1002/acp.3736
  • Delacre, M., Leys, C., Mora, Y. L., & Lakens, D. (2019). Taking parametric assumptions seriously: Arguments for the use of Welch’s F-test instead of the classical F-test in one-way ANOVA. International Review of Social Psychology, 32(1), 1–12. https://doi.org/10.5334/irsp.198
  • Delaney, P. F., Godbole, N. R., Holden, L. R., & Chang, Y. (2017). Working memory capacity and the spacing effect in cued recall. Memory, 26(6), 784–797. https://doi.org/10.1080/09658211.2017.1408841
  • Dellarosa, D., & Bourne, L. E. (1985). Surface form and the spacing effect. Memory & Cognition, 13(6), 529–537. https://doi.org/10.3758/bf03198324
  • Dilevski, N., Cullen, H. J., Marsh, A., Paterson, H. M., & van Golde, C. (in press). ‘Tell me what just happened’: The effect of immediate recall on adult memory for instances of a repeated event. Accepted for Publication in Psychology, Crime & Law. https://doi.org/10.1080/1068316X.2022.2117808
  • Dilevski, N., Paterson, H., Walker, S. A., & van Golde, C. (2020). Adult memory for specific instances of a repeated event: A preliminary review. Psychiatry, Psychology and Law. https://doi.org/10.1080/13218719.2020.1837031
  • Dubuisson, J. B., Fiori, N., & Nicolas, S. (2012). Repetition and spacing effects on true and false recognition in the DRM paradigm. Scandinavian Journal of Psychology, 53(5), 382–389. https://doi.org/10.1111/j.1467-9450.2012.00963.x
  • Dunning, D., Heath, C., & Suls, J. M. (2004). Flawed self-assessment: Implications for health, education, and the workplace. Psychological Science in the Public Interest, 5(3), 69–106. https://doi.org/10.1111/j.1529-1006.2004.00018.x
  • Ebbinghaus, H. (1885). Uber das Gedachtnis. Dover.
  • Fair Work Act. (2009). (Cth) s. 481.1 (Austl.).
  • Flores, S., Bailey, H. R., Eisenberg, M. L., & Zacks, J. M. (2017). Event segmentation improves event memory up to one month later. Journal of Experimental Psychology: Learning, Memory, and Cognition, 43(8), 1183–1202. https://doi.org/10.1037/xlm0000367
  • Gerrie, M. P., Belcher, L. E., & Garry, M. (2006). ‘Mind the Gap': False memories for missing aspects of an event. Applied Cognitive Psychology, 20(5), 689–696. https://doi.org/10.1002/acp.1221
  • Gross, J., & Hayne, H. (1998). Drawing facilitates children's verbal reports of emotionally laden events. Journal of Experimental Psychology: Applied, 4(2), 163. https://doi.org/10.1037/1076-898x.4.2.163
  • Hudson, J. A., & Mayhew, E. M. Y. (2009). The development of memory for recurring events. In M. L. Courage, & N. Cowan (Eds.), The development of memory in infancy and childhood (pp. 69–91). Psychology Press.
  • Hudson, J., Fivush, R., & Kuebli, J. (1992). Scripts and episodes: The development of event memory. Applied Cognitive Psychology, 6(6), 483–505. https://doi.org/10.1002/acp.2350060604
  • Jeffreys, H. (1961). The theory of probability. Oxford University Press.
  • Koval, N. G. (2019). Testing the deficient processing account of the spacing effect in second language vocabulary learning: Evidence from eye tracking. Applied Psycholinguistics, 40(05), 1103–1139. https://doi.org/10.1017/s0142716419000158
  • Kruger, J., & Dunning, D. (1998). Unskilled and unaware of it: How difficulties in recognizing one's own incompetence lead to inflated self-assessments. Journal of Personality and Social Psychology, 77(6), 1121–1134. https://doi.org/10.1037/0022-3514.77.6.1121
  • Lee, J. K., Wendelken, C., Bunge, S. A., & Ghetti, S. (2015). A time and place for everything: Developmental differences in the building blocks of episodic memory. Child Development, 87(1), 194–210. https://doi.org/10.1111/cdev.12447
  • Lee, M. D., & Wagenmakers, E.-J. (2013). Bayesian modeling for cognitive science: A practical course. Cambridge University Press. https://doi.org/10.1017/CBO9781139087759
  • Lindsay, D. S., Johnson, M. K., & Kwon, P. J. (1991). Developmental changes in memory source monitoring. Journal of Experimental Child Psychology, 52(3), 297–318. https://doi.org/10.1016/0022-0965(91)90065-Z
  • MacLean, C. L., Coburn, P. I., Chong, K., & Connolly, D. L. (2018). Breaking script: Deviations and postevent information in adult memory for a repeated event. Applied Cognitive Psychology, 32(4), 474–486. https://doi.org/10.1002/acp.3421
  • Mammarella, N., & Fairfield, B. (2008). Source monitoring: The importance of feature binding at encoding. European Journal of Cognitive Psychology, 20(1), 91–122. https://doi.org/10.1080/09541440601112522
  • Melton, A. W. (1970). The situation with respect to the spacing of repetitions and memory. Journal of Verbal Learning and Verbal Behavior, 9(5), 596–606. https://doi.org/10.1016/s0022-5371(70)80107-4
  • Metcalfe, J. (1998). Cognitive optimism: Self-Deception or memory-based processing heuristics? Personality and Social Psychology Review, 2(2), 100–110. https://doi.org/10.1207/s15327957pspr0202_3
  • Munger, M., & Multhaup, K. (2016). No imagination effect on boundary extension. Memory & Cognition, 44(1), 73–88. https://doi.org/10.3758/s13421-015-0541-3
  • Myles-Worsley, M., Cromer, C. C., & Dodd, D. H. (1986). Children’s preschool script reconstruction: Reliance on general knowledge as memory fades. Developmental Psychology, 22(1), 22–30. https://doi.org/10.1037/0012-1649.22.1.22
  • Odegard, T. N., Cooper, C. N., Lampinen, J. M., Reyna, V. F., & Brainerd, C. J. (2009). Children’s eyewitness memory for multiple real-life events. Child Development, 80(6), 1977–1890. https://doi.org/10.1111/j.1467-8624.2009.01373.x
  • PPP v. R. (2010). VSCA 110.
  • Patterson, T., & Pipe, M. E. M. (2009). Exploratory assessments of child abuse: Children's responses to interviewer's questions across multiple interview sessions. Child Abuse & Neglect, 33(8), 490–504. https://doi.org/10.1016/j.chiabu.2008.12.012
  • Price, H. L., Connolly, D. A., & Gordon, H. M. (2006). Children's memory for complex autobiographical events: Does spacing of repeated instances matter? Memory (Hove, England), 14(8), 977–989. https://doi.org/10.1080/09658210601009005
  • Risan, P., Milne, R., & Bider, P.-E. (2020). Trauma narratives: Recommendations for investigative interviewing. Psychiatry, Psychology & Law, 27(4), 678–694. https://doi.org/10.1080/13218719.2020.1742237
  • R v. Baker. (2002). QCA 174.
  • Ryan, R. G. (2009). Age differences in personality: Adolescents and young adults. Personality and Individual Differences, 47(4), 331–335. https://doi.org/10.1016/j.paid.2009.03.023
  • Schank, R. C., & Abelson, R. P. (1977). Scripts, plans, goals, and understanding: An inquiry into human knowledge structures. Erlbaum.
  • Schmider, E., Ziegler, M., Danay, E., Beyer, L., & Bühner, M. (2010). Is It really robust? Reinvestigating the robustness of ANOVA against violations of the normal distribution assumption. Methodology, 6(4), 147–151. https://doi.org/10.1027/1614-2241/a000016
  • Schneider, W. (2015). Memory development from early childhood through emerging adulthood. Springer International Publishing.
  • Schwan, S., Garsoffky, B., & Hesse, F. W. (2000). Do film cuts facilitate the perceptual and cognitive organization of activitiy sequences? Memory & Cognition, 28(2), 214–223. https://doi.org/10.3758/BF03213801
  • Sekeres, M. J., Bonasia, K., St-Laurent, M., Pishdadian, S., Winocur, G., Grady, C., & Moscovitch, M. (2016). Recovering and preventing loss of detailed memory: Differential rates of forgetting for detail types in episodic memory. Learning & Memory, 23(2), 72–82. https://doi.org/10.1101/lm.039057.115
  • Sharman, S. J., Danby, M. C., & Christopoulos, L. (2022). Mental context reinstatement improves adults’ reports of additional details from two instances of a repeated event. Memory (Hove, England), 30(8), 988–999. https://doi.org/10.1080/09658211.2022.2068610
  • Simon, D. A., & Bjork, R. A. (2001). Metacognition in motor learning. Journal of Experimental Psychology. Learning, Memory, and Cognition, 27(4), 907–912. https://doi.org/10.1037/0278-7393.27.4.907
  • Smith, C. D., & Scarf, D. (2017). Spacing repetitions over long timescales: A review and a reconsolidation explanation. Frontiers in Psychology, 8. https://doi.org/10.3389/fpsyg.2017.00962
  • Sussman, A. L. (2001). Reality monitoring of performed and imagined interactive events: Developmental and contextual effects. Journal of Experimental Child Psychology, 79(2), 115–138. https://doi.org/10.1006/jecp.2000.2575
  • Tenney, E. R., MacCoun, R. J., Spellman, B. A., & Hastie, R. (2007). Calibration trumps confidence as a basis for witness credibility. Psychological Science, 18(1), 46–50. https://doi.org/10.1111/j.1467-9280.2007.01847.x
  • Woiwod, D. M., Fitzgerald, R. J., Sheahan, C. L., Price, H. L., & Connolly, D. A. (2019). A meta-analysis of differences in children’s reports of single and repeated events. Law and Human Behavior, 43(1), 99–116. https://doi.org/10.1037/lhb0000312
  • Zacks, J. M., Speer, N. K., Swallow, K. M., Braver, T. S., & Reynolds, J. R. (2007). Event perception: A mind / brain perspective. Psychological Bulletin, 133(2), 273–293. https://doi.org/10.1037/0033-2909.133.2.273