161
Views
0
CrossRef citations to date
0
Altmetric
Articles

Grade six students’ multidigit arithmetic strategy adaptivity and flexibility: evaluating a novel tri-phase assessment tool

ORCID Icon & ORCID Icon
Received 14 May 2022, Published online: 26 Mar 2024

Abstract

In this paper we present a novel adaptation of a tri-phase assessment tool, originally devised to investigate students’ linear equations-related strategy flexibility, to evaluate Danish grade-six students’ multidigit arithmetic-related strategy adaptivity and flexibility. Participants, 731 students, median age 12 years and drawn from 20 demographically different schools, completed a series of positive integer addition, subtraction and multiplication tasks designed to elicit shortcut strategies. First, students solved each task by means of their preferred strategy, with those using a shortcut strategy being defined as showing adaptivity for that task. Second, students solved the same tasks by means of whatever alternative strategies they could devise; those that offered at least two strategies were defined as showing flexibility for that task. Third, for each task, students were asked to indicate which of their strategies they believed was optimal. Statistical analyses, based on the total number of tasks on which students demonstrated adaptivity or flexibility, found students to be more flexible than adaptive, with 55% of students showing moderate or high levels of flexibility but only 21% for adaptivity. Both strategy adaptivity and flexibility, which differed according to operation, were implicated in students’ solution accuracy, with adaptivity being a stronger predictor than flexibility.

1. Introduction

Researchers have increasingly come to view the solving of mathematics tasks efficiently, creatively, flexibly and adaptively as a core goal of mathematics education (Verschaffel et al., Citation2009). Indeed, since the mathematics reforms of the 1990s, flexibility and adaptivity in strategy choice have become accepted elements of mathematical proficiency (Baroody, Citation2003; Hickendorff, Citation2018; Xu et al., Citation2017) and, despite a lack of evidence supporting the feasibility of such goals for all students (Torbeyns et al., Citation2009a), an important developmental goal for mathematics learning (Baroody & Dowker, Citation2003; Hatano & Oura, Citation2003; McMullen et al., Citation2016; Torbeyns & Verschaffel, Citation2016). Moreover, evidence is increasingly showing that arithmetic-related strategy adaptivity has long-term developmental implications for many aspects of children’s learning of mathematics (Sunde et al., Citation2023).

1.1. Defining adaptivity and flexibility

A precursor to such work can be found in early research on the development and manifestation of adaptive expertise in the context of mathematics learning, both generally (De Corte, Citation2007; Hatano, Citation1988; Verschaffel et al., Citation2009) and in respect of arithmetic (Hickendorff, Citation2018; McMullen et al., Citation2016, Citation2020, Citation2022; Nemeth et al., Citation2019; Torbeyns et al., Citation2009a). However, over the last few years, research into the development of students’ adaptive expertise (McMullen et al., Citation2016) has been complemented by an emphasis on flexibility. Indeed, (Nemeth et al., Citation2019) write of a ‘consensus among mathematics researchers and educators that the abilities to use various strategies for solving a problem (flexibility) as well as to use efficient strategies (adaptivity) are important mathematical competencies students should gain’.

However, the distinction between the two constructs has not always been transparent (Heinze et al., Citation2009), for example, some scholars seem to construe adaptivity as subsuming flexibility. For example, McMullen and his colleagues have suggested that the flexible application of number-related knowledge is an important component of arithmetical adaptivity (McMullen et al., Citation2016, Citation2020). Others, see also Baroody (Citation2003) see them as synonyms, as with Hickendorff’s (Citation2018, p. 578) somewhat expedient position that in her ‘paper, flexibility and adaptivity are used interchangeably as selecting the optimal strategy for a given problem in a given setting for a given person’. The more widely accepted view, as argued by Blöte et al. (Citation2000) and Star and Rittle-Johnson (Citation2008), sees adaptivity as subsumed by flexibility, as with Schneider et al.’s (Citation2011, p. 1527) argument that one ‘source of competence is procedural flexibility, where learners know multiple procedures and apply them adaptively to a range of situations’ and Rathgeb-Schnierer and Green’s (Citation2017, p. 3) position that ‘flexibility in mental calculation includes two central features: the knowledge of different solution methods and the ability to adapt them appropriately’.

Clarification of this latter view can be found in Verschaffel et al.’s (Citation2009) well-known review paper. After initially treating flexibility and adaptivity as synonymous, they concluded that 'the term “flexibility” is primarily used to refer to switching (smoothly) between different strategies, whereas the term “adaptivity” puts more emphasis on selecting the most appropriate strategy' (Verschaffel et al., Citation2009, p. 337, original emphasis). Others, as do we in this paper, have accepted this distinction (Heinze et al., Citation2009), although, as evidenced in Xu et al. (Citation2017), the vocabulary may change. The latter refer to potential flexibility and practical flexibility in ways that indicate a strong resonance with flexibility and adaptivity respectively. Importantly, further clarifying Verschaffel et al.’s (Citation2009) conclusions, although possessing a variety of strategies and being able to switch smoothly between these strategies can be considered an essential stepping-stone towards adaptivity, merely applying different solution strategies on a series of similar mathematical items or problems can hardly be considered as evidence of adaptivity (Verschaffel et al., Citation2009, p. 339).

Importantly, two key inferences can be drawn from Verschaffel et al.’s (Citation2009) distinction. First, flexibility can be seen as a precondition for adaptivity (Verschaffel et al., Citation2007) and, second, ‘whether a strategy is adaptive or not, does not depend on speed and accuracy’ (Nemeth et al., Citation2019, p. 2). Finally, to illustrate our construal of the distinction between flexible and adaptive strategy use, we draw on an example from Threlfall (Citation2002, p. 31), who writes that the adding of one number to another first by adding the tens and then the ones separately is more suited to 55 + 24 than to 33 + 29, whereas rounding a number before adding, and then compensating, is more suited to 33 + 29 than to 55 + 24.

Thus, a student who knows Threlfall’s two strategies can be construed as flexible. A student who chooses the more appropriate of the two strategies can be construed as adaptive. Importantly, there is some consensus with respect to the necessary precursors of strategy flexibility and adaptivity; a synthesis of the work of Hatano and Inagaki (Citation1986), Hickendorff (Citation2018), Hickendorff et al. (Citation2019), Threlfall (Citation2002) and Verschaffel et al. (Citation2009) indicates that evaluating strategy appropriateness requires students to

  • attend to the differing task characteristics, the implicated operations and the extent to which the numbers involved point the solver towards appropriate and efficient strategies;

  • draw on an appropriately developed conceptual knowledge, not least because the identification of appropriate strategies or the invention of new ones, say, is dependent on a depth understanding of the numbers involved and their relations to each other;

  • draw on an appropriate procedural knowledge;

  • be conscious of contextual matters such as whether speed is privileged over accuracy.

In this paper, we focus on the strategy selection and use of Danish grade six students with respect to multidigit addition, subtraction and multiplication. Our reasons for so doing are several, and some of these are discussed later, but here we offer four. First, Danish grade six students’ strategy selection and use have not yet been researched. Therefore, this study should not only address a significant gap but contribute to the international literature on the topic. Second, national assessments of Danish students’ mathematics-related competence have shown low levels of division-related competence. Therefore, a study of division-related strategy selection is unlikely to prove productive. Third, as Sunde et al. (Citation2020) observe, the Danish curriculum and assessment model forms the basis of PISA’s assessment framework. Therefore, investigating Danish students’ strategy use is important in relation to understanding Danish students’ PISA-related achievement. Fourth, as we outline below, much strategy-related research has exploited a choice/no choice research design, which, even in the choice condition, typically restrict the choices available. Therefore, this study draws on an approach to data collection that is not only novel in the context of multidigit arithmetic but also allows students, unconstrained by researcher-imposed strategies, to exploit their preferred strategies.

1.2. Strategy choice in the context of arithmetic

Our reading of the literature on children’s strategy selection and use in the context of arithmetic is that more attention has been paid to children in the first four years of elementary education than the latter. Frequently motivated by an interest in understanding the extent to which the curricular introduction of standard algorithms influences children’s strategy choice, much work has been undertaken in the European context, including Flemish studies of students in grades two, three and four (Torbeyns et al., Citation2009a), grade three (De Smedt et al., Citation2010; Torbeynset al., Citation2009b) and grade four (Torbeyns & Verschaffel, Citation2016). Elsewhere can be found French studies of students in grades three and five (Lemaire & Brun, Citation2017), German studies of students in grade two (Blöte et al., Citation2001), grade three (Heinze et al., Citation2018, Citation2009; Nemeth et al., Citation2019; Sievert et al., Citation2019) and grades three and four (Selter, Citation2001), Hungarian studies of students in grade four (Csíkos, Citation2016), Italian studies of students in grades three and five (Lucangeli et al., Citation2003) and Dutch studies of students in grade four (Hickendorff, Citation2020). Overall, these studies have indicated that cultural norms, including when and how standard algorithms are introduced, may influence greatly the strategies children later elect to use.

In similar vein, the limited number of studies involving older students, have found, for example, that Flemish grade six students flexibly apply indirect addition strategies to small difference subtraction tasks, even when the instructional setting emphasises direct subtraction (Peters et al., Citation2013; Torbeyns et al., Citation2018; Van Der Auwera et al., Citation2022). Elsewhere, when allowed to choose, Dutch grade six students privilege standard algorithms across all operations (Hickendorff, Citation2018), while French students in grades five and seven exploit various strategies dependent on the context of the problem and, in particular, whether the subtractions under scrutiny are presented vertically or horizontally (Lemaire & Callies, Citation2009). Other studies, of limited interest to this paper, have focused on the division strategies of Dutch grade six students (Fagginger Auer et al., Citation2016) or those of both Dutch and Flemish students in grades five and six (Hickendorff et al., Citation2018). Such studies have not only shown the influence on students’ strategy choices of standard algorithms but, importantly, highlighted how researcher decisions with respect to which strategies and task types are under scrutiny inevitably influence any conclusions. Importantly, by way of understanding more fully the relationship between adaptivity, flexibility and accuracy, McMullen et al. (Citation2016) found the adaptive number knowledge of grade six Finnish students to be strongly related to both their conceptual knowledge of arithmetic and their arithmetic fluency. They conclude that ‘being able to produce a large number of arithmetic sentences on these tasks is not merely about being able to quickly calculate simple arithmetic problems, but requires a better understanding of the relationship between numbers and arithmetic operations’ (p. 179).

1.3. Approaches to the measurement of strategy flexibility and adaptivity

Previous studies have used a variety of tools to measure students’ adaptivity and flexibility. From the perspective of adaptivity, this entails evaluating when a strategy is appropriate or efficient for the item being solved (Luwel et al., Citation2009; Nemeth et al., Citation2019). Many studies, as indicated above, have adopted a choice/no choice approach, which typically entails students being tested under two differing conditions. For each task in the choice condition, children are typically invited to choose between a range of strategies offered to them. In the no choice condition, they are instructed to use one or other of the same strategies (Torbeyns et al., Citation2004). The researcher-selected strategies are dependent on the nature of the study being undertaken and, importantly, the operations under scrutiny. Various studies (see Hickendorff, Citation2020; Lemaire & Callies, Citation2009; Si et al., Citation2016; Torbeyns et al., Citation2006; Torbeyns et al., Citation2009b; Torbeyns et al., Citation2018; Torbeyns & Verschaffel, Citation2013, Citation2016) have exploited this approach, arguing that by comparing students’ responses in the two conditions, typically involving evaluations of speed and accuracy, researchers are able to determine an individual’s adaptivity. However, despite the undoubted success of the choice/no choice approach, the very tight restrictions on choice imposed in both conditions necessarily mask novel approaches students may otherwise bring to the task. By way of contrast, a number of studies have attempted to elicit students’ preferred strategies, albeit in different ways and contexts. For example, Blöte et al. (Citation2001) invited German grade two students to solve two-digit addition and subtraction tasks in two different ways and record all the steps of each method, with flexible students being defined by their ability to provide two such methods. In the Finnish context, McMullen et al. (Citation2016) ‘asked students to come up with as many arithmetic calculation sentences as they could which used a set of numbers to equal a target number’ (p. 175). For example, the first of the four tasks had a target of 16, and the integers 2, 4, 8, 12 and 32 as potential operands. The advantage of such an approach, which can also be construed as a disadvantage, is that no particular operation is privileged. In similar vein, Torbeyns et al. (Citation2009a) examined second, third and fourth grade students’ two-digit addition and subtraction strategies by first inviting them to solve all given items as accurately and as quickly as possible by means of a preferred strategy, and then by inviting them to generate at least two different strategies for each item.

From the perspective of flexibility, studies with a clear focus on evaluating students’ preferred strategies rather than those imposed by researchers include the work of Rittle-Johnson and Star (Citation2007). Here, focused on students’ flexibility in the solving of linear equations, they adopt a three-part assessment, ‘designed to tap three components of flexibility—the abilities to generate, recognize, and evaluate multiple solution methods for the same problem’ (p. 564). The ‘generate’ element invited students to solve each of two given equations in two different ways. In the ‘recognise’ element, students were shown two equations and, in each case, invited to write down ‘all possible steps that could be done next’ (p. 565). The ‘evaluate’ element, also involving two equations, presented the first line of an unconventional or novel solution and invited students to comment on the strategy’s efficiency. This three-pronged approach leads us to the work of Xu et al. (Citation2017), which, as is explained below, will form the methodological underpinning of the study presented here. As with Rittle-Johnson and Star’s work, Xu et al. (Citation2017) have three foci: the strategies students choose to use when solving linear equations, whether students have access to strategies additional to their preferred, and importantly, how they interpret the efficacy of these different strategies. Their three-phase assessment, which they label a Tri-phase Flexibility Assessment model (hereafter TriFA), draws on elements of the studies discussed above, and incorporates the processes of generation, recognition and evaluation found in Rittle-Johnson and Star (Citation2007). During the first phase, students are invited to solve each assessment item with a strategy of their choice. In the second phase, students are invited to solve the same tasks but with as many different strategies as they can. Finally, during phase three, students are invited to identify the most efficacious of all the strategies they have offered. Overall, the total of number of strategies yielded by phases one and two offer an indication of student flexibility, while their responses to phases one and three offer an indication of student adaptivity.

2. The current study

In addition to those discussed above, the current study is motivated by at several additional factors. First, much of the research undertaken over the last 25 years has focused on both single and multidigit of addition and subtraction, with few studies examining strategy choice in relation to multiplication (Hickendorff et al., Citation2019). Second, most arithmetic-related studies have focused on younger students, with few considering the strategies students use at different ages. Third, and we reiterate this due to its methodological importance, one consequence of the choice/no choice research design is that studies rarely offer authentic opportunities for children to show their preferred strategies. Fourth, the sample sizes of most of the choice-no choice studies discussed above were relatively small and involved time-intensive procedures. For example, Van Der Auwera et al.’s (Citation2022) recent study involved only 50 students at each grade level and a data collection procedure that involved each student solving tasks in each of three conditions on three days separated by at least one day and never more than one week. Our goal, therefore, was to identify an approach that would facilitate the collection of, essentially, cohort-level data in time-efficient ways that would allow older students authentic opportunities to demonstrate their preferred arithmetic strategies in secure contexts. In light of these matters, the research presented in this paper, focused on Danish grade six students, is framed by the questions:

  • How is strategy adaptivity manifested in students’ approaches to multidigit arithmetic?

  • How is strategy flexibility manifested in students’ approaches to multidigit arithmetic?

  • How is student arithmetical accuracy associated with adaptivity and flexibility?

  • How appropriate is the TriFA model for addressing such questions?

2.1. Methods

As indicated above, many of the tools used in previous research on students’ arithmetic-related strategies, while appropriate for the contexts in which they were used, do not allow for the uncovering of students ‘first choice’ or ‘go to’ strategies. Consequently, partly in response to Xu et al.’s (Citation2017) invitation to adapt it for topic areas other than the linear equations for which it was developed, the TriFA approach was adopted. In addition to eliciting students’ preferred strategies, the TriFA is appropriate for whole-class settings, which means that large numbers of students from representative samples of schools can be assessed. Moreover, students will meet the test in familiar, comfortable and unthreatening contexts and, greatly simplifying procedural matters, all three phases can be completed within a single lesson.

In using a TriFA, we are conscious of earlier criticism of free-choice methods, which argue that such approaches are prone to bias, due to the nature of the tasks themselves and those who solve them. For example, Siegler and Lemaire (Citation1997, pp. 71–72) write that.

if a less accurate strategy is used mainly on easy problems and a more accurate strategy is used primarily on difficult ones, the more accurate strategy may produce lower percentages of correct answers (due to its being used on harder problems) and thus appear to be less accurate. Similarly, if good students tend to rely on Strategy A and less good students tend to rely on Strategy B, Strategy A may appear to yield superior performance even if it is no better than the alternative (for example, because the good students are more careful in executing it).

While we concede the general arguments against free-choice approaches, their validity tends to diminish if the criterion for speed is removed. That is, if one’s goals are focused solely on whether students have a range of strategies available to them, and can identify which of those strategies is the more effective then speed is of little relevance. Moreover, in this study our interest in accuracy is concerned less with particular strategies than its relationship to both adaptivity and flexibility in general.

2.2. Adapting the TriFA for arithmetic

Xu et al.’s (Citation2017) study draws on the notion of an innovative strategy, which they define as ‘the strategy for that problem that had the fewest steps and with the most simplified computations’ (p. 3). In our view, the adjective innovative implies that students’ strategies are novel, yet one cannot infer from students’ solutions whether such strategies are newly-invented or pre-existing. Consequently, drawing on Torbeyns et al. (Citation2009a) and Hickendorff (Citation2018), we refer to shortcut strategies, whereby the solution process is facilitated by appropriate modifications to a task’s numbers and/or operations in ways that both simplify the arithmetic and reduce the number of steps from those of the standard algorithm. In other words, this study is characterised by an emphasis on students attending to task features in order to make decisions about what strategies they employ.

Importantly, two equivalent tests of naked arithmetic tasks (Fuchs et al., Citation2010) were developed to ensure that any outcomes would not be the consequence of idiosyncratic responses to a particular item and that students sitting in adjacent seats would solve different tasks. Table  shows the tasks used to elicit students’ strategies for multidigit addition, subtraction and multiplication. Each task was designed to elicit the various strategies highlighted in the literature (Hickendorff, Citation2020; Sievert et al., Citation2019; Torbeyns et al., Citation2009a). With respect to addition, both sets included an item that could directly be solved by recognising a sum to one hundred (482 + 218 = 400 + 200 + (82 + 18) = 700). Both sets included a three-digit item to prompt the use of compensation as a shortcut strategy (199 + 323 = 200 + 323 − 1 = 522). Both sets included an item that can be addressed by a combination of compensation and a sum to one hundred (349 + 154 = 300 + 100 + 50 + 50 + 3 = 503). Finally, the hundreds digit for each three-digit addition item were chosen to ensure a three-digit sum.

Table 1. Multidigit items used across the different operations.

Both sets of subtraction tasks, adapted from (Hickendorff, Citation2020), included a task with 99 as a subtrahend to elicit compensation as a strategy for large difference numbers (963 − 499 = 964 − 500). Both sets, to prompt the use of indirect addition, included small difference tasks (504 − 476). Both sets included a large difference task to elicit a shortcut sequential strategy (704 − 17 = 704 − 10 − 7 = 687).

Finally, the three multiplication items shown in Table  were also designed to elicit different types of shortcut strategies. Each set included one item designed to elicit either compensation or partitioning as shortcut strategies (5 × 26 = 5 × 25 + 5 × 1 or 5 × 26 = 5 × 20 + 5 × 6). Each set included an item with 15 as either multiplicator or multiplicand with various opportunities for the use of different shortcut strategies (15 × 18 can be addressed by partitions like 10 × 18 + 5 × 18 or 10 × 15 + 8 × 15 or simplifications like 5 × 54 = 5 × 50 + 5 × 4). Each set included an item whose numbers fitted the model (a-b)2, designed to elicit compensation strategies (29 × 31 = 29 × 30 + 29 or 29 × 31 = 30 × 31 − 31).

2.3. Participants

Schools in five demographically different municipalities in Jutland, the central region of Denmark, were contacted to elicit their participation. This yielded 20 schools, both large and small, urban and rural, and 731 grade-six pupils, with a median age of 12 years. Prior to the granting of permission for their children’s participation, all parents received information two letters. The first was through their child’s schools’ communication platform and, the second was by means of an email that included legal information about GDPR. Of all families contacted, only one requested that their child not be included.

2.4. Implementing TriFA

Over a period of four weeks, members of the project team undertook the TriFA with each participating class. In each case, the complete assessment was completed in under an hour according to the timeline shown in Table , which was intended to act as a guide, although actual times varied slightly according to differences in class size and questions posed by students about the assessment and its implementation. Each phase was preceded by an oral introduction and instructions for its completion, including alerting students to the research focus of the assessment and, importantly, promising that it would have no impact on their grades.

Table 2. Recommended timeline for the TriFA implementation.

During the first phase, as indicators of their preferred strategies and adaptivity, students solved each task once, by whatever means they thought was appropriate. Students offering shortcut strategies rather than standard algorithms were construed as adaptive. During the second phase, as an indicator of strategy flexibility, students solved the same tasks again by whatever alternative strategies they were able to generate. In the third phase, as a further indication of strategy adaptivity, students evaluated the strategies generated in the first two phases to identify which they believed was optimal (Xu et al., Citation2017).

Procedurally, each student received a booklet comprising one page for each test item; Figure  shows the page for 482 + 218. Acknowledging that the instructions were written in Danish, it is probably important for the reader to know that in Scandinavian languages (Danish, Norwegian and Swedish) the verbs to count and to calculate are the same; in Danish, as shown in Figure , it is regne. Children typically learn from the context what the verb is inviting them to do and acquire at a young age an understanding that the verb represents a general invitation to find a solution. This is the case here and the manner of the tasks’ presentation would not rule out the possibility of a student actually counting on.

Figure 1. Exemplar solution sheet for phases 1, 2 and 3.

Figure 1. Exemplar solution sheet for phases 1, 2 and 3.

During phase 1, students wrote their solution strategy in the large box at the top of the page. Acknowledging each task’s design characteristics, and drawing on Hickendorff (Citation2018), students were asked to examine the numbers involved before deciding how to solve it. Where they used a mental strategy, they were asked to write as best they could the various stages of their mental reasoning. In addition, students received a second sheet, see Figure , on which they were asked to write the answer, but no working, to each task of phase 1. These answer sheets, which would be used later to evaluate students’ accuracy, were collected before phase 2 was introduced. During phase 2, students were invited to return to each of the tasks of phase 1 and, in the smaller boxes on each page, write any additional solutions they could. Finally, in phase 3, students were asked to tick, in the small circle, which of their solutions they believed was the best.

Figure 2. Exemplar answer sheet for phase 1.

Figure 2. Exemplar answer sheet for phase 1.

2.5. Coding

As indicated above, a shortcut strategy is construed as involving an appropriate modification of a task’s numbers and/or operations in ways that both simplify the arithmetic and reduce the number of steps from those of the standard algorithm. With respect to adaptivity, any phase one task solved by means of a shortcut strategy was recorded as having been solved adaptively. The rationale for this decision was based on two key characteristics of the project’s design. First, every task was designed to elicit shortcut strategies. Second, students were instructed at the start of phase one to pay attention to task characteristics before attempting their solution. Taken together, students who chose to use a standard algorithm for a particular task during phase one were deemed not to have demonstrated adaptivity on that task. With respect to flexibility, any task solved, over phases one and two, by two or more solutions was recorded as having been solved flexibly.

Two experts in the field independently undertook the development of a coding schedule for the various strategies used with the different items. These were then compared and contrasted to produce an agreed set of shortcut strategies for each item. In the case of uncertainty or disagreement, the opinion of a third expert was sought and a resolution achieved. Table , by way of exemplar, shows the set of agreed shortcut strategies for the item 482 + 218.

Table 3. Examples of shortcut strategies for 482 + 218, inspired by Sievert et al. (Citation2019).

Each student’s solutions were coded by the first author and at least one of several research assistants trained to code the TriFA. Students’ answers, as recorded on the answer sheet of Figure , were coded as correct, incorrect or incomplete. In addition, drawing on the solutions presented in their solution booklets, the total number of strategies and whether or not those strategies were shortcut were recorded. Inter-rater reliability, based on comparisons of 60 sets of grade 6 codes from each research assistant and the first author, yielded the following. First, with respect to the accuracy of students’ answers, the average Cohen Kappa for accuracy between the first author and the research assistants was 0.93, which is indicative of exceptionally high inter-rater reliability (Landis & Koch, Citation1977; McHugh, Citation2012). Any differences were typically unrelated to the correctness of a student’s answer but whether it was incorrect or incomplete. The Cohen Kappa for the number of strategies was, at 0.73, substantial, while that for determining whether students’ strategies were shortcut was, while moderate at 0.54, a little disappointing. Our view is that the differing levels of inter-coder reliability reflect the differing complexities of the three forms of coding. That being said, every disagreement, irrespective of its source, was checked by the first author for explanations attributable to misinterpretations of the definitions found in the coding guide. In the event of any uncertainty, the opinion of a third expert was sought and the disagreement resolved.

2.6. Analytical procedures

To facilitate analyses and following the approaches of Hästö et al. (Citation2019), where a version of the TriFA was used to measure students’ flexibility with linear equations, students were grouped according to pre-specified levels of adaptivity and flexibility. With respect to adaptivity, and in accordance with the project’s emphasis on task characteristics (Hickendorff, Citation2018, Citation2019; Threlfall, Citation2002; Verschaffel et al., Citation2009) students were grouped, as shown in Table , according to the number of phase-one tasks they solved by means of shortcut strategies. With respect to flexibility, and drawing on the results of phases one and two, similar groupings are shown in Table , based on the number of tasks solved by the individual student in two or more ways. For both groupings, accuracy, according to the number of items solved correctly, was calculated separately for each level of adaptivity and level of flexibility, separately. In other words, students were not grouped by their level of accuracy.

Finally, in this section, t-tests were conducted to determine the comparability of the different sets of items. The figures of Table  show, with respect to adaptivity, flexibility and accuracy, no statistically significant differences between the outcomes of the two sets, thus warranting our combining the data into a single set.

Table 4. Two-sample t tests conducted to ensure compatibility of the different item sets.

Table 5. Levels of adaptivity and corresponding accuracy.

3. Results

The results are presented in four parts. First, drawing on the results of phase 1, we examine students’ strategy adaptivity and the relationship between adaptivity and accuracy. Second, drawing on the data from phases 1 and 2, we examine students’ strategy flexibility and the relationship between flexibility and accuracy. Third we examine the relationship between adaptivity and flexibility. Fourth, we examine differences in students’ adaptivity and flexibility, across the three operations and phases of assessment. Throughout, we have worked to a statistical threshold of p < .001 for determining the significance of any results.

3.1. Adaptivity and accuracy

The figures of Table  show, with respect to phase one, the distribution of students’ adaptivity levels. As can be seen, very high numbers of students showed little or no adaptivity, while very few students demonstrated high levels of adaptivity. The figures also indicate generally high levels of accuracy and, unsurprisingly, a strong tendency for accuracy to increase with adaptivity.

To investigate more fully the connections between adaptivity and accuracy, students’ levels of accuracy were determined according to the protocol shown in Table . Following this, the figures of Table  show the distribution of students’ accuracy across the different adaptivity levels, with a chi square test (LR χ2 = 53.75, p < .0001) confirming that high levels of adaptivity resonate with high levels of accuracy.

Table 6. Protocol for grouping students by the number of correctly solved items.

Table 7. Accuracy levels as a proportion of all students at each level of adaptivity.

A one-way ANOVA (F(3,727)=17.89,p<.0001), warranted by evidence that with large samples parametric procedures are preferable to non-parametric even when data are skewed (Blanca et al., Citation2017; Fagerland, Citation2012; Vickers, Citation2005), confirmed a strong positive relationship between adaptivity and accuracy. That is, students with higher levels of adaptivity are significantly more accurate than students with low or no adaptivity. Moreover, the results of a Tukey’s HSD Post hoc test for multiple comparisons (Table ) show that students’ mean accuracy increased from all lower to higher performing adaptivity group, and that these increases were significant between None and all higher levels of adaptivity and for low and high levels of adaptivity. In this respect, it can be seen, for example, that students with high levels of adaptivity score, on average, 2.45 points more in accuracy (max score = 9).

Table 8. Tukey Post hoc tests showing differences in mean accuracy score of the different adaptivity levels alongside tests of significance.

Table 9. Frequency (percentage) of students in each level of flexibility.

3.2. Flexibility and accuracy

With respect to flexibility, as shown in Table , students were grouped according to the number of items they solved by means of more than one strategy, whether shortcut or standard algorithms. The figures indicate not only that more than half of all students (56%) demonstrate moderate or high levels of flexibility but that these figures are considerably higher than the corresponding figures for adaptivity shown in Table . That said, as with adaptivity, the general tendency is for accuracy to increase with flexibility. The figures of Table  show, confirmed by a one-way ANOVA (F(3,727) = 32.49, p < .0001), that increases in flexibility typically correspond to increases in accuracy. The exception lies with those students with no flexibility, whose accuracy is broadly equally distributed across the three higher levels of accuracy. In addition, the results of a Tukey’s HSD Post hoc test for multiple comparisons (Table ) show that students’ mean accuracy increased across all possible pairings, from a lower to a higher-performing flexibility group, and that these increases were significant across all dyads with the exception of that between medium and high levels of flexibility, which marginally failed to register as significant. Indeed, the figures show that high flexibility student scored, on average, 2.19 points (on a 0–9 scale) higher in accuracy than the no flexibility students.

Table 10. Accuracy levels as a proportion of all students at each level of flexibility.

Table 11. Tukey Post hoc tests showing differences in mean accuracy score of the different flexibility levels alongside tests of significance.

3.3. The relationship between adaptivity and flexibility

While the data presented thus far show that both strategy adaptivity and strategy flexibility are implicated in student accuracy, it is important to consider which, if any, is the more influential. The figures of Table  show, for each adaptivity level, the percentage of students at each flexibility level. A chi square test (χ2 = 64.57, p < .0001) confirms that such a distribution is unlikely to be due to chance. While there are some indications that high/low levels of adaptivity may equate to high/low levels of flexibility respectively, the overall picture is more complex. Across the two rows corresponding to low and medium adaptivity, the flexibility distributions are, to all practical purposes, identical. Also, regardless of their adaptivity levels, the proportion of students evaluated as having medium flexibility is higher than chance would have indicated. However, a one-way ANOVA indicates statistically significant different levels of flexibility across the adaptivity levels F=(3,727)=17.30,p<.0001. Indeed, the figures of Table  show, for each level of adaptivity, positive increases in flexibility (score 0–9). With one exception, the difference in flexibility from low to medium adaptivity, this positive difference was statistically significant. Overall, the mean flexibility score across all levels of adaptivity was 3.19 (SD = 2.79), highlighting the significance of the 3.39 flexibility difference between no adaptivity and high adaptivity students. In sum, confirming flexibility as a precursor to adaptivity, the figures indicate that higher levels of adaptivity appear related to higher levels of flexibility.

Table 12. Flexibility as a percentage of the students at each adaptivity level.

Table 13. Tukey Post hoc test showing differences in Flexibility (score 0–9) for different levels of adaptivity.

By way of contrast, the figures of Table  show, for each flexibility level, the percentage of students at each level of adaptivity. A chi square test (χ2 = 30.26, p = .0004) confirms that the distribution is unlikely to be due to chance. For each level of flexibility, high proportions of students demonstrate low or no adaptivity. Indeed, two-thirds of high flexibility students were shown to have no or low adaptivity. Moreover, the proportions of students showing high levels of adaptivity, irrespective of their flexibility level, seem remarkably low. In other words, the figures of Table  suggest that flexibility has a limited predictive power with respect to adaptivity. That being said, a one-way ANOVA shows significant differences for adaptivity (0–9) across the different levels of flexibility(F(3,727)=14.21,p<.0001). Indeed, the figures of Table  show, for each flexibility dyad a positive increase in adaptivity, which, in four cases was statistically significant. Overall, the mean adaptivity score across all levels of flexibility was just 1.39 (on a 0–9 scale), highlighting the significance of the 1.48 adaptivity difference between no flexibility and high flexibility students. In sum, the data suggest that access to multiple strategies is no guarantee that grade six students will either use shortcut strategies as a first choice or recognise them as optimal.

Table 14. Adaptivity as a percentage of the students at each level of flexibility.

Table 15. Post hoc Tukey test showing differences in adaptivity (score 0–9) for different levels of flexibility.

To investigate further the relationship between students’ flexibility and adaptivity, we examined those items that students identified as being solved optimally against when students first used a shortcut strategy. The figures of Table  show that when students’ first use of shortcut strategies was during phase 1, then 76% of optimal solutions identified in phase 3 were shortcut. By way of contrast, when students’ first use of shortcut strategies was during phase 2, then only 25% of optimal solutions identified in phase 3 were shortcut. Put another way, adaptive students, defined as those who introduce shortcut strategies in phase 1, are three times more likely to identify shortcut strategies as optimal than students who introduce shortcut strategies in phase 2.

Table 16. Percentage of students who, during phase 3, identified a shortcut strategy as the optimum in relation to when they first introduced a shortcut strategy.

3.4. Adaptivity, flexibility and the different operations

Figure  shows, for each operation, the percentage of phase 1 items solved by shortcut strategies (light grey), the percentage of phase 2 items solved only with shortcut strategies (dark grey) and the percentage of items solved with shortcut strategies in both phases 1 & 2 (mid-grey). It can be seen that while the use of solely shortcut strategies in both phases 1 and 2 is, in relative terms, rare, the proportion of phase-one items solved by shortcut strategies increases across the operations. Overall, confirming the generally low levels of adaptivity, the figures indicate a tendency for most students to invoke standard algorithms at some point.

Figure 3. Percentage of tasks solved uniquely by shortcut strategies at each phase of the study

Figure 3. Percentage of tasks solved uniquely by shortcut strategies at each phase of the study

Finally, Figure  shows the percentage of all items solved by more than one strategy, whether shortcut or not. It can be clearly seen that students demonstrate most flexibility when engaged in additive tasks, while their flexibility with respect to both subtraction and multiplication is, typically, around half that of addition. One interpretation, confirming the above, is that once standard algorithms become established, particularly with respect to subtraction and multiplication, students do not expect to engage with shortcut strategies, with the consequence that even when prompted, they may be unable to offer alternatives to the standard algorithms.

Figure 4. Flexibility by operation.

Figure 4. Flexibility by operation.

4. Discussion

In this paper, through an analysis of their use of shortcut strategies (Hickendorff, Citation2018; Torbeyns et al., Citation2009a, Citation2009b) and a novel application of a Tri-phase Flexibility Assessment (TriFA) (Xu et al., Citation2017), we have evaluated the strategy adaptivity and strategy flexibility of Danish grade six students with respect to multidigit arithmetic. Flexibility was defined as the ability to solve the same task by means of more than one strategy, while adaptivity depended on whether students’ first choice strategy or perception of which of their proposed strategies was optimal was shortcut. The multidigit arithmetic tasks used in the assessment were designed to elicit various forms of shortcut strategies based on particular task and operation characteristics. In the following, we discuss the outcomes of this study.

From the perspective of adaptivity, as measured by the number of items solved by means of shortcut strategies, the majority of students appeared limited. Indeed, more than three-quarters of the 731 participating students were assessed as possessing either low or no adaptivity. When set against a reform-oriented Danish curriculum that encourages the development of number-based strategies rather than standard algorithms (Ministry of Children and Education, Citation2019) these seem disappointingly problematic. The reasons for this apparent failure of Danish students to acquire the expected competences are not clear, although, acknowledging research internationally, it would not be surprising to find it due to instructional practices and parental interventions that privilege standard algorithms, as found in Flanders (Torbeyns & Verschaffel, Citation2013, Citation2016), France (Lemaire & Brun, Citation2017), Germany (Selter, Citation2001), Hungary (Csíkos Citation2016) and Italy (Lucangeli et al., Citation2003). However, such low proportions may not be as disappointing as the figures indicate. A similar free choice study of Dutch sixth grade students, who typically experience a European curriculum with an emphasis on number-based strategies, found students disinclined to use shortcut strategies, even when, as in the study reported here, tasks were chosen to elicit them (Hickendorff, Citation2018). In particular, she found, with respect to addition, subtraction and multiplication, an average of 12% of tasks solved by means of shortcut strategies, figures substantially less than the 22% represented above in Figure . That being said, highlighting the problematic nature of comparing the outcomes of different but conceptually similar studies, the lower figure found by Hickendorff may have been a consequence of the fact that only one-third of the addition, subtraction and multiplication tasks of her study were prefaced by an instruction to take a close look at the numbers and try to solve the task as cleverly as possible (Hickendorff, Citation2018, p. 583). Indeed, she found that such an instruction, which all students in our study received for all tasks, led to a significantly higher use of shortcut strategies on addition and multiplication but not subtraction. This subtraction-related lack of shortcut strategies, even when prompted, resonates well with the results of Figure , and may be a consequence of students’ familiarity and confidence with positive integer addition (and possibly multiplication) as a process of growth, where adding is joining, as opposed to subtraction construed as a process of separation or taking away (Bruno & Martinón, Citation1999).

That being said, perhaps the most important result concerning students’ adaptivity is its relationship to accuracy. The figures of Tables  and  show relatively high levels of accuracy across the different levels of adaptivity, although it is equally clear that accuracy increases with increasing adaptivity. Such findings seem to conflict with those of a Flemish study, albeit one focused solely on subtraction, that found no variation in strategy use across different attainment groups (Torbeyns et al., Citation2018). Importantly, the figures of Table  show quite clearly that adaptive students are significantly more likely to be accurate than non-adaptive students. Interestingly, while these findings resonate closely with a Finnish study that found adaptivity to be closely related to grade six students’ arithmetic fluency and knowledge of arithmetic concepts (McMullen et al., Citation2016), they conflict with those of Hickendorff (Citation2018), who found no significant impact of shortcut strategy use on students’ multidigit arithmetic accuracy. Also, Hickendorff (Citation2018) found consistently high levels of achievement.

Moreover, with potentially important developmental implications for Danish students, McMullen et al. (Citation2017) found that that Finnish students’ adaptive number knowledge, even after taking into account arithmetic conceptual knowledge and arithmetic fluency, is a strong predictor of later pre-algebra skills.

From the perspective of flexibility, as measured by the number of items solved by means of two or more strategies, almost half of all students were assessed as having little or no flexibility. Such results are not dissimilar to studies conducted in Flanders, where grade-three students, even after exposure to teaching focused on informal strategies, rarely show flexibility in their solutions to multidigit subtraction problems (De Smedt et al., Citation2010). However, a German study found that grade-two students taught in ways that emphasise conceptual understanding alongside procedural skills become more flexible than students taught in ways the emphasis only procedural skills. Indeed, in accordance with our definition of flexibility, conceptually-based instruction enabled most students (84% with respect to addition and 62% with respect to subtraction) to offer two solutions to each task (Blöte et al., Citation2001). Unfortunately, many studies nominally addressing strategy flexibility conflate flexibility and adaptivity, making comparisons problematic (see, for example, Nunes et al., Citation2020). That being said, the relatively low levels of strategy flexibility identified above resonate with studies of students’ equation-solving flexibility, whether at grade eight (Schneider et al., Citation2011), grade nine (Newton et al., Citation2010) or grade eleven (Hästö et al., Citation2019). In other words, strategy flexibility is poorly manifested both internationally and across topics, further confirming the tendency for students, having learnt standard algorithms, to reject or ignore alternatives (Csíkos, Citation2016; Hickendorff, Citation2020; Lemaire & Brun, Citation2017; Lucangeli et al., Citation2003; Selter, Citation2001; Torbeyns & Verschaffel, Citation2013, Citation2016).

Furthermore, although the data indicate that flexibility is implicated in accuracy, the relationship is less robust than that between adaptivity and accuracy. Indeed, the results accord well with those of a TriFA-related study of Finnish grade eleven students’ strategy flexibility with respect to linear equations, whereby a ‘“triangle” type relation between accuracy and flexibility’ was found with, essentially, ‘no students with high flexibility and low accuracy’ but some ‘with low flexibility but high accuracy’ (Hästö et al., Citation2019, p. 5).

Overall, the results indicate that the relationship between flexibility and adaptivity is complex. Indeed, while it seems clear that adaptive students are typically flexible, the reverse cannot be said with the same degree of confidence; flexible students are not typically adaptive. Such findings, which resonate with an earlier study, whereby equation-solving experts were consistently able to exhibit strategy flexibility but were inconsistent in their selection of the most efficient strategy (Star & Newton, Citation2009), challenge earlier assertions that flexibility can be seen as a precondition for adaptivity (Verschaffel et al., Citation2007). The findings also confirm earlier research that exposure to standard algorithms tends to limit students’ ability to trust alternative, number-based, shortcut strategies. Indeed, students whose preferred strategy is shortcut are more able to make judgements about strategy appropriateness than students whose first strategy is not shortcut. Further, the evidence indicates that adaptivity is a stronger predictor of accuracy than flexibility, highlighting the importance of adaptivity as an educational objective (Baroody, Citation2003; Hickendorff, Citation2018; Xu et al., Citation2017).

Finally, as found in earlier studies, strategy choice is strongly dependent on the operation (Torbeyns et al., Citation2009b). For example, irrespective of whether their preferred strategy was shortcut, when considering alternative approaches, students were not only more able to offer shortcut strategies for additive than either subtractive or multiplicative tasks but that additive tasks prompted the greatest flexibility. Interestingly, while multiplicative tasks yielded the highest proportion of preferred shortcut strategies, relatively few students were able to offer a second shortcut strategy. Finally, subtraction seemed to limit students’ use of shortcut strategies more than either addition or multiplication. As above, such findings confirm that once standard algorithms become established, particularly with respect to subtraction, students do not expect to engage with shortcut strategies, with the consequence that even when prompted, they may be unable to offer alternatives to the standard algorithms. In sum, we hypothesis that students’ decisions with respect to their preferred strategies are influenced not only by what they have been taught (Csíkos, Citation2016; Hickendorff, Citation2020; Lemaire & Brun, Citation2017; Lucangeli et al., Citation2003; Selter, Citation2001; Torbeyns & Verschaffel, Citation2013, Citation2016) but, importantly, by what they have been led to trust and internalise as desirable.

5. Reflections on method

In this paper, with respect to multidigit arithmetic, we have investigated the relationship between the strategy adaptivity, strategy flexibility and solution accuracy of Danish grade six students. In so doing, we have successfully exploited a novel adaptation of a Tri-phase Flexibility Assessment model (TriFA) previously used to uncover students’ strategy use when solving linear equations (Xu et al., Citation2017), and for the reasons outlined below, would recommend similar adaptations to colleagues working in different content domains.

We have shown the adapted TriFa to be a simple and quick to implement diagnostic tool suitable for assessing large numbers of students in an unthreatening and familiar environment. Each phase of the assessment offers unique insights into students’ strategy choices that can be examined and reported at both individual and class levels, as well as facilitating the planning of didactical activities. Phase one offers insights into what strategies students choose to use, with those turning to shortcut strategies being defined as adaptive. Phase two offers insights into students’ strategy repertoires and whether they can be defined as flexible. Finally, phase three offers insights into when, if ever, students identify shortcut strategies as optimal.

We have shown that by not restricting students’ strategy choice or privileging speed has facilitated our identifying an important distinction between adaptivity and flexibility (Rathgeb-Schnierer & Green, Citation2017). Indeed, moving beyond studies that construe flexibility and adaptivity as synonyms (Baroody, Citation2003; Hickendorff, Citation2018) or those that see adaptivity as a component of flexibility (Blöte et al., Citation2000; Rathgeb-Schnierer & Green, Citation2017; Schneider et al., Citation2011; Star & Rittle-Johnson, Citation2008), we have shown that the TriFA has facilitated a nuanced understanding not only of the interrelationship of flexibility and adaptivity but also their differential impact on accuracy. We have shown how the TriFa has yielded results that both resonate and contrast with studies undertaken elsewhere, particularly in culturally comparable European contexts, confirming that students’ strategy choices are influenced by cultural norms and expectations (Hatano & Inagaki, Citation1986; Lin et al., Citation2007), particularly systemic decisions concerning the introduction of standard algorithms (Csíkos, Citation2016; De Smedt et al., Citation2010; Hickendorff, Citation2020; Lemaire & Brun, Citation2017; Lucangeli et al., Citation2003; Nemeth et al., Citation2019; Torbeyns & Verschaffel, Citation2016), and importantly, researchers’ methodologies; choice/no choice studies typically yield different results from free-choice and confirm the significance of our novel methodology.

Overall, the TriFA has proved a powerful tool in support of our quest to understand students’ arithmetic task-related strategy choices, albeit at just one grade level, and the relationship of those choices to accuracy. Also, by way of a pointer towards future research, there remains a question that has yet to be addressed adequately; why, even in contexts like Denmark and the Netherlands, where national curricula privilege informal strategies above standard algorithms, do students reject efficient number-based shortcut strategies? Is the teaching of shortcut strategies, as implied by Torbeyns et al. (Citation2009a), simply not feasible? Is it a consequence, as implied by Skott (Citation2004), of loosely-framed curricula leading to a plethora of didactical approaches, each unique to the individual teacher? Elsewhere in Scandinavia, researchers have identified commonly articulated but differently enacted strategies (Andrews & Larson, Citation2017; Andrews & Nosrati, Citation2018), but in the context of Denmark, little is known about how teachers present mathematics to their students, a problem in need of resolution.

Finally, we close by inviting teachers to reflect on how they choose the tasks they present their students. Do the textbooks they use encourage the learning of shortcut strategies or subordinate them to standard algorithms? If the latter, what alternative resources do teachers have available to them? Have their teacher education programmes provided them with the skills to adapt tasks, like those used in this study, in ways that will facilitate students recognising the number relationships that simplify what they choose to do?

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was supported by Danmarks Frie Forskningsfond [grant number #9038-00045B.].

References

  • Andrews, P., & Larson, N. (2017). Analysing genomgång: A Swedish mathematics teaching lesson event. Nordic Studies in Mathematics Education, 22(3), 85–105.
  • Andrews, P., & Nosrati, M. (2018). Gjennomgang and genomgång: Same or different? In H. Palmér & J. Skott (Eds.), Students’ and teachers’ values, attitudes, feelings and beliefs in mathematics classrooms (pp. 113–124). Springer International. https://doi.org/10.1007/978-3-319-70244-5_11.
  • Baroody, A. (2003). The development of adaptive expertise and flexibility: The integration of conceptual and procedural knowledge. In A. Baroody & A. Dowker (Eds.), The development of arithmetic concepts and skills: Constructing adaptive expertise (pp. 1–33). Lawrence Erlbaum Associates Publishers.
  • Baroody, A. J., & Dowker, A. E. (2003). The development of arithmetic concepts and skills construction adaptive expertise. Lawrence Erlbaum Associates Publishers.
  • Blanca, M., Alarcón, R., Arnau, J., Bono, R., & Bendayan, R. (2017). Non-normal data: Is ANOVA still a valid option? Psicothema, 29(4), 552–557. https://doi.org/10.7334/psicothema2016.383
  • Blöte, A. W., Klein, A. S., & Beishuizen, M. (2000). Mental computation and conceptual understanding. Learning and Instruction, 10(3), 221–247. ScienceDirect. https://doi.org/10.1016/S0959-4752(99)00028-6.
  • Blöte, A. W., Van der Burg, E., & Klein, A. S. (2001). Students’ flexibility in solving two-digit addition and subtraction problems: Instruction effects. Journal of Educational Psychology, 93(3), 627–638. https://doi.org/10.1037/0022-0663.93.3.627
  • Bruno, A., & Martinón, A. (1999). The teaching of numerical extensions: The case of negative numbers. International Journal of Mathematical Education in Science and Technology, 30(6), 789–809. https://doi.org/10.1080/002073999287482
  • Csíkos, C. (2016). Strategies and performance in elementary students’ three-digit mental addition. Educational Studies in Mathematics, 91(1), 123–139. https://doi.org/10.1007/s10649-015-9658-3
  • De Corte, E. (2007). Learning from instruction: The case of mathematics. Learning Inquiry, 1(1), 19–30. https://doi.org/10.1007/s11519-007-0002-4
  • De Smedt, B., Torbeyns, J., Stassens, N., Ghesquière, P., & Verschaffel, L. (2010). Frequency, efficiency and flexibility of indirect addition in two learning environments. Learning and Instruction, 20(3), 205–215. https://doi.org/10.1016/j.learninstruc.2009.02.020
  • Fagerland, M. W. (2012). t-tests, non-parametric tests, and large studies—a paradox of statistical practice? BMC Medical Research Methodology, 12(1), 78. https://doi.org/10.1186/1471-2288-12-78
  • Fagginger Auer, M. F., Hickendorff, M., & van Putten, C. M. (2016). Solution strategies and adaptivity in multidigit division in a choice/no-choice experiment: Student and instructional factors. Learning and Instruction, 41, 52–59. https://doi.org/10.1016/j.learninstruc.2015.09.008
  • Fuchs, L., Powell, S. R., Seethaler, P. M., Fuchs, D., Hamlett, C. L., Cirino, P. T., & Fletcher, J. M. (2010). A framework for remediating number combination deficits. Exceptional Children, 76(2), 135–156. https://doi.org/10.1177/001440291007600201
  • Hästö, P., Palkki, R., Tuomela, D., & Star, J. R. (2019). Relationship between mathematical flexibility and success in national examinations. European Journal of Science and Mathematics Education, 7(1), 1–13. https://doi.org/10.30935/scimath/9530
  • Hatano, G. (1988). Social and motivational bases for mathematical understanding. New Directions for Child and Adolescent Development, 1988(41), 55–70. https://doi.org/10.1002/cd.23219884105
  • Hatano, G., & Inagaki, K. (1986). Two courses of expertise. In H. Stevenson, H. Azuma, & K. Hakuta (Eds.), Child development and education in Japan (pp. 262–272). W H Freeman/Times Books/ Henry Holt & Co.
  • Hatano, G., & Oura, Y. (2003). Commentary: Reconceptualizing school learning using insight from expertise research. Educational Researcher, 32(8), 26–29. https://doi.org/10.3102/0013189X032008026
  • Heinze, A., Arend, J., Gruessing, M., & Lipowsky, F. (2018). Instructional approaches to foster third graders’ adaptive use of strategies: An experimental study on the effects of two learning environments on multi-digit addition and subtraction. Instructional Science, 46(6), 869–891. https://doi.org/10.1007/s11251-018-9457-1
  • Heinze, A., Star, J. R., & Verschaffel, L. (2009). Flexible and adaptive use of strategies and representations in mathematics education. ZDM, 41(5), 535–540. https://doi.org/10.1007/s11858-009-0214-4
  • Hickendorff, M. (2018). Dutch sixth graders’ use of shortcut strategies in solving multidigit arithmetic problems. European Journal of Psychology of Education, 33(4), 577–594. https://doi.org/10.1007/s10212-017-0357-6
  • Hickendorff, M. (2020). Fourth graders’ adaptive strategy use in solving multidigit subtraction problems. Learning and Instruction, 67, 101311. https://doi.org/10.1016/j.learninstruc.2020.101311
  • Hickendorff, M., Torbeyns, J., & Verschaffel, L. (2018). Grade-related differences in strategy use in multidigit division in two instructional settings. British Journal of Developmental Psychology, 36(2), 169–187. https://doi.org/10.1111/bjdp.12223
  • Hickendorff, M., Torbeyns, J., & Verschaffel, L. (2019). Multi-digit addition, subtraction, multiplication, and division strategies. In A. Fritz, V. G. Haase, & P. Räsänen (Eds.), International handbook of mathematical learning difficulties (pp. 543–560). Springer. https://doi.org/10.1007/978-3-319-97148-3_32.
  • Landis, J. R., & Koch, G. G. (1977). The measurement of observer agreement for categorical data. Biometrics, 33(1), 159–174. https://doi.org/10.2307/2529310
  • Lemaire, P., & Brun, F. (2017). Age-related changes in children’s strategies for solving two-digit addition problems. Journal of Numerical Cognition, 3(3), 582–597. https://doi.org/10.5964/jnc.v3i3.117
  • Lemaire, P., & Callies, S. (2009). Children’s strategies in complex arithmetic. Journal of Experimental Child Psychology, 103(1), 49–65. https://doi.org/10.1016/j.jecp.2008.09.007
  • Lin, X., Schwartz, D. L., & Bransford, J. (2007). Intercultural adaptive expertise: Explicit and implicit lessons from Dr. Hatano. Human Development, 50(1), 65–72. https://doi.org/10.1159/000097686
  • Lucangeli, D., Tressoldi, P. E., Bendotti, M., Bonanomi, M., & Siegel, L. S. (2003). Effective strategies for mental and written arithmetic calculation from the third to the fifth grade. Educational Psychology, 23(5), 507–520. https://doi.org/10.1080/0144341032000123769
  • Luwel, K., Onghena, P., Torbeyns, J., Schillemans, V., & Verschaffel, L. (2009). Strengths and weaknesses of the choice/no-choice method in research on strategy use. European Psychologist, 14(4), 351–362. https://doi.org/10.1027/1016-9040.14.4.351
  • McHugh, M. L. (2012). Interrater reliability: The kappa statistic. Biochemia Medica, 22(3), 276–282. https://doi.org/10.11613/BM.2012.031
  • McMullen, J., Brezovszky, B., Hannula-Sormunen, M. M., Veermans, K., Rodríguez-Aflecht, G., Pongsakdi, N., & Lehtinen, E. (2017). Adaptive number knowledge and its relation to arithmetic and prealgebra knowledge. Learning and Instruction, 49, 178–187. https://doi.org/10.1016/j.learninstruc.2017.02.001
  • McMullen, J., Brezovszky, B., Rodríguez-Aflecht, G., Pongsakdi, N., Hannula-Sormunen, M. M., & Lehtinen, E. (2016). Adaptive number knowledge: Exploring the foundations of adaptivity with whole-number arithmetic. Learning and Individual Differences, 47, 172–181. https://doi.org/10.1016/j.lindif.2016.02.007
  • McMullen, J., Hannula-Sormunen, M. M., Lehtinen, E., & Siegler, R. S. (2020). Distinguishing adaptive from routine expertise with rational number arithmetic. Learning and Instruction, 68, 101347. https://doi.org/10.1016/j.learninstruc.2020.101347
  • McMullen, J., Hannula-Sormunen, M. M., Lehtinen, E., & Siegler, R. S. (2022). Predicting adaptive expertise with rational number arithmetic. British Journal of Educational Psychology, 92(2), 688–706. https://doi.org/10.1111/bjep.12471
  • Ministry of Children and Education. (2019). Matematik. Fælles Mål. 2019. https://emu.dk/sites/default/files/2020-09/GSK_F%C3%A6llesM%C3%A5l_Matematik.pdf
  • Nemeth, L., Werker, K., Arend, J., Vogel, S., & Lipowsky, F. (2019). Interleaved learning in elementary school mathematics: Effects on the flexible and adaptive use of subtraction strategies. Frontiers in Psychology, 10, 1–21. https://doi.org/10.3389/fpsyg.2019.00086
  • Newton, K. J., Star, J. R., & Lynch, K. (2010). Understanding the development of flexibility in struggling algebra students. Mathematical Thinking and Learning, 12(4), 282–305. https://doi.org/10.1080/10986065.2010.482150
  • Nunes, T., Vargas, B., Luciana, D., & Corso, V. (2020). Flexible mental calculation: Reasoning profiles of Brazilian students in second and fourth grades. Journal of Mathematics Education, 13(1), 35–55. https://doi.org/10.26711/007577152790053
  • Peters, G., De Smedt, B., Torbeyns, J., Ghesquière, P., & Verschaffel, L. (2013). Children's use of addition to solve two-digit subtraction problems. British Journal of Psychology, 104(4), 495–511. https://doi.org/10.1111/bjop.12003
  • Rathgeb-Schnierer, E., & Green, M. (2017). Profiles of cognitive flexibility in arithmetic reasoning: A cross-country comparison of German and American elementary students. Journal of Mathematics Education, 10(1), 1–16. https://doi.org/10.26711/007577152790009
  • Rittle-Johnson, B., & Star, J. R. (2007). Does comparing solution methods facilitate conceptual and procedural knowledge? An experimental study on learning to solve equations. Journal of Educational Psychology, 99(3), 561–574. https://doi.org/10.1037/0022-0663.99.3.561
  • Schneider, M., Rittle-Johnson, B., & Star, J. R. (2011). Relations among conceptual knowledge, procedural knowledge, and procedural flexibility in two samples differing in prior knowledge. Developmental Psychology, 47(6), 1525–1538. https://doi.org/10.1037/a0024997
  • Selter, C. (2001). Addition and subtraction of three-digit numbers: German elementary children’s success, methods and strategies. Educational Studies in Mathematics, 47(2), 145–173. https://doi.org/10.1023/A:1014521221809
  • Si, J., Li, H., Sun, Y., Xu, Y., & Sun, Y. (2016). Age-related differences of individuals’ arithmetic strategy utilization with different level of math anxiety. Frontiers in Psychology, 7, 1–11. https://doi.org/10.3389/fpsyg.2016.01612
  • Siegler, R. S., & Lemaire, P. (1997). Older and younger adults’ strategy choices in multiplication: Testing predictions of ASCM using the choice/no-choice method. Journal of Experimental Psychology: General, 126(1), 71–92. https://doi.org/10.1037/0096-3445.126.1.71
  • Sievert, H., van den Ham, A.-K., Niedermeyer, I., & Heinze, A. (2019). Effects of mathematics textbooks on the development of primary school children's adaptive expertise in arithmetic. Learning and Individual Differences, 74, 101716. https://doi.org/10.1016/j.lindif.2019.02.006
  • Skott, J. (2004). The forced autonomy of mathematics teachers. Educational Studies in Mathematics, 55(1), 227–257. https://doi.org/10.1023/B:EDUC.0000017670.35680.88
  • Star, J. R., & Newton, K. J. (2009). The nature and development of experts’ strategy flexibility for solving equations. ZDM, 41(5), 557–567. https://doi.org/10.1007/s11858-009-0185-5
  • Star, J. R., & Rittle-Johnson, B. (2008). Flexibility in problem solving: The case of equation solving. Learning and Instruction, 18(6), 565–579. https://doi.org/10.1016/j.learninstruc.2007.09.018
  • Sunde, P. B., De Smedt, B., Verschaffel, L., & Sunde, P. (2023). Grade one single-digit addition strategies as predictors of grade four achievement in mathematics. European Journal of Psychology of Education. Advance online publication. https://doi.org/10.1007/s10212-023-00761-x
  • Sunde, P. B., Sunde, P., & Sayers, J. (2020). Sex differences in mental strategies for single-digit addition in the first years of school. Educational Psychology, 40(1), 82–102. https://doi.org/10.1080/01443410.2019.1622652
  • Threlfall, J. (2002). Flexible mental calculation. Educational Studies in Mathematics, 50(1), 29–47. https://doi.org/10.1023/A:1020572803437
  • Torbeyns, J., De Smedt, B., Ghesquière, P., & Verschaffel, L. (2009a). Acquisition and use of shortcut strategies by traditionally schooled children. Educational Studies in Mathematics, 71(1), 1–17. https://doi.org/10.1007/s10649-008-9155-z
  • Torbeyns, J., De Smedt, B., Ghesquière, P., & Verschaffel, L. (2009b). Jump or compensate? Strategy flexibility in the number domain up to 100. ZDM, 41(5), 581–590. https://doi.org/10.1007/s11858-009-0187-3
  • Torbeyns, J., Ghesquière, P., & Verschaffel, L. (2009). Efficiency and flexibility of indirect addition in the domain of multi-digit subtraction. Learning and Instruction, 19(1), 1–12. https://doi.org/10.1016/j.learninstruc.2007.12.002
  • Torbeyns, J., Peters, G., De Smedt, B., Ghesquière, P., & Verschaffel, L. (2018). Subtraction by addition strategy use in children of varying mathematical achievement level: A choice/no-choice study. Journal of Numerical Cognition, 4(1), 215–234. https://doi.org/10.5964/jnc.v4i1.77
  • Torbeyns, J., & Verschaffel, L. (2013). Efficient and flexible strategy use on multi-digit sums: A choice/no-choice study. Research in Mathematics Education, 15(2), 129–140. https://doi.org/10.1080/14794802.2013.797745
  • Torbeyns, J., & Verschaffel, L. (2016). Mental computation or standard algorithm? Children’s strategy choices on multi-digit subtractions. European Journal of Psychology of Education, 31(2), 99–116. https://doi.org/10.1007/s10212-015-0255-8
  • Torbeyns, J., Verschaffel, L., & Ghesquière, P. (2004). Strategic aspects of simple addition and subtraction: The influence of mathematical ability. Learning and Instruction, 14(2), 177–195. https://doi.org/10.1016/j.learninstruc.2004.01.003
  • Torbeyns, J., Verschaffel, L., & Ghesquière, P. (2006). The development of children's adaptive expertise in the number domain 20 to 100. Cognition and Instruction, 24(4), 439–465. https://doi.org/10.1207/s1532690xci2404_2
  • Van Der Auwera, S., Torbeyns, J., De Smedt, B., Verguts, G., & Verschaffel, L. (2022). The remarkably frequent, efficient, and adaptive use of the subtraction by addition strategy: A choice/no-choice study in fourth- to sixth-graders with varying mathematical achievement levels. Learning and Individual Differences, 93, 102107. https://doi.org/10.1016/j.lindif.2021.102107
  • Verschaffel, L., Luwel, K., Torbeyns, J., & Van Dooren, W. (2007). Desarrollo de destrezas adaptativas: ¿una meta factible y valiosa para la eduación (primaria) de las matematicas? Ciencias Psicológicas, 1(1), 27–35. https://doi.org/10.22235/cp.v0i1.571
  • Verschaffel, L., Luwel, K., Torbeyns, J., & Van Dooren, W. (2009). Conceptualizing, investigating, and enhancing adaptive expertise in elementary mathematics education. European Journal of Psychology of Education, 24(3), 335–359. https://doi.org/10.1007/BF03174765
  • Vickers, A. J. (2005). Parametric versus non-parametric statistics in the analysis of randomized trials with non-normally distributed data. BMC Medical Research Methodology, 5(1), 35. https://doi.org/10.1186/1471-2288-5-35
  • Xu, L., Liu, R. D., Star, J. R., Wang, J., Liu, Y., & Zhen, R. (2017). Measures of potential flexibility and practical flexibility in equation solving. Frontiers in Psychology, 8, 1–13. https://doi.org/10.3389/fpsyg.2017.01368