Search in:

Cogent Education Volume 7, 2020 - Issue 1

Submit an article Journal homepage

Open access

6,304

Views

CrossRef citations to date

Altmetric

Listen

EDUCATIONAL ASSESSMENT & EVALUATION

An examination of IELTS candidates’ performances at different band scores of the speaking test: A quantitative and qualitative analysis

Laleh Dashti1 Department of Foreign Languages and Linguistics, Shiraz University, Shiraz, IranView further author information

Seyyed Ayatollah Razmjoo1 Department of Foreign Languages and Linguistics, Shiraz University, Shiraz, IranCorrespondence[email protected]
View further author information

Sammy King Fai Hui2 The Education University of Hong Kong, Hong KongView further author information

(Reviewing editor)

Article: 1770936 | Received 01 Feb 2020, Accepted 08 May 2020, Published online: 05 Jun 2020

Cite this article
https://doi.org/10.1080/2331186X.2020.1770936
CrossMark

In this article

Abstract
1. Introduction
2. Method
3. Results
4. Discussion
5. Conclusion and implications
Additional information
References
Appendixes

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

Abstract

The purpose of this mixed-methods study was to explore Iranian IELTS candidates’ strengths and weaknesses in IELTS Speaking Test in terms of IELTS’s four speaking assessment criteria, namely Fluency and Coherence (FlC), Lexical Resource (LR), Grammar Range and Accuracy (GRA), and Pronunciation (Pro). It also aimed to examine the discourse features of the candidates’ performances in part 2 of IELTS Speaking across bands 5, 6, and 7. To this end, the oral performances of 59 IELTS candidates from a series of Mock IELTS Tests were collected, re-scored, and subjected to statistical investigation. Additionally, to better understand the performances, we conducted a content analysis on part 2 of the Test. The results of our regression analysis showed that FlC was the greatest predictor of success in IELTS Speaking, followed by GRA and LR, respectively while Pro was found to make the least unique contribution. Furthermore, our content analysis coupled with the application of the Monte Carlo test revealed that as the total scores moved up from 5 to 7, the rate of occurrence of the uncovered faults often declined substantially. It also showed that the association between the frequency of unearthed grammatical complications and the scores 5, 6, and 7 was strong whereas those of the remaining criteria were moderate. Moreover, the most salient obstacle found in the area of FlC, LR, GRA, and Pro, was incorrect connectors and conjunctions, inappropriate word choices, inaccurate simple sentences, and mispronunciations, respectively. The study holds clear implications for IELTS trainers, language teachers, and material developers.

Keywords:

IELTS
IELTS Speaking
speaking assessment criteria
IELTS Test

PUBLIC INTEREST STATEMENT

What makes an IELTS candidate at band 5 to achieve band 5 in IELTS Speaking? We can ask the same question about bands 6 and 7. A band score is the function of a candidate’s linguistic strengths and weaknesses. These areas are what we have explored in this study.

We found out that a candidate’s fluency and coherence more significantly contribute to a person’s total score than vocabulary or grammar does whereas pronunciation was the weakest point for the participants, originating from an EFL context. Moreover, the most salient obstacle found in the area of Fluency and Coherence was the alarming number of incorrect cohesive devices. In the Lexical Resource domain, candidates made an excessive number of inappropriate word choices. As for Grammar Range and Accuracy, the most conspicuous problem was simply forming simple sentences. Finally, mispronunciation was the error that severely influenced many candidate’s scores.

1. Introduction

IELTS Speaking Test comprises an interaction between a candidate and an examiner, which should take about 11–14 minutes. The test has three main parts, all of which concern the quantitative aspect of the current study and the second of which is the focus of the qualitative phase. Part 1 focuses on general questions about the candidate on various familiar topics such as family, friends, or hometown. Part 2 of the test has its function: an interaction pattern, a task required, and delivered performance. The candidate is given a task card containing a prompt and is asked to talk about a given topic for one to two minutes. Before the talk, the candidate will be given one minute to prepare (IELTS, Test format, Citation2019b). Part 3 regards questions related to the topic in part 2, requiring discussions on more abstract ideas.

In Part 2, while the candidate is speaking, the examiner observes the performance without causing any interruptions. A detailed performance descriptor (Appendix A) has been developed by IELTS which delineates a nine-band spoken performance assessment system based on four criteria defined and detailed out by Seedhouse et al. (Citation2014):

Fluency and Coherence refers to the ability to talk with normal levels of continuity, rate and effort and to link ideas and language together to form coherent, connected speech. The key indicators of fluency are speech rate and speech continuity. For coherence, the key indicators are logical sequencing of sentences, clear marking of stages in a discussion, narration or argument, and the use of cohesive devices (e.g., connectors, pronouns and conjunctions) within and between ‘sentences’.

Lexical Resource refers to the range of vocabulary the candidate can use and the precision with which meanings and attitudes can be expressed. The key indicators are the variety of words used, the adequacy and appropriacy of the words used and the ability to circumlocute (get round a vocabulary gap by using other words) with or without noticeable hesitation.

Grammatical Range and Accuracy refers to the range and the accurate and appropriate use of the candidate’s grammatical resource. The key indicators of grammatical range are the length and complexity of the spoken sentences, the appropriate use of subordinate clauses, and variety of sentence structures, and the ability to move elements around for information focus. The key

indicators of grammatical accuracy are the number of grammatical errors in a given amount of speech and the communicative effect of error.

Pronunciation refers to the capacity to produce comprehensible speech in fulfilling the speaking test requirements. The key indicators will be the amount of strain caused to the listener, the amount of unintelligible speech and the noticeability of L1 influence. (p. 5)

The research focus is, on the one hand, on performance scores, and on the other hand, the performance features of candidates only in part 2 of the IELTS Speaking Test. The reason for this selection is the restrictive challenges of the qualitative data analysis coupled with limited time and budget. The overall aim of this study is to unearth the strengths and weaknesses of spoken performances quantitatively and qualitatively in relation to the four speaking performance criteria, explicated in the public version of IELTS speaking band descriptor (IELTS, Citation2019a) at three bands of 5, 6, and 7. To this end, there are three research questions:

(1) What are the strengths and weaknesses of Iranian candidates’ performances in the IELTS Speaking Test based on the IELTS’s four speaking assessment criteria of Fluency and Coherence (FlC), Lexical Resource (LR), Grammatical Range and Accuracy (GRA), and Pronunciation (Pro)?

In order to answer this question, quantitative measures will be employed to discover the relative weight of each of the criteria in determining the overall scores.

(2) What key factors in FlC, LR, GRA, and Pro Criteria differentiate bands 5, 6, and 7?

Questions 2 is answered by analyzing the spoken data inductively, employing qualitative content analysis to transcripts of part 2 of the speaking tests.

(3) Is there a meaningful association between the discovered key factors and scores assigned?

Answering question 3 requires deploying a Monte Carlo test, an extension of the Chi-square test.

Informed of the discoveries of this study, IELTS trainers will be able to enhance their teaching practices by benefiting from candidates’ most salient strengths and focusing on their most probable weaknesses. Likewise, the results of the current study redound to many IELTS trainers and self-study prone candidates in that it may raise their awareness of what areas of spoken performance require greater attention, hence causing a higher likelihood of maximizing IELTS scores and overall success in the test. The findings of this study may also help material developers have a more realistic view of Iranian candidates’ speaking proficiency.

1.1. Literature review

1.1.1. Assessing speaking

Howarth (Citation2001) considered speaking as the process of communicating opinions, ideas, information, or emotions, hence the importance of speaking assessment. Assessing speaking refers to evaluating one’s capacity of producing oral language (Fulcher, Citation2003), and it is considered an indispensable component of large scale, small scale, and classroom-based assessment (Bachman, Citation1990).

According to Derakhshan and Nadi Khalili (Citation2016), speaking skill comprises two main categories: accuracy and fluency. The former is considered as the correct use of language components namely, grammar, vocabulary, and pronunciation in speaking, while the latter is “the ability to keep going when speaking spontaneously” (Gower et al., Citation1995). As Hedge (Citation2000) showed, fluency is a learner’ ability to speak coherently by linking words and sentences, utilizing stress and intonation, and pronouncing the sounds in a proper way. Thornbury (Citation2005) referred to the accuracy of vocabulary as the employment of suitable words in fitting contexts.

Fulcher (Citation2003) believed that although both speaking and writing are thought of as the productive skills, speaking is more than mere production. It involves the verbal skill as well. Furthermore, according to Fulcher (Citation2003), the linguistic features observed in speaking are different from those observed in writing.

There existed different approaches to assessing speaking; however, the recent approaches to assess speaking might address the abilities to get messages communicated (Bachman, Citation1990). Speaking performance is complex and assessing this skill becomes complicated as many variables come to play. For example, test takers’ characteristics, features of the speaking test, raters, and rubric descriptors might affect a test taker’s speaking score (Seong, Citation2014; Qian, Citation2007). That said, in addition to the linguistic knowledge (e.g., pronunciation, vocabulary, stress patterns, and rhythm), the strategies and ways of using this knowledge might introduce some other variables in an effective and successful speaking test (Fulcher, Citation2003).

1.2. Assessment in IELTS Speaking

The approaches of speaking assessment and the notion of L2 speaking ability have evolved and broadened dramatically over the past few decades (Purpura, Citation2016). IELTS speaking assessment system is a classical example of direct tests, a classification proposed by Clark (Citation1979), evaluating speaking skills and abilities in actual performance. Direct methods are defined as “procedures in which the examinee is asked to engage in face-to-face communicative exchanges with one or more human interlocutors” (Clark, Citation1979, p. 36). Direct tests have the advantage of their elicitation of speaking skills in a manner that duplicates “the setting and operation of the real-life situations in which proficiency is normally demonstrated” (Shohamy, Citation1994, p. 100); that is, direct assessments of speaking abilities presents substantial face validity.

1.3. Studies on IELTS Speaking Test

There are a plethora of studies on the IELTS Speaking Test, investigating it from various perspectives or aspects using different research designs.

Iwashita and Vasquez (Citation2019) studied how the distinctive features of discourse competence performance correlate to the IELTS speaking band descriptor. They undertook a detailed quantitative and qualitative examination of test-takers’ oral discourse at three proficiency levels. The features of discourse competence analyzed in the study included both cohesive devices and coherence devices. The analysis revealed that some features of discourse were more distinctively observed in the higher-level test-takers’ performance than the lower level test-takers, but other features (e.g., ellipsis and substitution) were not clearly distinguished across the levels.

Roothooft and Breeze (Citation2019) designed a study to enhance our understanding of the differences between band scores 4, 5, 6, 7, and 8 in terms of grammatical structures and morphemes as well as the error types and rates across the designated scores. The findings contributed to our understanding of the order in which certain grammatical structures are acquired in second language acquisition. The results also showed that higher score candidates attempted more complex structures despite a considerable rate of unsuccessful instances.

Elder and Wigglesworth (Citation2006) explored planning, proficiency, and task—three aspects of the IELTS Speaking Test, seeking to find out whether the three variables interact. The concentration of the study was on planning time, aiming to differentiate the oral performances of three groups of candidates, given no time, one minute, and two minutes to plan before they attempted the task. Neither the quantitative nor the qualitative analysis reported any significant differences between the performances of the groups, suggesting the one-minute planning time incorporated in IELTS Speaking is not likely to positively assist take-takers, yet it should remain part of the test for the sake of fairness and face validity.

Read and Nation (Citation2006), aimed to explore vocabulary use by candidates in the current version of the IELTS Speaking Test, in which Lexical Resource is one of the four criteria applied by examiners to rate candidate performance. The results of the study showed a pattern of decrease from band 8 to band 4, but there was a considerable variance within bands, suggesting that the lexical statistics did not suggest a reliable basis for differentiating oral proficiency levels. Additionally, the findings showed that the sophistication in vocabulary use of high-proficiency candidates was characterized by the fluent use of various formulaic expressions, often composed of high-frequency words, perhaps more so than any noticeable amount of low-frequency words in their speech.

Seedhouse et al. (Citation2014) studied the features of candidate discourse in relation to the scores given to them. The quantitative measures showed that accuracy does increase in direct relation to the given scores. Grammatical range and complexity of language were the lowest for band 5; however, surprisingly band 7 holders scored higher in this regard in comparison with band 8 candidates. The measure of fluency (pause length per 100 words) showed important differences between band scores 5 and 8. In addition, the qualitative analysis did not determine any single speaking feature that made a distinction between the band scores but suggested that in any given IELTS Speaking Test, a group of assessable speaking features can be seen to lead toward a given score.

2. Method

2.1. Design

The current study employed a mixed-methods approach to address the objectives. The quantitative data was gathered by scoring the oral performances of the subjects. The scoring was based on IELTS Speaking Descriptor (Appendix A). The data was analyzed utilizing descriptive and inferential statistics. Also, a qualitative content analysis was conducted for the further and deeper elaboration of the discourse features of performances in part 2 of the Test across bands 5, 6, and 7.

2.2. Instrument

This study will deploy two different instruments. The instrument for the quantitative phase of the study was a collection of IELTS Speaking Tests, a sample of which can be found in Appendix B. These tests were randomly selected from a test bank consisting of IELTS Speaking Tests amassed from a wide range of textbooks written on preparations for the test of IELTS. Each test entailed three parts as in a real test and was administered in its full form to the potential IELTS candidates in a series of mock IELTS tests.

Furthermore, upon scoring the speaking performances, the recorded files were transcribed for further qualitative analysis. Therefore, the second instrument used in the study was qualitative content analysis, with “an emergent framework” (Ary, Jacobs & Irvine, Citation2019) which assumes no variables a priori and is inductive.

2.3. Participants

This research study makes use of criterion sampling. 59 participants were chosen out of a total of 72 IELTS candidates sitting three Mock IELTS tests in Shiraz University Language Center. The reason only 59 were selected was that their marks in IELTS speaking met the criteria set for this research study, namely band-scores 5, 6, and 7.

The subjects were native speakers of Farsi from both genders, in varied age groups, with diverse educational levels, and (based on the information found in their registration forms) of upper-intermediate or higher language proficiency as it is typical of IELTS candidates.

It is noteworthy that these applicants were familiar with the procedures of the IELTS Speaking Test since they had attended IELTS preparation classes, hence being properly motivated. In addition, since the participants were typical candidates of IELTS, the sample was a fair representation of the actual population. One final note is that each candidate’s identity was transformed into code, comprised of M/F for gender, first name initial, last name initial, and a number. Table shows the demographic characteristics of the participants.

An examination of IELTS candidates’ performances at different band scores of the speaking test: A quantitative and qualitative analysis

Abstract

PUBLIC INTEREST STATEMENT

1. Introduction

1.1. Literature review

1.1.1. Assessing speaking

1.2. Assessment in IELTS Speaking

1.3. Studies on IELTS Speaking Test

2. Method

2.1. Design

2.2. Instrument

2.3. Participants

Table 1. Mock IELTS Test candidates’ demographics

2.4. Data collection procedure

2.5. Data analysis procedure

2.6. Inter and intra-rater reliabilities

Table 2. Intra-rater correlations—First rater

Table 3. Intra-rater correlations

3. Results

3.1. The first research question

Table 4. Descriptive statistics of measured variables

Table 5. Descriptive statistics of tests of normality

Table 6. Correlation between the Variables

Table 7. Residuals statistics

Table 8. ANOVA Test

Table 9. Model summary

Table 10. Coefficients in multiple regression analysis

3.2. The second research question

3.3. Fluency and coherence

Table 11. Frequency of key factors in Coherence

3.4. Lexical resource

Table 12. Frequency of key factors in Lexical Resource

3.5. Grammar range and accuracy

Table 13. Frequency of key factors in Grammar Range and Accuracy

3.6. Pronunciation

Table 14. Frequency of key factors in Pronunciation

3.7. The third research question

3.8. Fluency and coherence criterion

Table 15. Monte Carlo test to compare the band scores 5, 6, and 7 in terms of the frequency of Coherence error types

3.9. Lexical resource criterion

Table 16. Monte Carlo test to compare the band scores 5, 6, and 7 in terms of the frequency of error types in Lexical Resource

3.10. Grammar range and accuracy criterion

Table 17. Monte Carlo test to compare the band scores 5, 6, and 7 in terms of the frequency of error types in Grammar

3.11. Pronunciation criterion

Table 18. Chi-Square test to compare the band scores 5, 6, and 7 in terms of the frequency of error types in Pronunciation

4. Discussion

4.1. The quantitative aspect

4.2. The qualitative aspect

5. Conclusion and implications

Additional information

Funding

Notes on contributors

Laleh Dashti

Seyyed Ayatollah Razmjoo

References

Appendix A

Appendix B

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date