Views

CrossRef citations to date

Altmetric

Article Commentary

The Relationship between Word Difficulty and Frequency: A Response to Hashimoto (2021)

Jeffrey Stewarta Tokyo University of Science, Tokyo, JapanCorrespondence[email protected]

https://orcid.org/0000-0002-3350-3160

Joseph P. Vittab Kyushu University, Fukuoka, Japan

https://orcid.org/0000-0002-5711-969X

Christopher Nicklinc Rikkyo University, Tokyo, Japan

https://orcid.org/0000-0002-8945-0678

Stuart McLeand Momoyamagakuin University, Osaka, Japan

https://orcid.org/0000-0002-7035-378X

Geoffrey G. Pinchbecke Carleton University, Ottawa, Canada

https://orcid.org/0000-0002-3424-2214

Brandon Kramerf Kwansei Gakuin University, Nishinomiya, Japan

https://orcid.org/0000-0003-3910-0810

ABSTRACT

Hashimoto (2021) reported a correlation of −.50 (r² = .25) between word frequency rank and difficulty, concluding the construct of modern vocabulary size tests is questionable. In this response we show that the relationship between frequency and difficulty is clear albeit non-linear and demonstrate that if a wider range of frequencies is tested and log transformations are applied, the correlation can approach .80. Finally, while we acknowledge the great promise of knowledge-based word lists, we note that a strong correlation between difficulty and frequency is not, in fact, the primary reason size tests are organized by frequency.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

¹ As with Hashimoto (Citation2021) and as suggested by Plonsky (Citation2013), r² is used as the effect size of consequence as it describes the amount of variance shared between the variables (i.e., the strength of association).

² Meaning recall responses were also available for the data set used in this paper. The correlation to COCA rank was −0.542 compared to −0.533 for Yes/No responses, and the difference was statistically insignificant (Steiger’s z = 0.245, p > 0.8). While it is possible a larger set of items could establish a significant difference, this result suggests that relative to correlations of two proficiency levels for the same learner, test item format does not make as large a difference with correlations to frequency data.

³ A sensitivity power analysis assuming a 5% alpha and 20% beta threshold revealed that our sample size was powered to detect a minimal effect size of r = .32 (r² = .10) which is reasonable vis-a-vis Hashimoto’s value of most interest (r = .50).

⁴ The data was created by Parr (n. d.) and was retrieved from https://codepen.io/adrianparr/pen/jwmjmv?js-preprocessor=babel.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

The Relationship between Word Difficulty and Frequency: A Response to Hashimoto (2021)

Information for

Open access

Opportunities

Help and information

The Relationship between Word Difficulty and Frequency: A Response to Hashimoto (2021)

ABSTRACT

Disclosure statement

Notes

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature