Review Article

Too Noisy at the Bottom: Why Gries’ (2008, 2020) Dispersion Measures Cannot Identify Unbiased Distributions of Words

References

  • Aggarwal, C. C., Hinneburg, A., & Keim, D. A. (2001). On the surprising behavior of distance metrics in high dimensional space. In International conference on database theory (pp. 420–434). Springer, Berlin, Heidelberg.
  • Bellman, R., & Dreyfus, S. (1962). Applied dynamic programming. Princeton University Press.
  • Biber, D., Reppen, R., Schnur, E., & Ghanem, R. (2016). On the (non) utility of Juilland’s D to measure lexical dispersion in large corpora. International Journal of Corpus Linguistics, 21(4), 439–464. https://doi.org/10.1075/ijcl.21.4.01bib
  • Casenhiser, D., & Goldberg, A. E. (2005). Fast mapping between a phrasal form and meaning. Developmental Science, 8(6), 500–508. https://doi.org/10.1111/j.1467-7687.2005.00441.x
  • Gries, S. T. (2008). Dispersions and adjusted frequencies in corpora. International Journal of Corpus Linguistics, 13(4), 403–437. https://doi.org/10.1075/ijcl.13.4.02gri
  • Gries, S. T. (2010). Dispersions and adjusted frequencies in corpora: Further explorations. In S. T. Gries, S. Wulff, & M. Davies (Eds.), Corpus-linguistic applications (pp. 197–212). Brill.
  • Gries, S. T. (2015). Some current quantitative problems in corpus linguistics and a sketch of some solutions. Language and Linguistics, 16(1), 93–117. https://doi.org/10.1177/1606822X14556606
  • Gries, S. T. (2022). Toward more careful corpus statistics: Uncertainty estimates for frequencies, dispersions, association measures, and more. Research Methods in Applied Linguistics, 1(1), 100002. https://doi.org/10.1016/j.rmal.2021.100002
  • Gries, S. T., & Paquot, M. (2021). Analyzing dispersion. In A practical handbook of corpus linguistics (pp. 99–118). Springer International Publishing.
  • Hoaglin, D. C. (1980). A Poissonness plot. The American Statistician, 34(3), 146–149. https://doi.org/10.1080/00031305.1980.10483020
  • Ishikawa, S. (2013). The ICNALE and sophisticated contrastive interlanguage analysis of Asian learners of English. Learner Corpus Studies in Asia and the World, 1(1), 91–118.
  • Korn, F., Pagel, B. U., & Faloutsos, C. (2001). On the “dimensionality curse” and the “self-similarity blessing”. IEEE Transactions on Knowledge and Data Engineering, 13(1), 96–111. https://doi.org/10.1109/69.908983
  • Kullback, S., & Leibler, R. (1951). On information and sufficiency. Annals of Mathematical Statistics, 22(1), 79–86. https://doi.org/10.1214/aoms/1177729694
  • Lee, J. A., & Verleysen, M. (2007). Nonlinear dimensionality reduction (Vol. 1). Springer.
  • Lijffijt, J., & Gries, S. T. (2008). Correction to Stefan Th. Gries’ “Dispersions and adjusted frequencies in corpora”. International Journal of Corpus Linguistics, 13(4), 403–437.
  • Mosteller, F., & Wallace, D. L. (1963). Inference in an authorship problem. Journal of the American Statistical Association, 58(302), 275–309. https://doi.org/10.1080/01621459.1963.10500849
  • Zhang, H., Han, Y., Zhang, X., & Cui, L. (2022). Frequency, dispersion and abstractness in the lexical sophistication analysis of a learner-based word bank: Dimensionality reduction and identification. Journal of Quantitative Linguistics, 29(2), 195–211. https://doi.org/10.1080/09296174.2020.1782716
