381
Views
2
CrossRef citations to date
0
Altmetric
Articles

The Probability Distribution of Textual Vocabulary in the English Language

, &

References

  • Altmann, G. (1980). Prolegomena to Menzerath’s law. In: R. Grotjahn (Ed), Glottometrika, 2 (pp. 1–10). Bochum: Brockmeyer.
  • Baayen, H. (1996). The effects of lexical specialization on the growth curve of the vocabulary. Computational Linguistics, 22(4), 455–480.
  • Baayen, H. (2001). Word Frequency Distributions. Dordrecht: Kluwer Academic Publishers.10.1007/978-94-010-0844-0
  • Brunet, E. (1988). Une mesure de la distance intertextuelle: la connexion lexicale. Le nombre et le texte [A measure of intertextual distance: lexical connection. Number of texts]. Revue informatue et statistique dans les sciences humaines. Universite de Liège.
  • Devore, J. (2000). Probability and Statistics. Pacific Grove: Brooks/Cole.
  • Fan, F. (2006a). Models for dynamic inter-textual type-token relationship. Glottometrics, 12, 1–10.
  • Fan, F. (2006b). A corpus-based empirical study on inter-textual vocabulary growth. Journal of Quantitative Linguistics, 13(1), 111–127.
  • Fan, F. (2010). An asymptotic model for the English hapax/vocabulary ratio. Computational Linguistics, 36(4), 631–637.
  • Fan, F. (2013). Text length, vocabulary size and text coverage constancy. Journal of Quantitative Linguistics, 20(4), 288–300.
  • Heaps, S. (1978). Information Retrieval: Computational and Theoretical Aspects New York, NY: Academic Press.
  • Herdan, G. (1964). Quantitative Linguistics London: Buttersworths.
  • Jurafsky, D., & Martin, J. (2009). Speech and Language Processing, An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Upper Saddle River: Prentice-Hall.
  • Kelih, E. (2012). On the dependency of word length on text length. Empirical results from Russian and Bulgarian parallel texts. In S. Naumann, P. Grzybek, R. Vulanović & G. Altmann (Eds.), Synergetic Linguistics. Text and Language as Dynamic systerms (pp. 67–80). Wien: Praesens Verlag.
  • Kennedy, G. (1998). An Introduction to Corpus Linguistics. London: Addison Wesley.
  • Köhler, R. (2012). Quantitative Syntax Analysis. Berlin/Boston: Walter de Gruyter.10.1515/9783110272925
  • Köhler, R., & Martináková-Rendeková, Z. (1998). A systems theoretical approach to language and music. In G. Altmann & W. A. Koch (Eds.), Systems. New Paradigms for the Human Sciences (pp. 514–546). Berlin: Walter de Gruyter.10.1515/9783110801194
  • Kornai, A. (2002). How many words are there? Glottometrics, 4, 61–86.
  • Labbé, C., & Labbé, D. (2001). Inter-textual distance and authorship attribution Corneille and Molière. Journal of Quantitative Linguistics, 8(3), 213–231.10.1076/jqul.8.3.213.4100
  • Laufer, B., & Ravenhorst-Kalovski, G. (2010). Lexical threshold revisited: Lexical text coverage, learners’ vocabulary size and reading comprehension. Reading in a Foreign Language, 22, 15–30.
  • Liu, N., & Nation, P. (1985). Factors affecting guessing vocabulary in context. RELC Journal, 16, 3–42.
  • Manning, C., & Schütze, H. (2001). Foundations of Statistical Natural Language Processing. Cambridge, MA: The MIT Press.
  • Nation, P., & Waring, R. (1997). Vocabulary size, text coverage and word lists. In: N. Schmitt & M. McCarthy (Eds), Vocabulary: Description, Acquisition and Pedagogy (pp. 6–19). Cambridge: Cambridge Univ. Press.
  • Orlov, K. (1982). Ein Modell der Häufigkeitsstruktur des Vokabulars [A Model of vocabulary frequency structure]. In: H. Guiter & M. Arapov (Eds), Studies on Zipf’s law (pp. 154–233). Bochum: Brockmeyer.
  • Popescu, I. (2009). Word Frequency Studies Berlin: Walter de Gruyter.
  • Schmitt, N., Jiang, X., & Grabe, W. (2011). The percentage of words known in a text and reading comprehension. The Modern Language Journal, 95, 26–43.10.1111/j.1540-4781.2011.01146.x
  • Tuldava, J. (1995). Methods in Quantitative Linguistics. Trier: Wissenschaftlicher Verlag Trier.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.