Abstract
Frequency of occurrence of words in a language is well described by Zipf's law. However, Zipf's law does not describe the distribution of the letters or phonemes from which words are composed. Examination of 32 text corpora from 18 languages shows that both letter and phoneme frequencies are well described by an equation first developed by Yule and by a parameter‐free equation that also describes the distribution of DNA codons.
Notes
Address correspondence to: Colin Martindale, Department of Psychology, University of Maine, Orono, ME 04469, U.S.A. Tel.: 207–581–2054. Fax: 207–581–6128. E‐mail: [email protected]