546
Views
17
CrossRef citations to date
0
Altmetric
Original Articles

Fitting Ranked English and Spanish Letter Frequency Distribution in US and Mexican Presidential Speeches

&
Pages 359-380 | Published online: 17 Nov 2011
 

Abstract

The limited range in the abscissa of ranked letter frequency distributions causes multiple functions to fit the observed distribution reasonably well. In order to critically compare various functions, we apply the statistical model selections on ten functions, using the texts of US and Mexican presidential speeches of the last few centuries. Despite minor switching of ranking order of certain letters during the temporal evolution for both datasets, the letter usage is generally stable. The best fitting function, judged by either least-square-error or by AIC/BIC model selection, is the Cocho/Beta function. We also use a novel method to discover clusters of letters by their observed-over-expected frequency ratios.

Acknowledgements

We would like to thank Osman Tuna Gökgöz for introducing us to the work by Al-Kindi. This work was partially supported by UNAM-PAPIIT project IN115908. The authors wish to thank the hospitality of the Centro de Investigación en Matemáticas Aplicadas, Pachuca, México, where the draft was finalized.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 394.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.