Word-length Entropies and Correlations of Natural Language Written Texts
