308
Views
0
CrossRef citations to date
0
Altmetric
Articles

N-Gram Approaches to the Historical Dynamics of Basic Vocabulary*

&

References

  • Abney, S., & Bird, S. (2010). The human language project: Building a universal corpus of the world’s languages. Proceedings of the 48th meeting of the ACL (pp. 88–97). ACL: Uppsala.
  • Borin, L. (2009). Linguistic diversity in the information society. In Proceedings of the SALTMIL 2009 Workshop on Information Retrieval and Information Extraction for Less Resourced Languages (pp. 1–7). Donostia: SALTMIL. Retrieved date, from http://spraakbanken.gu.se/personal/lars/pblctns/saltmil-2009.pdf
  • Borin, L. (2012). Core vocabulary: A useful but mystical concept in some kinds of linguistics. In: D. Santos, K. Lindén & W. Ng’ang’a (Eds), Shall We Play the Festschrift Game? Essays on the Occasion of Lauri Carlson’s 60th Birthday (pp. 53–65). Berlin: Springer.
  • Brown, C., Holman, E., Wichmann, S., & Velupillai, V. (2008). Automated classification of the world’s languages: A description of the method and preliminary results. Sprachtypologie und Universalienforschung, 61(4), 285–308.
  • Campbell, L., & Mixco, M. (2007). A Glossary of Historical Linguistics University of Utah Press.
  • Campbell, L., & Poser, W. J. (2008). Language Classification: History and method Cambridge: Cambridge University Press.
  • Cavnar, W. B., & Trenkle, J. M. (1994). N-gram-based text categorization. Proceedings of SDAIR-94, 3rd Annual Symposium on Document Analysis and Information Retrieval, 161–175.
  • Dolgopolsky, A. (1986). A probabilistic hypothesis concerning the oldest relationships among the language families of northern eurasia. In: V. Shevoroshkin & T. Markey (Eds), Typology, Relationship, and Time: A Collection of Papers on Language Change and Relationship by Soviet Linguists (pp. 27–50). Ann Arbor, MI: Karoma.
  • Dryer, M. S. (2011). Genealogical language list. In: M. S. Dryer & M. Haspelmath (Eds), The World Atlas of Language Structures Online Max Planck Digital Library: Munich. Retrieved date, from http://wals.info/supplement/4
  • Dunning, T. (1994). Statistical Identification of Language (Technical Report No. CRL MCCS-94-273). Computing Research Lab, New Mexico State University.
  • Hammarström, H. (2010). A full-scale test of the language farming dispersal hypothesis. Diachronica, 27(2), 197–213.
  • Haspelmath, M., Dryer, M. S., Gil, D., & Comrie, B. (2011). WALS Online Munich: Max Planck Digital Library. Retrieved date, from http://wals.info
  • Holman, E. W., Wichmann, S., Brown, C. H., Velupillai, V., Müller, A., & Bakker, D. (2008a). Advances in automated language classification. In: A. Arppe, K. Sinnemäki & U. Nikanne (Eds), Quantitative investigations in theoretical linguistics (pp. 40–43). Helsinki: University of Helsinki.
  • Holman, E. W., Wichmann, S., Brown, C. H., Velupillai, V., Müller, A., & Bakker, D. (2008b). Explorations in automated language classification. Folia Linguistica, 42(3–4), 331–354.
  • Huffman, S., & Mentor-Loritz, D. (1998). The Genetic Classification of Languages by n-gram Analysis: A computational technique Georgetown University.
  • Kessler, B. (2008). The mathematical assessment of long-range linguistic relationships. Language and Linguistics Compass, 2(5), 821–839.
  • Krauss, M. E. (1992). The world’s languages in crisis. Language, 68(1), 4–10.
  • Levenshtein, V. (1965). Dvoičnye kody’s ispravleniem vypadenij, vstavok i zameščenij simvolov [Binary codes with correction of deletions, insertions and reversals of symbols]. Doklady Akademii Nauk SSSR, 163(4), 845–848.
  • Lewis, M. P. (Ed.) (2009). Ethnologue: Languages of the World, 16th edition Dallas, TX: SIL International.
  • Lewis, M. P., Simons, G. F., & Fennig, C. D. (Eds). (2013). Ethnologue: Languages of the world, 17th edition Dallas, TX: SIL International. (online version: http://www.ethnologue.com/)
  • Maddieson, I., & Precoda, K. (1990). n.d. The UCLA Phonological Segment Inventory Database UK: Sage. Retrieved date, from http://web.phonetik.uni-frankfurt.de/upsid.html
  • Oswalt, R. (1971). Towards the construction of a standard lexicostatistic list. Anthropological Linguistics, 13(9), 421–434.
  • Pompei, S., Loreto, V., & Tria, F. (2011). On the accuracy of language trees. PloS one, 6(6), e20109.
  • Rama, T., & Singh, A.K. (2009). From bag of languages to family trees from noisy corpus. In Proceedings of the Conference on Recent Advances in Natural Language Processing. Borovets, Bulgaria.
  • Saitou, N., & Nei, M. (1987). The neighbor-joining method: a new method for reconstructing phylogenetic trees. Molecular Biology and Evolution, 4(4), 406–425.
  • Singh, A., & Surana, H. (2007). Can corpus based measures be used for comparative study of languages? Proceedings of 9th Meeting of the ACL Special Interest Group in Computational Morphology and Phonology, 40–47.
  • Starostin, S. A. (1991). Altajskaja problema i proisxoždenie japonskogo jazyka [The Altaic Problem and the Origin of the Japanese Language]. Moscow: Nauka Publishers.
  • Swadesh, M. (1948). The Time Value of Linguistic Diversity (paper presented at the Viking Fund Supper Conference for Anthropologists, 12th March 1948).
  • Swadesh, M. (1950). Salish internal relationships. International Journal of American Linguistics, 16(4), 157–167.
  • Swadesh, M. (1952). Lexico-statistic dating of prehistoric ethnic contacts: with special reference to North American Indians and Eskimos. Proceedings of the American Philosophical Society, 96(4), 452–463.
  • Swadesh, M. (1955). Towards greater accuracy in lexicostatistic dating. International Journal of American Linguistics, 21(2), 121–137.
  • Wichmann, S., Holman, E. W., Bakker, D., & Brown, C. H. (2010). Evaluating linguistic distance measures. Physica A: Statistical Mechanics and its Applications, 389, 3632–3639.
  • Wichmann, S., Holman, E. W., Rama, T., & Walker, R. S. (2011a). Correlates of reticulation in linguistic phylogenies. Language Dynamics and Change, 1(2), 205–240.
  • Wichmann, S., Müller, A., Velupillai, V., Wett, A., Brown, C.H., Molochieva, Z., et al. (2011b). The ASJP database version 14. (http://email.eva.mpg.de/~wichmann/listss14.zip)
  • Wichmann, S., Rama, T., & Holman, E. W. (2011c). Phonological diversity, word length, and population sizes across languages: The ASJP evidence. Linguistic Typology, 15, 177–198.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.