180
Views
3
CrossRef citations to date
0
Altmetric
Original Articles

Linguistic Phylogenetic Inference by PAM-like Matrices

&
Pages 95-120 | Published online: 13 Mar 2012

REFERENCES

  • Anttila , R. 1972 . An Introduction to Historical and Comparative Linguistics , New York : Macmillan Publishing Co., Inc. .
  • Atkinson , Q. D. and Gray , R. D. 2005 . Curious parallels and curious connections – phylogenetic thinking in biology and historical linguistics . Systematic Biology , 54 ( 4 ) : 513 – 526 .
  • Atkinson , Q. D. and Gray , R. D. 2006a . “ Are accurate dates an intractable problem for historical linguistics? ” . In Mapping our Ancestry: Phylogenetic Methods in Anthropology and Prehistory , Edited by: Lipo , C. P. , O'Brien , M. J. , Mailhammer , R. , Grant , A. and Holman , E. W. 269 – 296 . Chicago , Illinois : Aldine Press .
  • Atkinson , Q. D. and Gray , R. D. 2006b . “ How old is the Indo-European language family? Illumination or more moths to the flame? ” . In Phylogenetic methods and the prehistory of languages , Edited by: Forster , P. and Renfrew , C. 91 – 109 . Cambridge : McDonald Institute Press, University of Cambridge .
  • Atkinson , Q. D. , Nicholls , G. , Welch , D. and Gray , R. D. 2005 . From words to dates: water into wine, mathemagic or phylogenetic inference? . Transactions of the Philological Society , 103 ( 2 ) : 193 – 219 .
  • Bakker , D. , Müller , A. , Velupillai , V. , Wichmann , S. , Brown , C. H. , Brown , P. , Egorov , D. , Mailhammer , R. , Grant , A. and Holman , E. W. 2009 . Adding typology to lexicostatistics: a combined approach to language classification . Linguistic Typology , 13 ( 1 ) : 169 – 181 .
  • Barbançon, F., Warnow, T., & Evans, S. N. (2006). An experimental study comparing linguistic phylogenetic reconstruction. Proceedings of the Conference on Language and Genes, University of California, Santa Barbara, California, USA, pp. 45–55
  • Blanchard , P. , Petroni , F. , Serva , M. and Volchenkov , D. 2011 . Geometric representations of language taxonomies . Computer Speech and Language , 25 ( 3 ) : 679 – 699 .
  • Bodlaender , H. L. , Fellows , M. R. and Warnow , T. J. 1992 . “ Two strikes against perfect phylogeny ” . In Automata, Languages and Programming. Lecture Notes in Computer Science , Edited by: Kuich , W. 273 – 283 . Berlin : Springer Verlag . vol. 623
  • Brown , C. H. , Holman , E. W. , Wichmann , S. and Vilupillai , V. 2008 . Automated classification of the World's languages: A description of the method and preliminary results . STUF – Language Typology and Universals , 61 ( 4 ) : 285 – 308 .
  • Chor , B. and Tuller , T. 2006 . Finding a maximum likelihood tree is hard . Journal of the ACM (JACM) , 53 ( 5 ) : 722 – 744 .
  • Cysouw , M. and Jung , H. Cognate identification and alignment using practical orthographies . Proceedings of the 9th Meeting of the ACL Special Interest Group in Computational Morphology and Phonology . Prague, Czech Republic . pp. 109 – 116 .
  • Darwin , C. R. 1871 . The Descent of Man, and Selection in Relation to Sex , London : John Murray .
  • Dayhoff , M. O. and Eck , R. V. 1968 . A model of evolutionary change in proteins . Atlas of Protein Sequence and Structure 1967–1968 , 3 : 33 – 41 .
  • Dayhoff , M. O. , Eck , R. V. and Park , C. M. 1972 . A model of evolutionary change in proteins . Atlas of Protein Sequence and Structure , 5 : 89 – 99 .
  • Dayhoff , M. O. , Schwartz , R. M. and Orcutt , B. C. 1978 . A model of evolutionary change in proteins . Atlas of Protein Sequence and Structure , 5 ( 3 ) : 345 – 352 .
  • Delmestri , A. and Cristianini , N. 2010a . Robustness and statistical significance of PAM-like matrices for cognate identification . Journal of Communication and Computer , 7 ( 12 ) : 21 – 31 .
  • Delmestri , A. and Cristianini , N. 2010b . String similarity measures and PAM-like matrices for cognate identification . Bucharest Working Papers in Linguistics , XII ( 2 ) : 71 – 82 .
  • Diamond , J. M. and Bellwood , P. 2003 . Farmers and their languages: The first expansions . Science , 300 ( 5619 ) : 597 – 603 .
  • Downey , S. S. , Hallmark , B. , Cox , M. P. , Norquest , P. and Lansing , S. J. 2008 . Computational feature-sensitive reconstruction of language relationships: Developing the ALINE distance for comparative historical linguistic reconstruction . Journal of Quantitative Linguistics , 15 ( 4 ) : 340 – 369 .
  • Dyen , I. , Kruskal , J. B. and Black , P. 1992 . An Indoeuropean classification: A lexicostatistical experiment . Transactions of the American Philosophical Society , 82 (5) : 1 – 132 .
  • Ellison , M. T. and Kirby , S. Measuring language divergence by intra-lexical comparison . Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics . 2006 , Sydney , Australia. pp. 273 – 280 .
  • Embleton , S. M. 1986 . Statistics in Historical Linguistics, Series: Quantitative linguistics , Bochum : Studienverlag Brockmeyer . vol. 30
  • Evans , S. N. , Ringe , D. A. and Warnow , T. 2004 . “ Inference of divergence times as a statistical inverse problem ” . In Phylogenetic Methods and the Prehistory of Languages , Edited by: Forster , P. and Renfrew , C. 119 – 140 . Cambridge : McDonald Institute Press .
  • Felsenstein , J. 2004 . Inferring Phylogenies , Sunderland , MA : Sinauer Associates Inc. Publishers .
  • Feng , D. F. and Doolittle , R. F. 1997 . Converting amino acid alignment scores into measures of evolutionary time: a simulation study of various relationships . Journal of Molecular Evolution , 44 ( 4 ) : 361 – 370 .
  • Forster , P. and Toth , A. 2003 . Toward a phylogenetic chronology of ancient Gaulish, Celtic, and Indo-European . Proceedings of the National Academy of Sciences of the USA (PNAS) , 100 ( 15 ) : 9079 – 9084 .
  • Foulds , L. R. and Graham , R. L. 1982 . The Steiner problem in phylogeny is NP-complete . Advances in Applied Mathematics , 3 ( 1 ) : 43 – 49 .
  • Gamkrelidze , T. V. and Ivanov , V. V. 1995 . Indo-European and the Indo-Europeans: A Reconstruction and Historical Analysis of a Proto-Language and a Proto-Culture. Trends in Linguistics: Studies and Monographs , Edited by: Winter , W. Berlin : Mouton de Gruyter . vol. 80
  • Gotoh , O. 1982 . An improved algorithm for matching biological sequences . Journal of Molecular Biology , 162 ( 3 ) : 705 – 708 .
  • Gray , R. D. and Atkinson , Q. D. 2003 . Language-tree divergence times support the Anatolian theory of Indo-European origin . Nature , 426 : 435 – 439 .
  • Gray , R. D. and Jordan , F. M. 2000 . Language trees support the express-train sequence of Austronesian expansion . Nature , 405 : 1052 – 1055 .
  • Greenberg , J. H. 1957 . Essays in Linguistics , Chicago , Ill : University of Chicago Press .
  • Greenhill , S. J. , Atkinson , Q. D. , Meade , A. and Gray , R. D. 2010 . The shape and tempo of language evolution . Proceedings of the Royal Society, Series B: Biological Science , 277 : 2443 – 2450 .
  • Hastings , W. K. 1970 . Monte Carlo sampling methods using Markov Chains and their applications . Biometrika , 57 : 97 – 109 .
  • Holman , E. W. , Wichmann , S. , Brown , C. H. , Velupillai , V. , Müller , A. and Bakker , D. 2008 . Explorations in automated language classification . Folia Linguistica , 42 ( 2 ) : 331 – 354 .
  • Huelsenbeck , J. P. , Larget , B. , Miller , R. E. and Ronquist , F. 2002 . Potential applications and pitfalls of Bayesian inference of phylogeny . Systematic Biology , 51 ( 5 ) : 673 – 688 .
  • Kannan , S. and Warnow , T. A fast algorithm for the computation and enumeration of perfect phylogenies when the number of character states is fixed . Proceedings of the 6th Annual ACM-SIAM Symposium on Discrete Algorithms . San Francisco, California , USA. pp. 595 – 603 .
  • Kessler , B. 2001 . The Significance of Word Lists , Stanford , CA : CSLI Publications .
  • Kondrak , G. A new algorithm for the alignment of phonetic sequences . Proceedings of the 1st Meeting of the North American Chapter of the Association for Computational Linguistics (ANLP-NAACL 2000), vol. 4 . Seattle , Washington . pp. 288 – 295 .
  • Kondrak , G. 2009 . Identification of cognates and recurrent sound correspondences in word lists . Traitement automatique des langues , 50 ( 2 ) : 201 – 235 .
  • Levenshtein , V. I. 1966 . Binary codes capable of correcting deletions, insertions and reversals . Soviet Physics Doklady , 10 ( 8 ) : 707 – 710 .
  • Lewis, M. P. (Ed.) (2009). Ethnologue: Languages of the World. 16th edn. Dallas: SIL International
  • Matlab. Analyzing the Origin of the Human Immunodeficiency Virus. Retrieved August 4, 2010, from http://www.mathworks.com/computational-biology/demos.html?file=/products/demos/shipping/bioinfo/hivdemo.html .
  • Nakhleh , L. , Warnow , T. , Ringe , D. A. and Evans , S. N. 2005 . A comparison of phylogenetic reconstruction methods on an Indo-European dataset . Transactions of the Philological Society , 103 ( 2 ) : 171 – 192 .
  • Needleman , S. B. and Wunsch , C. D. 1970 . A general method applicable to the search for similarities in the amino acid sequence of two proteins . Journal of Molecular Biology , 48 ( 3 ) : 443 – 453 .
  • Nicholls , G. K. and Gray , R. D. 2008 . Dated ancestral trees from binary trait data and their application to the diversification of languages . Journal of the Royal Statistical Society, Series B: Statistical Methodology , 70 ( 3 ) : 545 – 566 .
  • Nicholls , G. K. and Gray , R. D. 2006 . “ Quantifying uncertainty in a stochastic model of vocabulary evolution ” . In Phylogenetic Methods and the Prehistory of Languages , Edited by: Forster , P. and Renfrew , C. 161 – 171 . Cambridge : McDonald Institute Press .
  • Nichols , J. and Warnow , T. 2008 . Tutorial on computational linguistic phylogeny . Language and Linguistics Compass , 2 ( 5 ) : 760 – 820 .
  • Petroni , F. and Serva , M. 2008 . Language distance and tree reconstruction . Journal of Statistical Mechanics: Theory and Experiment , P08012 : 1 – 15 .
  • Rexová , K. , Frynta , D. and Zrzavý , J. 2003 . Cladistic analysis of languages: Indo-European classification based on lexicostatistical data . Cladistics , 19 : 120 – 127 .
  • Ringe , D. , Warnow , T. and Taylor , A. 2002 . Indo-European and computational cladistics . Transactions of the Philological Society , 100 ( 1 ) : 59 – 129 .
  • Ruhlen , M. 1994 . The Origin of Language , New York : John Wiley & Sons Inc .
  • Ryder , R. J. and Nicholls , G. K. 2011 . Missing data in a stochastic Dollo model for cognate data, and its application to the dating of Proto-Indo-European . Journal of the Royal Statistical Society, Series C: Applied Statistics , 60 ( 1 ) : 71 – 92 .
  • Saitou , N. and Nei , M. 1987 . The neighbor-joining method: a new method for reconstructing phylogenetic trees . Molecular Biology and Evolution , 4 ( 4 ) : 406 – 425 .
  • Serva , M. and Petroni , F. 2008 . Indo-European languages tree by Levenshtein distance . EPL (Europhysics Letters) , 81 : 68005 – p1:p5 .
  • Smith , T. F. and Waterman , M. S. 1981 . Identification of common molecular subsequences . Journal of Molecular Biology , 147 ( 1 ) : 195 – 197 .
  • Sokal , R. R. and Michener , C. D. 1958 . A statistical method for evaluating systematic relationships . University of Kansas Science Bulletin , 38 : 1409 – 1438 .
  • Steels , L. 2004 . “ Analogies between genome and language evolution ” . In Artificial Life IX: Proceedings of the 9th International Conference on the Simulation and Synthesis of Living Systems , Edited by: Pollack , J. B. 2002 – 2007 . Cambridge , MA : The MIT Press .
  • Studier , J. A. and Keppler , K. J. 1988 . A note on the neighbor-joining algorithm of Saitou and Neil . Journal of Molecular Biology and Evolution , 5 ( 6 ) : 729 – 731 .
  • Swadesh , M. 1952 . Lexico-statistic dating of prehistoric ethnic contacts . Proceedings of the American Philosophical Society , 96 ( 4 ) : 452 – 463 .
  • Swadesh , M. 1955 . Towards greater accuracy in lexicostatistics dating . International Journal of American Linguistics , 21 ( 2 ) : 121 – 137 .
  • Turchi , M. and Cristianini , N. A statistical analysis of language evolution . The Evolution of Language: Proceedings of the 6th Internationl Conference (EVOLANG6) . 2006 , Rome , Italy. pp. 348 – 355 .
  • Wang , W.S.-Y. and Minett , J. W. 2005 . Vertical and horizontal transmission in language evolution . Transactions of the Philological Society , 103 ( 2 ) : 121 – 146 .
  • Wichmann , S. and Saunders , A. 2007 . How to use typological databases in historical linguistic research . Diachronica , 24 ( 2 ) : 373 – 404 .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.