Improving Labbé’s Intertextual Distance: Testing a Revised Version on a Large Corpus of Italian Literature

Pages 125-152 | Published online: 22 Apr 2013
