Research Article

Towards an inclusive system for the annotation of (dis)fluency in typical and atypical speech

Pages 381-398 | Received 26 Feb 2022, Accepted 14 Sep 2022, Published online: 07 Oct 2022
 

ABSTRACT

This paper presents an operational annotation system for (dis)fluencies in typical and atypical speech, based on standard annotation schemes established in the literature. Grounded in a functional approach to (dis)fluency, we address some of the conceptual and technical limitations of previous annotation models and offer an integrated and inclusive system that is compatible with multi-layered annotation software such as Praat or ELAN. Our aim is twofold: to create comparable annotated corpora of both typical and atypical speech, and to provide natural language processing and the health sector with applications for the diagnosis and therapy of speech disorders.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

1 The authors of this paper are ordered alphabetically. All authors contributed equally to all stages of the research project.

2 Throughout this paper, we use the terms ‘typical’ and ‘atypical’ to refer to disfluencies that are common in all speakers vs. those that occur primarily in speakers presenting some speech disorder, respectively. By extension, these terms are also used to characterise speech and speakers themselves, although this is by no means meant in a discriminatory way, nor does it deny the right to be neurodiverse.

3 Note that the terms ‘prolongation’ and ‘lengthening’ are often used interchangeably in the literature. For the sake of consistency, we use only the latter in this paper.

4 ANR-18-CE36-0008 Le Bégaiement : la Neurologie, la Phonétique, l’Informatique pour son Diagnostic et sa Rééducation (BENEPHIDIRE, PI : Fabrice Hirsch).

5 A first version of this model was presented during the 2021 DiSS workshop (Didirková et al., 2021).

6 Krippendorff (2004) points out several limitations of Fleiss’ Kappa, such as its inability to account for individual annotators’ preferences for particular categories. Despite these drawbacks, Kappa is widely used in corpus linguistics and is therefore useful for comparison with previous proposals.

7 κ < 0: less than chance agreement; κ = 0.01–0.20: slight agreement; κ = 0.21–0.40: fair agreement; κ = 0.41–0.60: moderate agreement; κ = 0.61–0.80: substantial agreement; κ = 0.81–0.99: almost perfect agreement; κ = 1: perfect agreement.
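
As an illustration of notes 6 and 7, the following is a minimal sketch of how Fleiss' Kappa can be computed for a set of annotated items and mapped onto the agreement bands listed above. It is not the authors' implementation. The function names, the category labels "A" and "B", and the toy data are hypothetical, and the handling of values that fall between the listed bands (for example between 0 and 0.01) is an assumption made here for the sake of a complete function.

```python
from collections import Counter


def fleiss_kappa(ratings, categories):
    """Fleiss' Kappa for items labelled by the same number of annotators.

    ratings: list of items; each item is a list with one label per annotator.
    categories: list of all possible labels.
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])

    # Count how many annotators chose each category for each item.
    counts = []
    for item in ratings:
        tally = Counter(item)
        counts.append([tally[c] for c in categories])

    # Observed agreement: mean proportion of agreeing annotator pairs per item.
    per_item = [
        (sum(n * n for n in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(per_item) / n_items

    # Expected agreement from the overall proportion of each category.
    proportions = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(len(categories))
    ]
    p_expected = sum(p * p for p in proportions)

    return (p_observed - p_expected) / (1 - p_expected)


def interpret_kappa(kappa):
    """Map a Kappa value onto the bands of note 7.

    Assumption: values between two listed bands are assigned to the band
    whose upper bound they do not exceed.
    """
    if kappa < 0:
        return "less than chance agreement"
    if kappa >= 1:
        return "perfect agreement"
    bands = [
        (0.20, "slight agreement"),
        (0.40, "fair agreement"),
        (0.60, "moderate agreement"),
        (0.80, "substantial agreement"),
    ]
    for upper, label in bands:
        if kappa <= upper:
            return label
    return "almost perfect agreement"


# Hypothetical toy data: four items, three annotators, two placeholder labels.
toy_ratings = [
    ["A", "A", "A"],
    ["A", "A", "B"],
    ["B", "B", "B"],
    ["B", "B", "A"],
]
toy_kappa = fleiss_kappa(toy_ratings, ["A", "B"])
print(round(toy_kappa, 3), interpret_kappa(toy_kappa))
```

The sketch requires every item to be labelled by the same number of annotators, which is a precondition of Fleiss' Kappa itself. It is undefined when all annotators use a single category throughout, since the expected agreement is then 1.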

Additional information

Funding

The work was supported by the Agence Nationale de la Recherche [ANR-18-CE36-0008].
