Research Article

Automatic method of pause measurement for normal and dysarthric speech

Pages 141-154 | Received 07 May 2009, Accepted 25 Sep 2009, Published online: 25 Jan 2010
 

Abstract

This study proposes an automatic method for the detection of pauses and identification of pause types in conversational speech for the purpose of measuring the effects of Friedreich's Ataxia (FRDA) on speech. Speech samples of ∼ 3 minutes were recorded from 13 speakers with FRDA and 18 healthy controls. Pauses were measured from the intensity contour and fit with bimodal lognormal distributions using the Expectation-Maximization algorithm in Matlab©. In the speakers with FRDA, both modes in the pause distributions had significantly larger means, with disproportionately fewer pauses associated with the first mode. From this preliminary study, it is concluded that distributional analysis of pause duration holds promise as a useful method of measuring the effects of FRDA on functional speech.
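The distribution-fitting step described in the abstract can be illustrated with a minimal sketch. It is not the authors' Matlab code. It assumes pause durations have already been extracted in seconds, and it relies on the fact that a two-component lognormal mixture in duration is a two-component Gaussian mixture in log duration. The function name `fit_bimodal_lognormal`, the median-split initialisation, the stopping rule, and the synthetic data are all hypothetical choices made for illustration.

```python
import numpy as np


def fit_bimodal_lognormal(durations, n_iter=200, tol=1e-8):
    """Fit a two-component lognormal mixture to pause durations with EM.

    Works on log durations, where each lognormal component is Gaussian.
    Returns (weights, means, sds) in log units, ordered by increasing mean.
    """
    x = np.log(np.asarray(durations, dtype=float))

    # Assumption: initialise the two modes by splitting at the median.
    med = np.median(x)
    lo, hi = x[x <= med], x[x > med]
    w = np.array([0.5, 0.5])
    mu = np.array([lo.mean(), hi.mean()])
    sd = np.array([lo.std() + 1e-3, hi.std() + 1e-3])

    prev_ll = -np.inf
    for _ in range(n_iter):
        # E-step: log density of each point under each component,
        # then the posterior probability (responsibility) of each mode.
        log_pdf = (-0.5 * ((x[:, None] - mu) / sd) ** 2
                   - np.log(sd) - 0.5 * np.log(2 * np.pi))
        log_joint = log_pdf + np.log(w)
        log_norm = np.logaddexp(log_joint[:, 0], log_joint[:, 1])
        resp = np.exp(log_joint - log_norm[:, None])

        # M-step: responsibility-weighted updates of the parameters.
        nk = resp.sum(axis=0)
        w = nk / x.size
        mu = (resp * x[:, None]).sum(axis=0) / nk
        sd = np.sqrt((resp * (x[:, None] - mu) ** 2).sum(axis=0) / nk)
        sd = np.maximum(sd, 1e-6)  # guard against a collapsing component

        # Stop when the log-likelihood no longer improves.
        ll = log_norm.sum()
        if abs(ll - prev_ll) < tol:
            break
        prev_ll = ll

    order = np.argsort(mu)
    return w[order], mu[order], sd[order]


# Synthetic example: these values are invented, not taken from the study.
rng = np.random.default_rng(0)
short_pauses = rng.lognormal(mean=np.log(0.15), sigma=0.4, size=600)
long_pauses = rng.lognormal(mean=np.log(0.80), sigma=0.5, size=400)
pauses = np.concatenate([short_pauses, long_pauses])

weights, means, sds = fit_bimodal_lognormal(pauses)
print("mode weights:", weights)
print("mode medians (s):", np.exp(means))
print("SD of log duration:", sds)
```

The weight of the first mode corresponds to the proportion of pauses associated with that mode, and `np.exp(means)` gives each mode's median duration in seconds.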

Acknowledgements

The authors thank the participants involved in this research. This project was funded by a research grant from the Friedreich's Ataxia Research Alliance (FARA), USA, and the Friedreich's Ataxia Research Association, Australasia.

Declaration of interest: The authors report no conflicts of interest. The authors alone are responsible for the content and writing of the paper.

Notes

1. The anti-alias filter and relatively high sampling rate are theoretically not necessary for pause analysis, but were applied to the files for future spectral studies.

2. Although the proposed measurement and analysis of pause duration are automatic, it should be noted that the preparation of the samples does require manual effort. For this study, digitizing and saving the recordings required about 7 minutes per speaker. Listening to and editing the 3-minute samples required an additional 6 minutes per speaker on average, although this time is highly dependent on the experience of the editor.

3. Another example of a single factor responsible for bimodality in a complex phenomenon is gender in a distribution of human height (Joiner, 1975; but see Schilling, Watkins, and Watkins, 2002).

4. The standard deviation of log duration is akin to the coefficient of variation, and can be a useful index of variation for phenomena in which increases in standard deviation coincide with increases in mean, as occurs in speech segment duration (see Rosen, 2005).
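The relationship stated in note 4 can be made explicit for a single lognormal component. The symbols below are introduced here for illustration and are not taken from the article: $X$ is a duration, and $\mu$ and $\sigma$ are the mean and standard deviation of $\ln X$.

```latex
\begin{align*}
\ln X &\sim \mathcal{N}(\mu, \sigma^2) \\
\mathrm{E}[X] &= e^{\mu + \sigma^2/2} \\
\mathrm{Var}[X] &= \left(e^{\sigma^2} - 1\right) e^{2\mu + \sigma^2} \\
\mathrm{CV}[X] &= \frac{\sqrt{\mathrm{Var}[X]}}{\mathrm{E}[X]}
  = \sqrt{e^{\sigma^2} - 1} \approx \sigma \quad \text{for small } \sigma
\end{align*}
```

The coefficient of variation therefore depends only on $\sigma$ and not on $\mu$, so the standard deviation of log duration indexes relative variability independently of the mean.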
