Original Articles

Automatic prediction of intelligible speaking rate for individuals with ALS from speech acoustic and articulatory samples

Pages 669-679 | Received 30 Apr 2017, Accepted 28 Jul 2018, Published online: 08 Nov 2018
 

Abstract

Purpose: This research aimed to automatically predict intelligible speaking rate for individuals with Amyotrophic Lateral Sclerosis (ALS) based on speech acoustic and articulatory samples.

Method: Twelve participants with ALS and two normal subjects produced a total of 1831 phrases. An NDI Wave system was used to collect tongue and lip movement data and acoustic data synchronously. A machine learning algorithm (i.e. support vector machine) was used to predict intelligible speaking rate (speech intelligibility × speaking rate) from acoustic and articulatory features of the recorded samples.
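The target measure defined above (speech intelligibility × speaking rate) can be illustrated with a minimal sketch. The function name, the choice to express intelligibility as a percentage, and the example values are assumptions made for illustration; they are not taken from the article.

```python
def intelligible_speaking_rate(intelligibility_pct: float, speaking_rate_wpm: float) -> float:
    """Return intelligible speaking rate in words per minute (WPM).

    Assumes intelligibility is given as a percentage (0-100) and speaking
    rate as words per minute; both conventions are assumptions of this sketch.
    """
    if not 0.0 <= intelligibility_pct <= 100.0:
        raise ValueError("intelligibility_pct must be between 0 and 100")
    if speaking_rate_wpm < 0.0:
        raise ValueError("speaking_rate_wpm must be non-negative")
    # Multiply first, then scale the percentage down to a proportion.
    return intelligibility_pct * speaking_rate_wpm / 100.0


# Hypothetical speaker: 80% intelligible at 150 WPM.
example_rate = intelligible_speaking_rate(80.0, 150.0)
```

Under these assumptions, a speaker who is fully intelligible has an intelligible speaking rate equal to the raw speaking rate, and the measure falls as either intelligibility or rate declines.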

Result: Used separately, acoustic, lip movement, and tongue movement information yielded R² values of 0.652, 0.660, and 0.678 and Root Mean Squared Errors (RMSE) of 41.096, 41.166, and 39.855 words per minute (WPM), respectively, between the predicted and actual values. Combining acoustic, lip, and tongue information yielded the highest R² (0.712) and the lowest RMSE (37.562 WPM).
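For readers unfamiliar with the two reported metrics, the sketch below shows one common way to compute them from predicted and actual values. It assumes R² is the coefficient of determination; the abstract does not say whether the authors used this definition or a squared correlation, and the sample values are hypothetical rather than data from the study.

```python
import math


def rmse(actual, predicted):
    """Root mean squared error, in the same units as the inputs (e.g. WPM)."""
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot (assumed definition)."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


# Hypothetical intelligible speaking rates in WPM, for illustration only.
actual_wpm = [100.0, 150.0, 200.0]
predicted_wpm = [110.0, 140.0, 200.0]

example_rmse = rmse(actual_wpm, predicted_wpm)
example_r2 = r_squared(actual_wpm, predicted_wpm)
```

A lower RMSE means predictions sit closer to the actual values in WPM, while an R² nearer to 1 means the predictions account for more of the variation across samples.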

Conclusion: The proposed analyses predicted the participants' intelligible speaking rate with reasonably high accuracy from acoustic and/or articulatory features extracted from one short speech sample. With further development, the analyses may be well suited for clinical applications that require automatic prediction of speech severity.

Acknowledgement

We would like to thank Dr. Anusha Thomas, Jennifer McGlothlin, Brian Richburg, Kristin Teplansky, Jana Mueller, Saara Raja, Heather Xiao, and the volunteering participants.

Declaration of interest

The authors report no conflicts of interest. The authors alone are responsible for the content and writing of this article.

Additional information

Funding

This work was supported by the National Institutes of Health [R01DC013547, R03DC013990, and K24DC016312] and by the American Speech-Language-Hearing Foundation through a New Century Scholar Research Grant.
