Research Articles

Voice banking to support individuals who use speech-generating devices: development and evaluation of Singaporean-accented English synthetic voices and a Singapore Colloquial English recording inventory

Pages 208-218 | Received 08 Apr 2022, Accepted 02 Feb 2023, Published online: 27 Mar 2023
 

Abstract

Voice banking involves recording an inventory of sentences produced via natural speech. The recordings are used to create a synthetic text-to-speech voice that can be installed on speech-generating devices. This study highlights a minimally researched, clinically relevant issue surrounding the development and evaluation of Singaporean-accented English synthetic voices that were created using readily available voice banking software and hardware. Processes used to create seven unique synthetic voices that produce Singaporean-accented English, and the development of a custom Singapore Colloquial English (SCE) recording inventory, are reviewed. The perspectives of adults who spoke SCE and banked their voices for this project are summarized; their feedback was generally positive. Finally, 100 adults familiar with SCE participated in an experiment that evaluated the intelligibility and naturalness of the Singaporean-accented synthetic voices, as well as the effect of the custom SCE inventory on listener preferences. The addition of the custom SCE inventory did not affect the intelligibility or naturalness of the synthetic speech, and listeners tended to prefer the voice created with the SCE inventory when the stimulus was an SCE passage. The procedures used in this project may be helpful for interventionists who wish to create synthetic voices with accents that are not commercially available.

Acknowledgments

The authors wish to express sincere gratitude to the participants of this study.

Disclosure statement

H. Timothy Bunnell directs the Nemours Speech Research Laboratory that developed the ModelTalker software and voice banking service. Jason Lilley is an Assistant Research Scientist at Nemours Speech Research Laboratory. Nemours is a nonprofit health care system, and the Speech Research Laboratory provides voice banking to clients on a fixed fee-for-service basis. We have no known conflict of interest to disclose.

Notes

1 Sennheiser USB headset microphone© is a product of Sennheiser electronic GmbH & Co. KG, Germany.

2 HP laptop EliteBook 830 G5© is a product of the Hewlett-Packard company, Palo Alto, California, United States.

3 WhatsApp, Instagram, and Facebook are owned by Meta Platforms Inc., Menlo Park, California, United States.

4 Siri© is a virtual assistant that is part of Apple Inc.'s iOS, iPadOS, watchOS, macOS, tvOS, and audioOS operating systems.

Additional information

Funding

This study was funded by the Singapore Ministry of Education (MOE) under the Education Research Funding Programme (SUG 06/18 CM) and administered by the National Institute of Education (NIE), Nanyang Technological University, Singapore. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the Singapore MOE and NIE.
