473
Views
10
CrossRef citations to date
0
Altmetric
Research Article

Relationship between speech recognition in noise and sparseness

, , &
Pages 75-82 | Received 15 Jun 2011, Accepted 08 Sep 2011, Published online: 22 Nov 2011
 

Abstract

Objective: Established methods for predicting speech recognition in noise require knowledge of clean speech signals, placing limitations on their application. The study evaluates an alternative approach based on characteristics of noisy speech, specifically its sparseness as represented by the statistic kurtosis. Design: Experiments 1 and 2 involved acoustic analysis of vowel-consonant-vowel (VCV) syllables in babble noise, comparing kurtosis, glimpsing areas, and extended speech intelligibility index (ESII) of noisy speech signals with one another and with pre-existing speech recognition scores. Experiment 3 manipulated kurtosis of VCV syllables and investigated effects on speech recognition scores in normal-hearing listeners. Study sample: Pre-existing speech recognition data for Experiments 1 and 2; seven normal-hearing participants for Experiment 3. Results: Experiments 1 and 2 demonstrated that kurtosis calculated in the time-domain from noisy speech is highly correlated (r > 0.98) with established prediction models: glimpsing and ESII. All three measures predicted speech recognition scores well. The final experiment showed a clear monotonic relationship between speech recognition scores and kurtosis. Conclusions: Speech recognition performance in noise is closely related to the sparseness (kurtosis) of the noisy speech signal, at least for the types of speech and noise used here and for listeners with normal hearing.

Acknowledgements

This work was supported by an ISVR Raleigh scholarship with partial additional support from Cochlear UK. We thank Martin Cooke for generous help, stimulating discussion, and providing the speech data for the first experiment. Thanks also to TNO Human Factors, Soesterberg, and the Netherlands for providing the babble noise. Thanks also to Koenrad Rhebergen for providing code for the ESII calculations.

Declaration of interest: The authors report no conflicts of interest. The authors alone are responsible for the content and writing of the paper.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.