12
Views
8
CrossRef citations to date
0
Altmetric
Original Articles

Speaker-Independent English Consonant and Japanese Word Recognition by a Stochastic Dynamic Time Warping Method

&
Pages 87-95 | Received 04 Jun 1987, Published online: 02 Jun 2015
 

Abstract

In this paper, a stochastic dynamic time warping method for speaker-independent recognition is proposed and some considerations are described on speaker-independent consonant recognition and word recognition on a large vocabulary size. In this method, conditional probabilities were used instead of local distances in a standard dynamic time warping method, and transition probabilities instead of path costs. This is related to both the standard DTW method and the hidden Markov model. In word recognition, the whole word templates are constructed by the concatenation of syllable templates, which are taken from spoken words. And, we got the reference patterns from 216 words uttered by 30 male speakers and recognized the other 200 words uttered by the other 10 speakers. The standard dynamic time warping method for speaker-independent recognition on 200 words gave the average word recognition rate of 89.3%. The stochastic dynamic time warping method we proposed here improved the recognition rate to 92.9%.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.