Views

CrossRef citations to date

Altmetric

Original Articles

The Nonlinear Time Alignment Model for Speech Recognition System

D D DoyeDepartment of Electronics and Computer Science Engineering, SGGS College of Engineering and Technology, Vishnupuri, Nanded431 606, India.View further author information

T R SontakkeDepartment of Electronics and Computer Science Engineering, SGGS College of Engineering and Technology, Vishnupuri, Nanded431 606, India.View further author information

Smita NagtodeDepartment of Electronics and Computer Science Engineering, SGGS College of Engineering and Technology, Vishnupuri, Nanded431 606, India.View further author information

Abstract

We present the new nonlinear time alignment model, which is much faster than widely accepted DTW algorithms. This work has been started with the aim of finding suitable time alignment algorithm and features, for Marathi (Language spoken in the state of Maharashtra, India) word Speech Recognition System. Proposed algorithm shows comparable or better recognition efficiency than widely accepted algorithms and is robust to end point variations. In this work, vocabularies are: (1) 46 isolated monosyllabic confusing Marathi alphabets and (2) 46 non-confusing names of the persons. The features used are LPC, LFCC and MFCC. For the confusing word vocabulary, the proposed algorithm proved to be best showing maximum recognition efficiency of 89.13% and second best is Itakura's DTW algorithm with maximum recognition efficiency of 86.96%. LFCC with Itakura's DTW algorithm shows poor performance with maximum recognition efficiency of 13.40%. But LFCC with proposed algorithm shows comparable results for non-confusing vocabulary.

Indexing terms:

Additional information

Notes on contributors

D D Doye

D D Doye received his BE (Electronics) degree in 1988 and ME (Electronics) degree in 1993, from SGGS College of Engineering and Technology, Vishnupuri, Nanded (MS). Presently, he is working as Assistant Professor in Department of Electronics and Computer Science and Engineering, SGGS College of Engineering and Technology, Vishnupuri, Nanded. His field of research is Speech Recognition.

T R Sontakke

T R Sontakke received his BE (Electrical Engineering) from Government College of Engineering, Aurangabad M Tech from VRCE Nagpur and PhD from IIT Bombay, Mumbai. He worked at TTTI Bhopal for a period of ten years. He is currently working as Professor in Electronics with additional charge of Principal, SGGS College of Engineering and Technology, Vishnupuri, Nanded. He has been the recipient of two Gold Medals from Institution of Engineers, India for his research papers. His research fields are Speech Recognition, Neural Networks and Image Processing.

Smita Nagtode

Smita Nagtode received his BE (Electronics and Telecommunication Engineering) from Bapuraoji Deshmukh College of Engineering Wardha (MS) in May/June 1998. She has received her ME (Electronics with specialization in Computer) degree from SGGS College of Engineering and Technology, Vishnupuri, Nanded in May 2000. Presently, she is working as Lecturer in department of Electronics, GH Raisoni College of Engineering and Technology, Digdoh Hills, Hingana Road, Nagpur.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

The Nonlinear Time Alignment Model for Speech Recognition System

Notes on contributors

D D Doye

T R Sontakke

Smita Nagtode

Information for

Open access

Opportunities

Help and information

The Nonlinear Time Alignment Model for Speech Recognition System

Abstract

Additional information

Notes on contributors

D D Doye

T R Sontakke

Smita Nagtode

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature