21
Views
1
CrossRef citations to date
0
Altmetric
Original Articles

Speech Recognition with Emphasis on Wavelet based Feature Extraction

, MIETE & , FIETE
Pages 3-13 | Published online: 26 Mar 2015

REFERENCES

  • J Picone, Signal modelling technique in speech recognition, Proc of IEEE, vol 81, no 9, pp 1215–1247, Sept 1993.
  • S Balakrishnama, A Ganapathiraju & J Picone, Linear discriminant analysis for signal processing problems, Proc of IEEE Southeastcon, Lexington, Kentucky, USA, pp 36–39, March 1999.
  • K Fukunaga, Introduction to statistical pattern recognition, Academic Press, San Diego, California, USA, 1990.
  • T Koizumi, M Mori, S Taniguchi & M Maruya, Recurrent Neural Network For Phoneme Recognition, Proc of 4 Int. Conf on Spoken Language Processing, Philadelphia, USA, vol 1, pp 326–328, Oct 1996.
  • C G Looney, Pattern recognition using Neural Neworks: Theory and algorithms for Engineers and Scientists, Oxford University Press, New York, 1997.
  • S Furui, M Sondhi & Marcel Dekker, Waibel, Neural Network approach for speech recognition, Advances in speech processing, 1991, pp 555–595.
  • Waibel, T Hanazawa, G Hinton, K Shikano & K J Lang, Phoneme recognition using time-delay neural network, IEEE Trans on ASSP, vol 37, no 3, pp 328–339, March 1989.
  • S H Chen, Y F Liao, S M Chiang & S Chang, An RNN-Based preclassification method for fast continuous Mandarin speech recognition, IEEE Trans, on Speech and Audio Processing, vol 6, no 1, pp 86–90, Jan 1998.
  • N Morgan & H Bourlard, Continuos speech recognition: an introduction to hybrid/connectionist approach, IEEE Signal Processing Mag, pp 25–42, May 1995.
  • L Rabiner & B H Juang, Fundamental of Speech Recognition, Prentice Hall 1993.
  • S Young, A review of large vocabulary continuous speech recognition, IEEE Signal Processing Mag, pp 45–57, Sept 1996.
  • L R Rabiner, A tutorial on hidden markov models and selected applications in speech recognition, Proc of IEEE, vol 77, no 2, pp 257–285, February 1989.
  • B H Juang & S Furui, Automatic recognition and understanding of spoken language-A first step toward natural human-machine communication, Proc of IEEE, vol 88, no 8, pp 1142–1165, August 2000.
  • J Picone, W J Ebel & N. Deshmukh, Automatic speech understanding: The next generation, Digital Signal Processing Technology, vol CR57, pp 101–114, 1995.
  • R Rosenfeld, Two decades of statistical language modeling: Where do we go from here? Proc of IEEE, vol 88, no 8, pp. 1270–1278, August 2000.
  • H Ney & S Ortmanns, Dynamic programming search for continuous speech recognition, IEEE Signal Processing Mag, pp 64–83, Sept 1999.
  • Deshmukh, A Ganapathiraju & J Picone, Hierarchical search for large vocabulary conversational speech recognition, IEEE Signal Processing Mag, pp 84–107, Sept 1999.
  • O Ghitza, Auditory models and human performance in tasks related to speech coding and speech recognition, IEEE Trans on Speech and Audio Processing, vol 2, no 1, PART-II, pp. 115–132, Jan 1994.
  • J B Allen, How do human process and recognize speech?, IEEE Trans on Speech and Audio Processing, vol 2, no 4, pp 567–577, Oct 1994.
  • L R Rabiner & R W Schafer, Digital processing of speech signals, Prentice-Hall, Englewood Cliffs, New Jersey, USA, 1978.
  • R Vergin, D O'Shaughnessy & A. Farhat, Generalized Mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition, IEEE Trans. on Speech and Audio Processing, vol 7, no 5, pp 525–534, Sept. 1999.
  • J C Junqua, H Wakita & H Hermansky, Evaluation and optimization of perceptually-based asr front end, IEEE Trans. on Speech and Audio Processing, vol 1, no 1, pp 39–48, Jan 1993.
  • H Wassner & G Chollet, New Cepstral Representation using Wavelet Analysis and Spectral Transformations for Robust Speech Recognition, Proc of 4 Int. Conf on Spoken Language Processing, Philadelphia, USA, vol 1, pp 260–263. Oct 1996.
  • S Furui, Speaker Independent Isolated Word Recognition Using Dynamic Features of the Speech Spectrum, IEEE Trans ASSP, vol 34, no 1, pp 52–59, Feb 1986.
  • N Harte, S Vaseghi & B Milner, Dynamic Features for the Segmental Speech Recognition, Proc of 4 Int Conf on Spoken Language Processing, Philadelphia, USA, vol 2, pp 933–936, Oct 1996.
  • Hermansky & N Morgan, RASTA Processing of Speech, IEEE Trans on Speech and Audio Processing, vol 2, no 4, pp 578–589, 1994.
  • J Shen, ‘Discriminative Temporal Feature Extraction For Robust Speech Recognition’, Electronics Letters, vol 33, no 19, pp 1598–1600, 11 Sept 1997.
  • D S Kim, J H Jeong & S Y Lee, Feature Extraction Based on Zero Crossings and Peak Amplitudes for Robust Speech Recognition in Noisy Environment, Proc Int Conf on Acoustic Speech and Signal Processing, Atlanta, USA, pp 61–64, May 1996.
  • D S Kim, S Y Lee, R M Kil & X Zhu, Auditory model for speech recognition in real world noisy environments, Electronics Letters, vol 33, no 1, pp 12–13, 2 Jan 1997.
  • J Hernando, On the use of filter-bank energies drives from the auto-correlation sequence for noisy speech recognition, Proc of 6 Int Conf on Spoken Language Processing, Beijing, China, vol 3, pp 102–105, Oct 2000
  • J Hernando & C Nadeu, Linear prediction of one-sided auto-correlation sequence for noisy speech recognition, IEEE Trans on SAP, vol 5, no 1, pp 80–84, 1997.
  • K H You & H C Wang, Robust features for noisy speech recognition based on temporal trajectory filtering of short-time autocorrelation sequences. Speech Communication, vol 28, pp 13–24, 1999.
  • J Xu & G Wei, Noise-Robust speech recognition based on difference of power spectrum, Electronics Letters, vol 36, no 14, pp 1247–1248, 6 July 2000.
  • S Mallat, Wavelet tour signal of signal processing, Academic Press, 1999.
  • O Rioul & M Vetterli, Wavelet and signal processing, IEEE Signal Processing Mag, pp 14–38, Oct 1991.
  • A Graps, An Introduction To Wavelets, http://www.eee.metu.edu.tr/~ngencer/amara.htm.
  • B T Tan, M Fu, A Spray & P Dermody, The use of Wavelet Transform in Phoneme Recognition, Proc of 4 Int Conf on Spoken Language Processing, Philadelphia, USA, vol 4, pp 2431–2434, Oct 1996.
  • C J Long, Phoneme discrimination using non-linear wavelets method, PhD thesis, Loughborough University, Dept of Electronic and Electrical Engineering, Feb 1999.
  • C J Long & S Datta, Wavelet based feature extraction for phoneme recognition, Proc of 4th Int Conf on Spoken Language Processing, Philadelphia, USA, vol 1, pp 264–267, Oct 1996.
  • O Farooq & S Datta, Dynamic feature extraction by wavelet analysis, Proc of 6th Int Conf on Spoken Language Processing, Beijing, China, vol 4, pp 696–699, Oct 2000.
  • O Farooq & S Datta, Speaker independent phoneme recognition by MLP using wavelet features, Proc of 6 Int Conf on Spoken Language Processing, Beijing, China, vol 1, pp 393–396, Oct 2000.
  • C J Long & S Datta, Discriminant wavelet basis construction for speech recognition, Proc of 5 Int Conf of Spoken Language Processing, Sydney, Australia, vol 3, pp 1047–1049, 30 Nov—4 Dec 1998.
  • E Lukasik, Wavelet packets based features selection for voiceless plosive classification, Proc of Int Conf Acoustic Speech and Signal Processing 2000, vol 2, pp 689–692, 2000.
  • TIMIT Acoustic-Phonetic Continuous Speech Corpus, National Institute of Standards and Technology, Speech Disc 1-1.1, NTIS Order no PB91-505065, Oct 1990.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.