
Whole Body Motion Noise Cancellation of a Robot for Improved Automatic Speech Recognition

Pages 1405-1426 | Published online: 02 Apr 2012

