7
Views
3
CrossRef citations to date
0
Altmetric
PAPERS

Binaural active audition for humanoid robots to localise speech over entire azimuth range

, , &
Pages 355-367 | Received 29 Sep 2008, Accepted 10 May 2009, Published online: 25 Nov 2009

References

  • Bahoura , M and Pelletier , C . . Respiratory sound classification using cepstral analysis and gaussian mixture models . Proceedings of the IEEE/EMBS International Conference . Sep. 1–5 , San Francisco, USA.
  • Berglund , E J . 2005 . Active audition for robots using parameter-less self-organising maps , Ph.D. thesis Australia : The University of Queensland . (October)
  • Blauert , J . 1996 . Spatial hearing—the psychophysics of human sound localization. , Rev. ed. , Cambridge, MA : The MIT Press .
  • Cheng , C I and Wakefield , G H . 2001 . Introduction to head-related transfer functions (HRTFs): space . J Audio Eng Soc. , 49 ( 4 ) : 231 – 248 .
  • Hara , I , Asano , F , Kawai , Y , Kanehiro , F and Yamamoto , K . . Robust speech interface based on audio and video information fusion for humanoid HRP-2 . Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2004) . October , Sendai, Japan. pp. 2404 – 2410 .
  • Huang , J , Ohnishi , N and Sugie , N . . Spatial localization of sound sources: azimuth and elevation estimation . Proceedings of IEEE/IMTC International Conference on Instrumentation and Measurement Technology . May , St. Paul, MN, USA. pp. 330 – 333 .
  • Hwang , S , Park , Y and Park , Y . . Sound source localization using HRTF database . Proceedings of International Conference on Control, Automation, and Systems (ICCAS2005) . June , Busan, South Korea. pp. 751 – 755 .
  • Kim , H-D. 2008 . Binaural active audition for humanoid robots , Ph.D. thesis Japan : Kyoto University . (September)
  • Kim , H-D , Choi , J-S and Kim , M . 2007a . Human-robot interaction in real environments by audio-visual integration . Int J Control, Automation, and Systems. , 5 ( 1 ) : 61 – 69 .
  • Kim , H-D , Komatani , K , Ogata , T and Okuno , H G . 2007b . Real-time auditory and visual talker tracking through integrating EM algorithm and particle filter , 280 – 290 . Kyoto, , Japan : Springer-Verlag . IEA/AIE-2007, LNAI 4570; June
  • Lu , L , Zhang , H-J and Jiang , H . 2002 . Content analysis for audio classification and segmentation . IEEE Trans Speech Audio Process. , 10 ( 7 ) : 504 – 516 .
  • Moon , T K . 1996 . The expectation-maximization algorithm . IEEE Signal Process Mag. , 13 ( 6 ) : 47 – 60 .
  • Nakadai , K , Hidai , K-i , Okuno , H G and Kitano , H . . Real-time speaker localization and speech separation by audio-visual integration . Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2002) . May , Washington DC, USA. pp. 1043 – 1049 .
  • Nishiura , T , Yamada , T , Nakamura , S and Shikano , K . . Localization of multiple sound sources based on a CSP analysis with a microphone array . Proceedings of IEEE/ICASSP International Conference on Acoustics, Speech, and Signal Processing . June , Istanbul, Turkey. pp. 1053 – 1056 .
  • Schmidt , R O . 1986 . Multiple emitter location and signals parameter estimation . IEEE Trans Antennas and Propagation , AP-34 : 276 – 280 .
  • Shah , J K , Iyer , A N , Smolenski , B Y and Yantormo , R E . . Robust voiced/unvoiced classification using novel feature and Gaussian mixture model . Paper presented at: IEEE/ICASSP International Conference on Acoustics, Speech, and Signal Processing . May , Montreal, Canada.
  • Thurlow , W R , Mangels , J W and Runge , P S . 1967 . Head movements during sound localization . Journal of the Acoustical Society of America. , 42 ( 2 ) : 489 – 493 .
  • Valin , J-M , Yamamoto , S , Rouat , J , Michaud , F , Nakadai , K and Okuno , H G . 2007 . Robust recognition of simultaneous speech by a mobile robot . IEEE Trans Robot. , 23 ( 4 ) : 742 – 752 .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.