59
Views
2
CrossRef citations to date
0
Altmetric
Original Articles

New speech/music discrimination approach based on warping transformation and ANFIS

, , &
Pages 237-247 | Published online: 16 Feb 2007

References

  • Burred , J. J. and Lerch , A. 2004 . Hierarchical automatic audio signal classification . Journal of the Audio Engineering Society , 52 : 724 – 739 .
  • Carey , M. J. , Parris , E. S. and Lloyd-Thomas , H. . A comparison of features for speech, music discrimination . Proceedings of the IEEE ICASSP'99 . Phoenix, USA. pp. 1432 – 1435 .
  • Davis , S. and Mermelstein , P. 1980 . Experiments in syllable-based recognition of continuous speech . IEEE Transactions on Acoustics, Speech, & Signal Processing , 28 : 357 – 366 .
  • Duda , R. , Hart , P. and Stork , D. 2000 . Pattern classification , New York : Wiley .
  • El-Maleh , K. , Klein , M. , Petrucci , G. and Kabal , P. 2000 . Speech/music discrimination for multimedia applications . Proceedings of the IEEE ICASSP'2000 , 6 : 2445 – 2448 .
  • Goodwin , M. M. and Laroche , J. . Audio segmentation by feature-space clustering using linear discriminant analysis and dynamic programming . Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics . New York, USA. pp. 131 – 134 .
  • Harb , H. and Chen , L. 2003 . Robust speech music discrimination using spectrum's first order statistics and neural networks . Proceedings of the IEEE International Symposium on Signal Processing and Its Applications , 2 : 125 – 128 .
  • Härmä , A. , Karjalainen , M. , Savioja , L. , Välimäki , V. , Laine , U. K. and Huopaniemi , J. 2000 . Frequency-warped signal Processing for audio applications . Journal of the Audio Engineering Society , 48 ( 11 ) : 1011 – 1031 .
  • ISO-IEC . 1999 . MPEG-4 Overview (ISO/IEC JTC1/SC29/WG11 N2995 Document)
  • Jang , J. S.R. . Fuzzy modeling using generalized neural networks and kalman filter algorithm . Proceedings of the Ninth National Conference on Artificial Intelligence . Anaheim, CA, USA. pp. 762 – 767 .
  • Jang , J. S.R. 1993 . Adaptive-network-based fuzzy inference systems . IEEE Transactions on Systems, Manchines and Cybernetics , 23 ( 3 ) : 665 – 685 .
  • Jang , J. S.R. and Sun , C. T. 1993 . Functional equivalence between radial basis function networks and fuzzy inference systems . IEEE Transactions on Neural Networks , 4 : 156 – 159 .
  • Karneback , S. . Discrimination between speech and music based on a low frequency modulation feature . European Conference on Speech Communications and Technology . Alborg, Denmark. pp. 1891 – 1894 .
  • Lee , C. C. 1990 . Fuzzy logic in control systems: fuzzy logic controller-Part I . IEEE Transactions on Systems, Man and Cybernetics , 20 : 404 – 435 .
  • Logan , B. . Mel frequency cepstral coefficients for music modeling . Proceedings of the International Symposium on Music Information Retrieval (ISMIR) . Plymouth, MA, USA.
  • Minami , K. , Akutsu , A. , Hamada , H. and Tonomura , Y. 1998 . Video handling with music and speech detection . IEEE Multimedia , 5 ( 3 ) : 17 – 25 .
  • Qiao , R. Y. . Mixed wideband speech and music coding using a speech/music discriminator . Proceedings of IEEE TENCON . Brisbane, Australia. pp. 605 – 608 .
  • Saunders , J. . Real-time discrimination of broacast speech/music . Proceedings of IEEE ICASSP'96 . Atlanta, USA. pp. 993 – 996 .
  • Scheirer , E. and Slaney , M. . Construction and evaluation of a robust multifeature speech/music discriminator . Proceedings of IEEE ICASSP'97 . Munich, Germany. pp. 1331 – 1334 .
  • Smith , J. O. III and Abel , J. S. 1999 . Bark and ERB bilinear transforms . IEEE Transactions on Speech and Audio Processing , 7 : 697 – 708 .
  • Takagi , T. and Sugeno , M. . Derivation of fuzzy control rules from human operator's control actions . Proceedings of the IFAC Symposium on Fuzzy Information, Knowledge Representation and Decision Analysis . Marseilles, France. pp. 55 – 60 .
  • Tancerel , L. , Ragot , S. , Ruoppila , V. T. and Lefebvre , R. . Combined speech and audio coding by discrimination . Proceedings of the IEEE Workshop on Speech Coding . Delavan, WI, USA. pp. 17 – 20 .
  • Tsukamoto , Y. 1979 . “ An approach to fuzzy reasoning methods ” . In Advances in Fuzzy Set Theory and Applications , Edited by: Ragade , R. and Yager , R. 137 – 149 . Amsterdam : North-Holland .
  • Tzanetakis , G. and Cook , P. 2002 . Musical genre classification of audio signals . IEEE Transactions on Speech and Audio Processing , 10 ( 5 ) : 293 – 302 .
  • Wang , W. Q. , Gao , W. and Ying , D. W. 2003 . A fast and robust speech/music discrimination approach . Proceedings of the 4th Pacific Rim Conference on Multimedia , 3 : 1325 – 1329 .
  • Zadeh , L. A. 1965 . Fuzzy sets . Information and Control , 8 : 338 – 353 .
  • Zhang , T. and Kuo , J. 2001 . Audio content analysis for online audiovisual data segmentation and classification . IEEE Transactions on Speech and Audio Processing , 9 ( 4 ) : 441 – 457 .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.