27
Views
0
CrossRef citations to date
0
Altmetric
Shortlisted Papers

Using Multiple Level Fusion for Improving Performance of Speaker Recognition

, , &
Pages 39-48 | Received 30 Mar 2011, Accepted 29 Jun 2011, Published online: 09 Apr 2013

References

  • Bach, F.R., Lanckriet, G.R.G., Jordan, M.I., Multiple kernel learning, conic duality, and the SMO algorithm. Proceedings of the 21st International Conference on Machine Learning. pp41–48. USA (2004).
  • Bengio, S., Mariethoz, J., Learning the decision function for speaker verification. Proceedings of the ICASSP 2007. pp425–428. USA (2007).
  • Brummer, N., Preez, J., Application-independent evaluation of speaker detection. Computer Speech and Language. Volume 20, pp230–275. UK (2006).
  • Chakroborty, S., Saha, G., Improved text-independent speaker identification using fused MFCC and IMFCC feature sets based on Gaussian filter. International Journal of Signal Processing. Volume 5, No 1, pp11–19. USA (2009).
  • Cho, S.Y., Probabilistic Based Recursive Model for Adaptive Processing of Data Structure. Expert Systems with Applications. Volume 32, No 2, pp1403–1422. USA (2008).
  • Davis, S.B., Mermelstein, P., Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing. Volume 28, No 4, pp357–366. USA (1980).
  • Dehak, R., Dehak, N., Kenny, P., Dumouchel, P., Kernel combination for SVM speaker verification. Proceedings of The Speaker and Language Recognition Workshop. South Africa (2008).
  • Deng, H., Du, L., Wan, H., Combination of likelihood scores using linear and SVM approaches for text-independent speaker verification. Proceedings of the ICSP 04. pp2261–2264. China (2004).
  • Garcia-Romero, D., Fierrez-Aguilar, J., Gonzalez-Rodriguez, J., Ortega-Garcia, J., Support vector machine fusion of idiolectal and acoustic speaker information in Spanish conversational speech. Proceedings of the ICASSP2003. pp229–232. Hong Kong (2003).
  • Gutschaven, B., Verlinde, P., Multi-modal identity verification using Support Vector Machines (SVM). Proceedings of the 3rd International Conference on Information Fusion. pp THB3/3−THB3/8. France (2000).
  • Higgins, J.E., Dodd, T.J., Damper, R.I., Information fusion for subband-HMM speaker recognition. Proceedings of the IEEE IJCNN’01, Washington DC. pp1504–1509. USA (2001).
  • Holmes, J., Holmes, W., Speech Synthesis and Recognition. Taylor & Francis Group Publisher. USA (2003).
  • Hsu, C.W., Chang, C.C., Lin, C.J., A Practical Guide to Support Vector Classification. (2011). Retrieved from http://www.csie.ntu.edu.tw/˜cjlin/papers/guide/guide.pdf.
  • Hu, R., Damper, R.I., Fusion of two classifiers for speaker identification: removing and not removing silence. Proceedings of the 7th International Conference on Information Fusion. pp429–436. Sweden (2005).
  • Islam, T., Kabal, P., Partial-energy weighted interpolation of linear prediction coefficients. Proceedings of The ISSS Workshop on Automatic Speaker Recognition, Identification and Verification. pp105–107. USA (2000).
  • Jain, A., Ross, A., Multibiometric Systems. Communication of the ACM. Volume 47, No 1, pp34–40. USA (2004).
  • Jin, Q., Navratil, J., Reynolds, D.A., Campbell, J.P., Andrews, W.D., Abramson, J.S., Combining Cross-stream and Time Dimensions in Phonetic Speaker Recognition. Proceedings of the ICASSP2003. pp800–803. Hong Kong (2003).
  • Kajarekar, S.S., Four weightings and a fusion: a cepstral-SVM system for speaker recognition. Proceedings of the AUSR 2005. pp17–22. Australia (2005).
  • Kittler, J., Hatef, M., Duin, R.P., Matas, J.G., On combining classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence. Volume 20, No 3, pp226–239. USA (1998).
  • Lanckriet, G.R.G., Cristianini, N., Bartlett, P., Ghaoui, L.E., Jordan, M.I., Learning the kernel matrix with semidefinite programming. The Journal of Machine Learning Research. Volume 5, No 1, pp27–72. USA (2004).
  • Liu, D., Cho, S.Y., Sun, D.M., Qiu, Z.D., Biometrics: Theory, Application, and Issues. pp57–80. Nova Science Publisher. USA (2011).
  • Long, Y., Guo, W., Dai, L., Interfusing the confused region score of speaker verification systems. Proceedings of the International Symposium on Chinese Spoken Language Processing 2008. pp314–317. China (2008).
  • Longworth, C., Gales, M.J.F., Multiple kernel learning for speaker recognition. Proceedings of the ICASSP 2008. pp1581–1584. USA (2008).
  • Ma, Z., Yang, Y., Wu, Z., Further feature extraction for speaker recognition. Proceedings of the IEEE International Conference on Systems, Man and Cybernetics. pp4135–4138. USA (2003).
  • Memon, S., Lech, M., He, L., Using information theoretic vector quantisation for inverted MFCC based speaker verification. Proceedings of the 2nd International Conference on Computer, Control and Communication. pp1–5. Romania (2009).
  • Murty, S., Yegnanarayana, B., Combining evidence from residual phases and MFCC features for speaker recognition. IEEE Signal Processing Letters. Volume 13, No 1, pp52–55. USA (2006).
  • NIST, Speaker Recognition Evaluation Corpus 2001. (2001). Retrieved from http://www.itl.nist.gov/iad/mig/tests/spk/2001/.
  • Nosratighods, M., Thiruvaran, T., Epps, J., Ambikairajah, E., Ma, B., Li, H., Evaluation of a fused FM and cepstral-based speaker recognition system on the NIST 2008 SRE. Proceedings of the ICASSP2009. pp4233–4236. Taiwan (2009).
  • Rakotomamonjy, A., Bach, F.R., Canu, S., Grandvalet, Y., More efficiency in multiple kernel learning. Proceedings of the 24th International Conference on Machine Learning. pp775–782. USA (2007).
  • Rakotomamonjy, A., Bach, F.R., Canu, S., Grandvalet, Y., Simple MKL. The Journal of Machine Learning Research. Volume 9, No 11, pp2491–2521. USA (2008).
  • Ross, A., Jain, A., Information fusion in biometrics. Pattern Recognition Letters. Volume 24, No 13, pp2115–2125. Holland (2003).
  • Seyedin, S., Ahadi, M., Feature extraction based on DCT and MVDR spectral estimation for robust speech recognition. Proceedings of the 9th ICSP. pp605–608. China (2008).
  • Sonnenburg, S., Rätsch, G., Schäfer, C., Schölkopf, B., Large scale multiple kernel learning. The Journal of Machine Learning Research. Volume 7, No 7, pp1531–1565. USA (2006).
  • Tommi, S., Jaakkola, Haussler, D., Exploiting generative models in discriminative classifiers. Advances in Neural Information Processing Systems. Volume 11, pp487–493. MIT Press. USA (1998).
  • Vale, E.E., Alcaim, A., Adaptive weighting of subband-classifier responses for robust text-independent speaker recognition. Electronics Letters. Volume 44, No 21, pp1280–1282. UK (2008).
  • Vale, E.E., Cunha, A., Alcaim, A., Robust text-independent identification using multiple subband-classifiers in coloured noise environment. Proceedings of the 15th International Conference on Systems, Signals and Image Processing. pp275–278. Slovakia (2008).
  • Varchol, P., Levicky, D., Juhar, J., Multimodal biometric authentication using speech and hand geometry fusion. Proceedings of the 15th International Conference on Publication on Systems, Signals and Image Processing. pp57–60. Slovakia (2008).
  • Wang, L., Ohtsuka, S., Nakagawa, S., High improvement of speaker identification and verification by combing MFCC and phase information. Proceedings of the ICASSP2009. pp4529–4532. Taiwan (2009).
  • Wang, H.Q., Sun, F.C., Cai, Y.N., Chen, N., Ding, L.G., On Multiple Kernel Learning Methods. Acta Automatica Sinica. Volume 38, No 6, pp1037–1050. China (2010).
  • Wang, J., Wang, J., Speaker recognition using features derived from fractional fourier transform. Proceedings of the IEEE International Conference on Automatic Identification Advanced Technologies. pp95–100. China (2005).
  • Zheng, N., Lee, T., Ching, P., Integration of complementary acoustic features for speaker recognition. IEEE Signal Processing Letters. Volume 14, No 3, pp181–184. USA (2007).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.