Search in:

Advanced search

HKIE Transactions Volume 18, 2011 - Issue 4: The HKIE Outstanding Paper Award for Young Engineers/Researchers 2011

Journal homepage

Views

CrossRef citations to date

Altmetric

Shortlisted Papers

Using Multiple Level Fusion for Improving Performance of Speaker Recognition

Liu Di Beijing University of Posts and Telecommunications, The first author who is at the age of 35 or below on the closing date of submission

Cho Siu Yeung Division of Engineering, The University of Nottingham Ningbo

Sun Dongmei Institute of Information Science, Beijing Jiaotong University

Qiu Zhengding Institute of Information Science, Beijing Jiaotong University

Pages 39-48 | Received 30 Mar 2011, Accepted 29 Jun 2011, Published online: 09 Apr 2013

Cite this article
https://doi.org/10.1080/1023697X.2011.10668243

References
Citations
Metrics
Reprints & Permissions

References

Bach, F.R., Lanckriet, G.R.G., Jordan, M.I., Multiple kernel learning, conic duality, and the SMO algorithm. Proceedings of the 21st International Conference on Machine Learning. pp41–48. USA (2004).
Google Scholar
Bengio, S., Mariethoz, J., Learning the decision function for speaker verification. Proceedings of the ICASSP 2007. pp425–428. USA (2007).
Google Scholar
Brummer, N., Preez, J., Application-independent evaluation of speaker detection. Computer Speech and Language. Volume 20, pp230–275. UK (2006).
Google Scholar
Chakroborty, S., Saha, G., Improved text-independent speaker identification using fused MFCC and IMFCC feature sets based on Gaussian filter. International Journal of Signal Processing. Volume 5, No 1, pp11–19. USA (2009).
Google Scholar
Cho, S.Y., Probabilistic Based Recursive Model for Adaptive Processing of Data Structure. Expert Systems with Applications. Volume 32, No 2, pp1403–1422. USA (2008).
Google Scholar
Davis, S.B., Mermelstein, P., Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing. Volume 28, No 4, pp357–366. USA (1980).
Google Scholar
Dehak, R., Dehak, N., Kenny, P., Dumouchel, P., Kernel combination for SVM speaker verification. Proceedings of The Speaker and Language Recognition Workshop. South Africa (2008).
Google Scholar
Deng, H., Du, L., Wan, H., Combination of likelihood scores using linear and SVM approaches for text-independent speaker verification. Proceedings of the ICSP 04. pp2261–2264. China (2004).
Google Scholar
Garcia-Romero, D., Fierrez-Aguilar, J., Gonzalez-Rodriguez, J., Ortega-Garcia, J., Support vector machine fusion of idiolectal and acoustic speaker information in Spanish conversational speech. Proceedings of the ICASSP2003. pp229–232. Hong Kong (2003).
Google Scholar
Gutschaven, B., Verlinde, P., Multi-modal identity verification using Support Vector Machines (SVM). Proceedings of the 3rd International Conference on Information Fusion. pp THB3/3−THB3/8. France (2000).
Google Scholar
Higgins, J.E., Dodd, T.J., Damper, R.I., Information fusion for subband-HMM speaker recognition. Proceedings of the IEEE IJCNN’01, Washington DC. pp1504–1509. USA (2001).
Google Scholar
Holmes, J., Holmes, W., Speech Synthesis and Recognition. Taylor & Francis Group Publisher. USA (2003).
Google Scholar
Hsu, C.W., Chang, C.C., Lin, C.J., A Practical Guide to Support Vector Classification. (2011). Retrieved from http://www.csie.ntu.edu.tw/˜cjlin/papers/guide/guide.pdf.
Google Scholar
Hu, R., Damper, R.I., Fusion of two classifiers for speaker identification: removing and not removing silence. Proceedings of the 7th International Conference on Information Fusion. pp429–436. Sweden (2005).
Google Scholar
Islam, T., Kabal, P., Partial-energy weighted interpolation of linear prediction coefficients. Proceedings of The ISSS Workshop on Automatic Speaker Recognition, Identification and Verification. pp105–107. USA (2000).
Google Scholar
Jain, A., Ross, A., Multibiometric Systems. Communication of the ACM. Volume 47, No 1, pp34–40. USA (2004).
Google Scholar
Jin, Q., Navratil, J., Reynolds, D.A., Campbell, J.P., Andrews, W.D., Abramson, J.S., Combining Cross-stream and Time Dimensions in Phonetic Speaker Recognition. Proceedings of the ICASSP2003. pp800–803. Hong Kong (2003).
Google Scholar
Kajarekar, S.S., Four weightings and a fusion: a cepstral-SVM system for speaker recognition. Proceedings of the AUSR 2005. pp17–22. Australia (2005).
Google Scholar
Kittler, J., Hatef, M., Duin, R.P., Matas, J.G., On combining classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence. Volume 20, No 3, pp226–239. USA (1998).
Google Scholar
Lanckriet, G.R.G., Cristianini, N., Bartlett, P., Ghaoui, L.E., Jordan, M.I., Learning the kernel matrix with semidefinite programming. The Journal of Machine Learning Research. Volume 5, No 1, pp27–72. USA (2004).
Google Scholar
Liu, D., Cho, S.Y., Sun, D.M., Qiu, Z.D., Biometrics: Theory, Application, and Issues. pp57–80. Nova Science Publisher. USA (2011).
Google Scholar
Long, Y., Guo, W., Dai, L., Interfusing the confused region score of speaker verification systems. Proceedings of the International Symposium on Chinese Spoken Language Processing 2008. pp314–317. China (2008).
Google Scholar
Longworth, C., Gales, M.J.F., Multiple kernel learning for speaker recognition. Proceedings of the ICASSP 2008. pp1581–1584. USA (2008).
Google Scholar
Ma, Z., Yang, Y., Wu, Z., Further feature extraction for speaker recognition. Proceedings of the IEEE International Conference on Systems, Man and Cybernetics. pp4135–4138. USA (2003).
Google Scholar
Memon, S., Lech, M., He, L., Using information theoretic vector quantisation for inverted MFCC based speaker verification. Proceedings of the 2nd International Conference on Computer, Control and Communication. pp1–5. Romania (2009).
Google Scholar
Murty, S., Yegnanarayana, B., Combining evidence from residual phases and MFCC features for speaker recognition. IEEE Signal Processing Letters. Volume 13, No 1, pp52–55. USA (2006).
Google Scholar
NIST, Speaker Recognition Evaluation Corpus 2001. (2001). Retrieved from http://www.itl.nist.gov/iad/mig/tests/spk/2001/.
Google Scholar
Nosratighods, M., Thiruvaran, T., Epps, J., Ambikairajah, E., Ma, B., Li, H., Evaluation of a fused FM and cepstral-based speaker recognition system on the NIST 2008 SRE. Proceedings of the ICASSP2009. pp4233–4236. Taiwan (2009).
Google Scholar
Rakotomamonjy, A., Bach, F.R., Canu, S., Grandvalet, Y., More efficiency in multiple kernel learning. Proceedings of the 24th International Conference on Machine Learning. pp775–782. USA (2007).
Google Scholar
Rakotomamonjy, A., Bach, F.R., Canu, S., Grandvalet, Y., Simple MKL. The Journal of Machine Learning Research. Volume 9, No 11, pp2491–2521. USA (2008).
Google Scholar
Ross, A., Jain, A., Information fusion in biometrics. Pattern Recognition Letters. Volume 24, No 13, pp2115–2125. Holland (2003).
Google Scholar
Seyedin, S., Ahadi, M., Feature extraction based on DCT and MVDR spectral estimation for robust speech recognition. Proceedings of the 9th ICSP. pp605–608. China (2008).
Google Scholar
Sonnenburg, S., Rätsch, G., Schäfer, C., Schölkopf, B., Large scale multiple kernel learning. The Journal of Machine Learning Research. Volume 7, No 7, pp1531–1565. USA (2006).
Google Scholar
Tommi, S., Jaakkola, Haussler, D., Exploiting generative models in discriminative classifiers. Advances in Neural Information Processing Systems. Volume 11, pp487–493. MIT Press. USA (1998).
Google Scholar
Vale, E.E., Alcaim, A., Adaptive weighting of subband-classifier responses for robust text-independent speaker recognition. Electronics Letters. Volume 44, No 21, pp1280–1282. UK (2008).
Google Scholar
Vale, E.E., Cunha, A., Alcaim, A., Robust text-independent identification using multiple subband-classifiers in coloured noise environment. Proceedings of the 15th International Conference on Systems, Signals and Image Processing. pp275–278. Slovakia (2008).
Google Scholar
Varchol, P., Levicky, D., Juhar, J., Multimodal biometric authentication using speech and hand geometry fusion. Proceedings of the 15th International Conference on Publication on Systems, Signals and Image Processing. pp57–60. Slovakia (2008).
Google Scholar
Wang, L., Ohtsuka, S., Nakagawa, S., High improvement of speaker identification and verification by combing MFCC and phase information. Proceedings of the ICASSP2009. pp4529–4532. Taiwan (2009).
Google Scholar
Wang, H.Q., Sun, F.C., Cai, Y.N., Chen, N., Ding, L.G., On Multiple Kernel Learning Methods. Acta Automatica Sinica. Volume 38, No 6, pp1037–1050. China (2010).
Google Scholar
Wang, J., Wang, J., Speaker recognition using features derived from fractional fourier transform. Proceedings of the IEEE International Conference on Automatic Identification Advanced Technologies. pp95–100. China (2005).
Google Scholar
Zheng, N., Lee, T., Ching, P., Integration of complementary acoustic features for speaker recognition. IEEE Signal Processing Letters. Volume 14, No 3, pp181–184. USA (2007).
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Using Multiple Level Fusion for Improving Performance of Speaker Recognition

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

Using Multiple Level Fusion for Improving Performance of Speaker Recognition

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date