Using Multiple Level Fusion for Improving Performance of Speaker Recognition

Liu Di Beijing University of Posts and Telecommunications, The first author who is at the age of 35 or below on the closing date of submission

Cho Siu Yeung Division of Engineering, The University of Nottingham Ningbo

Sun Dongmei Institute of Information Science, Beijing Jiaotong University

Qiu Zhengding Institute of Information Science, Beijing Jiaotong University

Abstract

In this paper, a multiple level fusion framework to apply into the automatic speaker recognition system in order to improve its performance is presented. Based on the framework, different multiple level fusion methods, such as a strong multiple level fusion and three weak multiple level fusions, are defined in this paper. To examine the effectiveness of the proposed framework, two-feature combination scheme would be considered. After investigating the availability of strong and weak multiple level fusions for this scheme, the framework adopts a weak multiple level fusion method which combines two level fusions, ie matching-score fusion and decisionmaking fusion. In the matching-score level, a commonly used method called the score vector fusion is adopted. In the decision-making level, the kernel combination, also known as Multiple Kernel Learning is chosen. These two techniques can be embedded into many automatic speaker recognition systems. Throughout the evaluation by NIST 2001 corpus, two sets of experiments were conducted that the results of the two-feature combination scheme by the multiple level fusions are better than the traditional matching-score level fusion and unimodal methods. It is demonstrated that the multiple level fusion framework is an effective method to fuse the features for speaker recognition applications.

Keywords:

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Using Multiple Level Fusion for Improving Performance of Speaker Recognition

Information for

Open access

Opportunities

Help and information

Using Multiple Level Fusion for Improving Performance of Speaker Recognition

Abstract

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature