Combining evidence from residual phase and MFCC features for speaker recognition

Dimensions

Murty, K. S. R. ; Yegnanarayana, B. (2006) Combining evidence from residual phase and MFCC features for speaker recognition IEEE Signal Processing Letters, 13 (1). pp. 52-55. ISSN 1070-9908

Full text not available from this repository.

Official URL: http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arn...

Related URL: http://dx.doi.org/10.1109/LSP.2005.860538

Abstract

The objective of this letter is to demonstrate the complementary nature of speaker-specific information present in the residual phase in comparison with the information present in the conventional mel-frequency cepstral coefficients (MFCCs). The residual phase is derived from speech signal by linear prediction analysis. Speaker recognition studies are conducted on the NIST-2003 database using the proposed residual phase and the existing MFCC features. The speaker recognition system based on the residual phase gives an equal error rate (EER) of 22%, and the system using the MFCC features gives an EER of 14%. By combining the evidence from both the residual phase and the MFCC features, an EER of 10.5% is obtained, indicating that speaker-specific excitation information is present in the residual phase. This information is useful since it is complementary to that of MFCCs.

Item Type:	Article
Source:	Copyright of this article belongs to IEEE.
ID Code:	57755
Deposited On:	29 Aug 2011 11:58
Last Modified:	29 Aug 2011 11:58

Repository Staff Only: item control page