Regional Information Center for Science and Technology (RICeST) - An investigation of sub-band FM feature extraction in speaker recognition

DocumentCode :

484757

Title :

An investigation of sub-band FM feature extraction in speaker recognition

Author :

Thiruvaran, T. ; Epps, J. ; Ambikairajah, E. ; Jones, E.

Author_Institution :

Sch. of Electr. Eng. & Telecommun., Univ. of New South Wales, Sydney, NSW

fYear :

2008

fDate :

18-19 June 2008

Firstpage :

Lastpage :

Abstract :

Following recent evidence that FM features extracted from a sub-band decomposition of speech are highly uncorrelated, this paper investigates the effect of the number of auditory scale sub-bands in FM based front-end processing. For this study, a newly developed robust FM extraction method based on the least square differential ratio is used to extract features, comprising one FM component per sub-band. Automatic speaker recognition experiments were conducted on the cellular NIST 2001 database, with the number of filters in the front-end varied from 6 to 26. Performance degradation was observed for very low numbers of filters and very high numbers of filters. Results show that for a 4 kHz speech bandwidth, a minimum of 10 and a maximum of 18 sub-bands is a suitable choice for speech front-end applications such as automatic speaker recognition.

Keywords :

feature extraction; frequency modulation; speaker recognition; FM based front-end processing; auditory scale sub-bands; automatic speaker recognition; least square differential ratio; speech subband decomposition; subband FM feature extraction; Mel scale; filter bank

fLanguage :

English

Publisher :

IET

Conference_Titel :

Signals and Systems Conference, 2008. (ISSC 2008). IET Irish

Conference_Location :

Galway

ISSN :

0537-9989

Print_ISBN :

978-0-86341-931-7

Type :

conf

Filename :

4780925

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=484757