DocumentCode :
2082667
Title :
Significance of magnitude and phase information via VTEO for humming based biometrics
Author :
Patil, Hemant A. ; Madhavi, Maulik C.
Author_Institution :
Dhirubhai Ambani Inst. of Inf. & Commun. Technol., Gandhinagar, India
fYear :
2012
fDate :
March 29 2012-April 1 2012
Firstpage :
372
Lastpage :
377
Abstract :
In this paper, recognition of persons is attempted from their hum. This kind of application can be useful to design humming-based biometrics system or person-dependent Query-by-Humming (QBH) system and hence play an important role in music information retrieval (MIR) system. This paper develops a new feature extraction technique to exploit phase spectrum information along with magnitude spectrum information from hum signal. In particular, structure of state-of-the-art feature set, viz., Mel Frequency Cepstral Coefficients (MFCC) is modified to capture the phase spectrum information. In addition, a new energy measure, viz., Variable length Teager Energy Operator (VTEO) is employed to compute subband energies of different time-domain subband signals (i.e., output of 24 triangular shaped filters used in Mel filterbank). Discriminatively-trained polynomial classifier of 2nd order approximations are used as the basis for recognition experiment.
Keywords :
approximation theory; biometrics (access control); cepstral analysis; channel bank filters; feature extraction; information retrieval systems; music; query processing; signal classification; speech recognition; MFCC; MIR system; QBH system; VTEO; discriminatively-trained polynomial classifier; energy measure; feature extraction technique; humming-based biometrics system; magnitude information; magnitude spectrum information; mel filterbank; mel frequency cepstral coefficients; music information retrieval system; person recognition; person-dependent query-by-humming system; phase information; phase spectrum information; time-domain subband signals; triangular shaped filters; variable length teager energy operator; Feature extraction; Filter banks; Frequency domain analysis; Information filters; Mel frequency cepstral coefficient; Speech; Time domain analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Biometrics (ICB), 2012 5th IAPR International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4673-0396-5
Electronic_ISBN :
978-1-4673-0397-2
Type :
conf
DOI :
10.1109/ICB.2012.6199779
Filename :
6199779
Link To Document :
بازگشت