• DocumentCode
    134261
  • Title

    Exploiting Variable length Teager Energy Operator in melcepstral features for person recognition from humming

  • Author

    Madhavi, Maulik C. ; Patil, Hemant A.

  • Author_Institution
    Dhirubhai Ambani Inst. of Inf. & Commun. Technol. (DA-IICT), Gandhinagar, India
  • fYear
    2014
  • fDate
    12-14 Sept. 2014
  • Firstpage
    624
  • Lastpage
    628
  • Abstract
    In this paper, we attempt voice biometrics problem using only humming signal rather than normal speech. This paper adapts a new feature extraction technique which exploits Variable length Teager Energy Operator (VTEO) onto subband filtered signal of Mel filterbank. This feature modifies structure of state-of-the-art feature set, viz., Mel Frequency Cepstral Coefficients (MFCC). In particular, a new energy measure, viz., VTEO is employed to compute subband energies of different time-domain subband signals. The features derived MFCCs to capture magnitude and phase spectrum information via VTEO are termed as MFCC-VTMP. Discriminatively-trained polynomial classifier of 2nd order approximations is used as the basis for all experiments. MFCC-VTMP feature set is found to be better than MFCC for various evaluation factors such as order of polynomial classifier, dimension of feature vector, signal degradation conditions and class separability. % EER of MFCC and MFCC-VTMP are found to be 12.20% and 12.01%, respectively using 2nd order polynomial classification.
  • Keywords
    biometrics (access control); cepstral analysis; channel bank filters; feature extraction; mathematical operators; polynomials; signal classification; speech recognition; 2nd order approximations; MFCC-VTMP feature set; VTEO; class separability; discriminatively-trained polynomial classifier; feature extraction technique; feature vector dimension; humming signal; mel cepstral features; mel filterbank; mel frequency cepstral coefficients; person recognition; phase spectrum information; polynomial classifier order; signal degradation conditions; subband filtered signal; time-domain subband signals; variable length teager energy operator; voice biometrics problem; Feature extraction; Mel frequency cepstral coefficient; Noise; Polynomials; Speech; Training; Vectors; Humming; MFCC-VTMP; Melcepstrum; VTEO; person recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2014.6936654
  • Filename
    6936654