• DocumentCode
    2426492
  • Title

    Different aspects of source information for limited data speaker verification

  • Author

    Das, Rohan Kumar ; Pati, Debadatta ; Mahadeva Prasanna, S.R.

  • Author_Institution
    Dept. of Electron. & Electr. Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
  • fYear
    2015
  • fDate
    Feb. 27 2015-March 1 2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Limited data speaker verification has shown its significance in practical system oriented applications. The paper shows the importance of different aspects of voice source feature for limited test data scenario. A baseline speaker verification system using conventional mel frequency cepstral co-efficients (MFCC) feature is developed and performance under limited test data condition (≤10 s) is evaluated. A parallel system based on source feature mel power difference of spectrum in subband (M-PDSS) is developed in the i-vector based speaker verification framework. Both the systems were fused at the score level for the cases of short segments of test speech, which demonstrated the importance of source feature with reduction in test data duration. A comparative study of the M-PDSS feature is then made with our earlier work using discrete cosine transform of the integrated linear prediction residual (DCTILPR) feature and then fusion of two source features M-PDSS and DCTILPR along with MFCC features is carried out. An absolute improvement of 5.19% is obtained for 2 s of test data which conveys the significance of multiple source information under limited data speaker verification as it carries different aspects of source information.
  • Keywords
    cepstral analysis; speaker recognition; DCTILPR feature; M-PDSS feature; MFCC feature; baseline speaker verification system; conventional mel frequency cepstral co-efficients feature; discrete cosine transform of integrated linear prediction residual feature; limited data speaker verification; multiple source information; practical system oriented applications; test data condition; voice source feature; Decision support systems; Dynamic range; Feature extraction; Handheld computers; Market research; Mel frequency cepstral coefficient; NIST; DCTILPR; M-PDSS; MFCC; short utterances; source features; speaker verification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications (NCC), 2015 Twenty First National Conference on
  • Conference_Location
    Mumbai
  • Type

    conf

  • DOI
    10.1109/NCC.2015.7084846
  • Filename
    7084846