• DocumentCode
    2701461
  • Title

    A New Segmentation Algorithm Combined with Transient Frames Power for Text Independent Speaker Verification

  • Author

    Saeidi, Rahim ; Mohammadi, H.R.S. ; Rodman, R.D. ; Kinnunen, Tomi

  • Author_Institution
    Res. Center for Intelligent Signal Process., Tehran, Iran
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    In this paper we propose a new segmentation algorithm called delta MFCC based speech segmentation (DMFCC-SS), with application to speaker recognition systems. We show that DMFCC-SS can separate the regions of speech that result from similar likelihood scores using models such as a Gaussian mixture model (GMM), and can therefore be used to identify the regions of speech between two transitional states in a speech signal. By combining this segmentation algorithm with the discriminative power of transient frames in speaker recognition, we can investigate the tradeoff in speed-up rates that result from DMFCC-SS, with speaker verification equal error rates that result from representatives of each segment. We use a universal background model Gaussian mixture model (UBM-GMM) as a baseline system. The proposed speed-up algorithm, working in the pre-processing stage, performs well while having no computational load compared to the main GMM system. Experimental results show the superior performance of this pre-processing method in comparison with other algorithms working in the pre-processing stage of a UBM-GMM system.
  • Keywords
    Gaussian processes; speech processing; speech recognition; Gaussian mixture model; delta MFCC based speech segmentation; segmentation algorithm; speaker recognition systems; speech signal; text independent speaker verification; transient frames power; universal background model; Application software; Cepstral analysis; Computer science; Error analysis; Mel frequency cepstral coefficient; Power system modeling; Signal processing; Signal processing algorithms; Speaker recognition; Speech processing; Speaker recognition; UBM-GMM; speech segmentation; speed-up; transient frames;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366910
  • Filename
    4218098