A New Segmentation Algorithm Combined with Transient Frames Power for Text Independent Speaker Verification

Author

Saeidi, Rahim ; Mohammadi, H.R.S. ; Rodman, R.D. ; Kinnunen, Tomi

Author_Institution

Res. Center for Intelligent Signal Process., Tehran, Iran

Volume

4

fYear

2007

fDate

15-20 April 2007

Abstract

In this paper we propose a new segmentation algorithm, delta MFCC based speech segmentation (DMFCC-SS), with application to speaker recognition systems. We show that DMFCC-SS can separate regions of speech that yield similar likelihood scores under models such as a Gaussian mixture model (GMM), and can therefore be used to identify the regions of speech between two transitional states in a speech signal. By combining this segmentation algorithm with the discriminative power of transient frames in speaker recognition, we investigate the tradeoff between the speed-up rates obtained with DMFCC-SS and the speaker verification equal error rates obtained from representatives of each segment. We use a universal background model Gaussian mixture model (UBM-GMM) as the baseline system. The proposed speed-up algorithm, working in the pre-processing stage, performs well while having no computational load compared to the main GMM system. Experimental results show the superior performance of this pre-processing method in comparison with other algorithms working in the pre-processing stage of a UBM-GMM system.

Keywords

Gaussian processes; speech processing; speech recognition; Gaussian mixture model; delta MFCC based speech segmentation; segmentation algorithm; speaker recognition systems; speech signal; text independent speaker verification; transient frames power; universal background model; Application software; Cepstral analysis; Computer science; Error analysis; Mel frequency cepstral coefficient; Power system modeling; Signal processing; Signal processing algorithms; Speaker recognition; UBM-GMM; speech segmentation; speed-up; transient frames

fLanguage

English

Publisher

IEEE

Conference_Titel

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

Conference_Location

Honolulu, HI

ISSN

1520-6149

Print_ISBN

1-4244-0727-3

Type

conf

DOI

10.1109/ICASSP.2007.366910

Filename

4218098