DocumentCode :
2620128
Title :
Text-independent distributed speaker identification and verification using GMM-UBM speaker models for mobile communications
Author :
Chowdhury, Md Fozur Rahman ; Selouani, Sid-Ahmed ; O´Shaughnessy, Douglas
Author_Institution :
INRS - EMT, Univ. du Quebec, Montréal, QC, Canada
fYear :
2010
fDate :
10-13 May 2010
Firstpage :
57
Lastpage :
60
Abstract :
This paper presents the simulation results of a speaker identification and verification (SIDV) system that would be efficient for resource limited mobile devices. The proposed system works as a text-independent system within the distributed speech recognition (DSR) framework and is designed to identify a target speaker or imposter using short digit utterances rather than long utterances. In this distributed SIDV (DSIDV), the target speaker model is developed by using the most popular generative system called a GMM-UBM system. A Gaussian Mixture Model (GMM) for each true speaker is derived from the Universal Background Model (UBM) by using Bayesian maximum a posteriori (MAP) adaptation. The objective of this paper is to show how speaker recognition and verification over telephone channels can be done using short speeches and DSR technology robust to channel distortions. The ETSI Aurora2 speech corpus was tested in these experiments. The experimental results show that the proposed DSIDV system yields excellent identification and detection performances in a ETSI DSR evaluation task and would be suitable for small hand held mobile devices.
Keywords :
Bayes methods; Gaussian processes; maximum likelihood estimation; mobile communication; speaker recognition; Bayesian maximum a posteriori adaptation; DSR framework; ETSI Aurora2 speech corpus; ETSI DSR evaluation task; GMM; GMM-UBM speaker models; Gaussian mixture model; MAP adaptation; UBM; channel distortions; distributed SIDV system; distributed speech recognition; mobile communications; resource limited mobile devices; small handheld mobile devices; speaker identification and verification system; target speaker identification; telephone channels; text-independent distributed speaker identification; universal background model; Data models; Databases; Mobile communication; Robustness; Distributed speaker identification; Gaussian mixture model; Universal background model; distributed speaker verification; text independent;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Sciences Signal Processing and their Applications (ISSPA), 2010 10th International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-7165-2
Type :
conf
DOI :
10.1109/ISSPA.2010.5605556
Filename :
5605556
Link To Document :
بازگشت