DocumentCode
3319006
Title
Comparative experiments to evaluate the use of a CHMM-based speaker identification engine for Arabic spontaneous speech
Author
Tolba, Hesham
Author_Institution
Electr. Eng. Dept., Taibah Univ., Al Madinah, Saudi Arabia
fYear
2009
fDate
8-11 Aug. 2009
Firstpage
241
Lastpage
245
Abstract
In this paper, we present a comparative study between two identification engines to identify speakers automatically from their voices when speaking spontaneously in Arabic. The first engine is based on the continuous hidden Markov models (CHMMs) while the second one is based on the artificial neural networks (ANNs). The Mel frequency cepstral coefficients (MFCCs) were selected to describe the speech signal. The general Gaussian density distribution HMM was developed for the CHMM-based engine. Elman network was developed for the ANN-based engine. A series of experiments to evaluate both engines have been carried out using a subset of an arabic database. The identification rate was found to be 100% for both engines during text dependent experiments. However, for text-independent experiments, the performance for the CHMM-based engine outperformed that of the ANN-based engine. The identification rates for the CHMM- and the ANN-based engines were found to be 80% and 50%, respectively.
Keywords
Gaussian distribution; cepstral analysis; hidden Markov models; natural language processing; neural nets; speaker recognition; Arabic spontaneous speech; CHMM-based speaker identification engine; Elman network; Gaussian density distribution; Mel frequency cepstral coefficient; artificial neural network; continuous hidden Markov model; Discrete wavelet transforms; Engines; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Robustness; Spatial databases; Speaker recognition; Speech analysis; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Technology, 2009. ICCSIT 2009. 2nd IEEE International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-4519-6
Electronic_ISBN
978-1-4244-4520-2
Type
conf
DOI
10.1109/ICCSIT.2009.5234955
Filename
5234955
Link To Document