DocumentCode :
1690030
Title :
Accent recognition using i-vector, Gaussian Mean Supervector and Gaussian posterior probability supervector for spontaneous telephone speech
Author :
Bahari, Mohamad Hasan ; Saeidi, Rahim ; Van hamme, Hugo ; Van Leeuwen, David
Author_Institution :
Center for Process. Speech & Images, KU Leuven, Leuven, Belgium
fYear :
2013
Firstpage :
7344
Lastpage :
7348
Abstract :
In this paper, three utterance modelling approaches, namely Gaussian Mean Supervector (GMS), i-vector and Gaussian Posterior Probability Supervector (GPPS), are applied to the accent recognition problem. For each utterance modeling method, three different classifiers, namely the Support Vector Machine (SVM), the Naive Bayesian Classifier (NBC) and the Sparse Representation Classifier (SRC), are employed to find out suitable matches between the utterance modelling schemes and the classifiers. The evaluation database is formed by using English utterances of speakers whose native languages are Russian, Hindi, American English, Thai, Vietnamese and Cantonese. These utterances are drawn from the National Institute of Standards and Technology (NIST) 2008 Speaker Recognition Evaluation (SRE) database. The study results show that GPPS and i-vector are more effective than GMS in this accent recognition task. It is also concluded that among the employed classifiers, the best matches for i-vector and GPPS are SVM and SRC, respectively.
Keywords :
Bayes methods; Gaussian processes; natural language processing; signal classification; signal representation; speaker recognition; speech processing; support vector machines; American English language; Cantonese language; English utterances; GMS; GPPS; Gaussian mean supervector; Gaussian posterior probability supervector; Hindi language; NBC; NIST 2008 speaker recognition evaluation database; National Institute of Standards and Technology; Russian language; SRC; SRE database; SVM; Thai language; Vietnamese language; accent recognition task; i-vector; naive Bayesian classifier; native languages; sparse representation classifier; spontaneous telephone speech; support vector machine; utterance modeling method; utterance modelling schemes; Acoustics; Databases; NIST; Speech; Speech recognition; Support vector machines; Vectors; Accent Recognition; Gaussian Mean Supervector; Gaussian Posterior Probability Supervector; i-vector;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639089
Filename :
6639089
Link To Document :
بازگشت