DocumentCode :
3070872
Title :
Speaker Accent Classification Using Distance Metric Learning Approach
Author :
Ullah, Sameeh ; Karray, Fakhri
Author_Institution :
Univ. of Waterloo, Waterloo
fYear :
2007
fDate :
15-18 Dec. 2007
Firstpage :
900
Lastpage :
905
Abstract :
A speaker´s accent is the most important factor affecting the performance of automatic speech recognition (ASR) systems because accents vary widely, even within the same country or community. This variation is due to the fact that when non- native speakers start to learn a second language, the substitution of native language phoneme pronunciation is a common process. Such substitution leads to fuzziness between the phoneme boundaries and phoneme classes. This fuzziness reduces out-of class variations and increases the similarities between the different sets of phonemes. In this paper, a new method is proposed based on the side information from dissimilar pairs of accent groups, to transfer data points to a new space where the Euclidian distances between similar and dissimilar points become minimum and maximum, respectively.
Keywords :
Gaussian processes; learning (artificial intelligence); optimisation; speaker recognition; ASR systems; Euclidian distances; Gaussian mixture model; automatic speech recognition; distance metric learning approach; native language phoneme pronunciation; nonnative speakers; optimization procedure; speaker accent classification; Anatomy; Automatic speech recognition; Hidden Markov models; Humans; Information technology; Natural languages; Routing; Signal processing; Speech processing; Topology; Automatic Speech Recognition system; phoneme; phoneme classes; speaker´s accent;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Information Technology, 2007 IEEE International Symposium on
Conference_Location :
Giza
Print_ISBN :
978-1-4244-1835-0
Electronic_ISBN :
978-1-4244-1835-0
Type :
conf
DOI :
10.1109/ISSPIT.2007.4458152
Filename :
4458152
Link To Document :
بازگشت