Title :
Speaker Accent Classification Using Distance Metric Learning Approach
Author :
Ullah, Sameeh ; Karray, Fakhri
Author_Institution :
Univ. of Waterloo, Waterloo
Abstract :
A speaker´s accent is the most important factor affecting the performance of automatic speech recognition (ASR) systems because accents vary widely, even within the same country or community. This variation is due to the fact that when non- native speakers start to learn a second language, the substitution of native language phoneme pronunciation is a common process. Such substitution leads to fuzziness between the phoneme boundaries and phoneme classes. This fuzziness reduces out-of class variations and increases the similarities between the different sets of phonemes. In this paper, a new method is proposed based on the side information from dissimilar pairs of accent groups, to transfer data points to a new space where the Euclidian distances between similar and dissimilar points become minimum and maximum, respectively.
Keywords :
Gaussian processes; learning (artificial intelligence); optimisation; speaker recognition; ASR systems; Euclidian distances; Gaussian mixture model; automatic speech recognition; distance metric learning approach; native language phoneme pronunciation; nonnative speakers; optimization procedure; speaker accent classification; Anatomy; Automatic speech recognition; Hidden Markov models; Humans; Information technology; Natural languages; Routing; Signal processing; Speech processing; Topology; Automatic Speech Recognition system; phoneme; phoneme classes; speaker´s accent;
Conference_Titel :
Signal Processing and Information Technology, 2007 IEEE International Symposium on
Conference_Location :
Giza
Print_ISBN :
978-1-4244-1835-0
Electronic_ISBN :
978-1-4244-1835-0
DOI :
10.1109/ISSPIT.2007.4458152