Title :
Trajectory clustering for automatic speech recognition
Author :
Yan Han ; de Veth, Johan ; Boves, Louis
Author_Institution :
Dept. of Language & Speech, Radboud Univ., Nijmegen, Netherlands
Abstract :
In this paper, we present an approach for automatic clustering of multi-dimensional dynamic trajectories corresponding to speech data that is based on Trajectory Clustering (TC). TC uses the Expectation Maximization algorithm (EM) for clustering with the mixtures of Multiple Linear Regression model. Since the initial values of the model parameters are critical to the clustering performance, a successive splitting algorithm was developed to incrementally increase the number of clusters. We define multipath HMM topologies using the trajectory clusters found. Based on the hypothesis that pronunciation variation in speech is more systematic at a unit level that is longer than a phone, we used modelling units defined in terms of Head-Body-Tail (HBT) models for connected digit recognition for the Dutch language. It appears that multi-path HMM topologies based on TC clusters outperform multi-path HMM topologies based on prior knowledge about speaker gender and speaking rate.
Keywords :
expectation-maximisation algorithm; hidden Markov models; natural languages; regression analysis; speaker recognition; Dutch language; HBT; automatic clustering; automatic speech recognition; digit recognition; expectation maximization algorithm; head-body-tail; hidden Markov models; multidimensional dynamic trajectory; multipath HMM topology; multiple linear regression; pronunciation variation; speaker gender; speaking rate; speech data; successive splitting algorithm; trajectory clustering; Acoustics; Hidden Markov models; Speech; Speech recognition; Training; Trajectory; Vectors;
Conference_Titel :
Signal Processing Conference, 2005 13th European
Conference_Location :
Antalya
Print_ISBN :
978-160-4238-21-1