Title :
Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation
Author :
Tonomura, Masahiro ; Kosaka, Testuo ; Matsunaga, Shoichi
Author_Institution :
ATR Interpreting Telecommun. Res. Labs., Kyoto, Japan
Abstract :
The paper proposes a novel speech adaptation algorithm that enables adaptation even with a small amount of speech data. This is a unified algorithm of two efficient conventional speaker adaptation techniques, which are maximum a posteriori (MAP) estimation and transfer vector field smoothing (VFS). This algorithm is designed to avoid the weaknesses of both MAP and VFS. A higher phoneme recognition performance was obtained by using this algorithm than with individual methods, showing the superiority of the proposed algorithm. The phoneme recognition error rate was reduced from 22.0% to 19.1% using this algorithm for a speaker-independent model with seven adaptation phrases. Furthermore, a priori knowledge concerning speaker characteristics was obtained for this algorithm by generating an initial HMM with the speech of a selected speaker cluster based on speaker similarity. The adaptation using this initial model reduced the phoneme recognition error rate from 22.0% to 17.7%
Keywords :
error statistics; hidden Markov models; maximum likelihood estimation; probability; smoothing methods; speech recognition; HMM; a priori knowledge; adaptation phrases; error rate; maximum a posteriori probability estimation; phoneme recognition performance; speaker adaptation; speaker cluster; speaker similarity; speaker-independent model; transfer vector field smoothing; Adaptation model; Algorithm design and analysis; Character generation; Clustering algorithms; Error analysis; Hidden Markov models; Parameter estimation; Smoothing methods; Speech recognition; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479787