Title :
Speaker adaptation by variable reference model subspace and application to large vocabulary speech recognition
Author :
Teng, Wen Xuan ; Gravier, Guillaume ; Bimbot, Frederic ; Soufflet, Férdéric
Author_Institution :
TELISMA
Abstract :
Recently, we presented a rapid speaker adaptation technique, reference model interpolation (RMI), which is based on the linear interpolation of speaker-dependent models and the a posteriori selection of reference models. The approach uses the a priori knowledge provided by a set of representative speakers to guide the estimation of a new speaker model in the speaker space. RMI achieved rapid supervised adaptation in phoneme decoding tasks. In this paper, we present two new results of RMI: firstly, we apply the RMI technique in a practical large vocabulary continuous speech recognition (LVCSR) system with unsupervised instantaneous adaptation. Secondly, we propose an evolutional subspace scenario which integrates the slow update of reference models with RMI rapid adaptation to achieve incremental adaptation. The unsupervised adaptation experiments carried out on broadcast news transcription task show encouraging results for both instantaneous and incremental adapatation.
Keywords :
decoding; interpolation; speaker recognition; vocabulary; linear interpolation; phoneme decoding; posteriori selection; speaker adaptation technique; speaker-dependent model; variable reference model interpolation; vocabulary continuous speech recognition; Speech recognition; Vocabulary; LVCSR; reference models; speaker adaptation;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960600