DocumentCode :
1693279
Title :
Rapid adaptation for mobile speech applications
Author :
Bacchiani, Michiel
Author_Institution :
Google Inc., New York, NY, USA
fYear :
2013
Firstpage :
7903
Lastpage :
7907
Abstract :
We investigate the use of iVector-based rapid adaptation for recognition in mobile speech applications. We show that on this task, the proposed approach has two merits over a linear-transform based approach. First it provides larger error reductions (11% vs. 6%) as it is better suited for the short utterances and varied recording conditions. Second it omits the need for speaker data pooling and/or clustering and the very large infrastructure complexity that accompanies that. Empirical results show that although the proposed utterance-based training algorithm leads to large data fragmentation, the resulting model re-estimation performs well. Our implementation within the MapReduce framework allows processing of the large statistics that this approach gives rise to when applied on a database of thousands of hours.
Keywords :
error statistics; estimation theory; mobile computing; speech recognition; transforms; MapReduce framework; data fragmentation; error reductions; iVector-based rapid adaptation; infrastructure complexity; linear-transform based approach; mobile speech recognition applications; model reestimation; recording conditions; speaker data pooling; utterance-based training algorithm; Abstracts; Adaptation models; Hidden Markov models; IP networks; Lead; Training; Eigenvoices; GMM; iVectors; rapid adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639203
Filename :
6639203
Link To Document :
بازگشت