DocumentCode
1693279
Title
Rapid adaptation for mobile speech applications
Author
Bacchiani, Michiel
Author_Institution
Google Inc., New York, NY, USA
fYear
2013
Firstpage
7903
Lastpage
7907
Abstract
We investigate the use of iVector-based rapid adaptation for recognition in mobile speech applications. We show that on this task, the proposed approach has two merits over a linear-transform based approach. First it provides larger error reductions (11% vs. 6%) as it is better suited for the short utterances and varied recording conditions. Second it omits the need for speaker data pooling and/or clustering and the very large infrastructure complexity that accompanies that. Empirical results show that although the proposed utterance-based training algorithm leads to large data fragmentation, the resulting model re-estimation performs well. Our implementation within the MapReduce framework allows processing of the large statistics that this approach gives rise to when applied on a database of thousands of hours.
Keywords
error statistics; estimation theory; mobile computing; speech recognition; transforms; MapReduce framework; data fragmentation; error reductions; iVector-based rapid adaptation; infrastructure complexity; linear-transform based approach; mobile speech recognition applications; model reestimation; recording conditions; speaker data pooling; utterance-based training algorithm; Abstracts; Adaptation models; Hidden Markov models; IP networks; Lead; Training; Eigenvoices; GMM; iVectors; rapid adaptation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639203
Filename
6639203
Link To Document