مرکز منطقه ای اطلاع رساني علوم و فناوري - Rapid adaptation for mobile speech applications

DocumentCode :

1693279

Title :

Rapid adaptation for mobile speech applications

Author :

Bacchiani, Michiel

Author_Institution :

Google Inc., New York, NY, USA

fYear :

2013

Firstpage :

7903

Lastpage :

7907

Abstract :

We investigate the use of iVector-based rapid adaptation for recognition in mobile speech applications. We show that on this task, the proposed approach has two merits over a linear-transform based approach. First it provides larger error reductions (11% vs. 6%) as it is better suited for the short utterances and varied recording conditions. Second it omits the need for speaker data pooling and/or clustering and the very large infrastructure complexity that accompanies that. Empirical results show that although the proposed utterance-based training algorithm leads to large data fragmentation, the resulting model re-estimation performs well. Our implementation within the MapReduce framework allows processing of the large statistics that this approach gives rise to when applied on a database of thousands of hours.

Keywords :

error statistics; estimation theory; mobile computing; speech recognition; transforms; MapReduce framework; data fragmentation; error reductions; iVector-based rapid adaptation; infrastructure complexity; linear-transform based approach; mobile speech recognition applications; model reestimation; recording conditions; speaker data pooling; utterance-based training algorithm; Abstracts; Adaptation models; Hidden Markov models; IP networks; Lead; Training; Eigenvoices; GMM; iVectors; rapid adaptation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location :

Vancouver, BC

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2013.6639203

Filename :

6639203

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1693279