• DocumentCode
    1693279
  • Title

    Rapid adaptation for mobile speech applications

  • Author

    Bacchiani, Michiel

  • Author_Institution
    Google Inc., New York, NY, USA
  • fYear
    2013
  • Firstpage
    7903
  • Lastpage
    7907
  • Abstract
    We investigate the use of iVector-based rapid adaptation for recognition in mobile speech applications. We show that on this task, the proposed approach has two merits over a linear-transform based approach. First it provides larger error reductions (11% vs. 6%) as it is better suited for the short utterances and varied recording conditions. Second it omits the need for speaker data pooling and/or clustering and the very large infrastructure complexity that accompanies that. Empirical results show that although the proposed utterance-based training algorithm leads to large data fragmentation, the resulting model re-estimation performs well. Our implementation within the MapReduce framework allows processing of the large statistics that this approach gives rise to when applied on a database of thousands of hours.
  • Keywords
    error statistics; estimation theory; mobile computing; speech recognition; transforms; MapReduce framework; data fragmentation; error reductions; iVector-based rapid adaptation; infrastructure complexity; linear-transform based approach; mobile speech recognition applications; model reestimation; recording conditions; speaker data pooling; utterance-based training algorithm; Abstracts; Adaptation models; Hidden Markov models; IP networks; Lead; Training; Eigenvoices; GMM; iVectors; rapid adaptation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639203
  • Filename
    6639203