• DocumentCode
    3529449
  • Title

    Unsupervisec cross-validation adaptation algorithms for improved adaptation performance

  • Author

    Shinozaki, Takahiro ; Kubota, Yu ; Furui, Sadaoki

  • Author_Institution
    Dept. of Comput. Sci., Tokyo Inst. of Technol., Tokyo
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4377
  • Lastpage
    4380
  • Abstract
    An unsupervised cross-validation adaptation algorithm and its variation are proposed that introduce the idea of cross-validation in the unsupervised batch-mode adaptation framework to improve the adaptation performance. The first algorithm is constructed on a general adaptation technique such as MLLR and can be used in combination with any adaptation method. The second algorithm is a modified version of the first algorithm and works with lower computational cost by assuming MLLR. These algorithms are extensions of our previously proposed CV training methods and are useful to suppress the negative effect of the conventional unsupervised batch-mode adaptation process that reinforces the errors included in automatic transcriptions. The proposed algorithms were evaluated in domain adaptation, speaker adaptation, and in their combination for large vocabulary spontaneous speech recognition. When the domain and speaker adaptations were combined using a read speech initial model, the relative word error rate reduction by the proposed method was 29% whereas the reduction by the conventional approach was 23%.
  • Keywords
    maximum likelihood estimation; regression analysis; speaker recognition; vocabulary; CV training; adaptation performance; domain adaptation; maximum likelihood linear regression; read speech initial model; speaker adaptation; unsupervised batch-mode adaptation; unsupervised cross-validation adaptation; vocabulary spontaneous speech recognition; word error rate reduction; Computational efficiency; Context modeling; Error correction; Iterative algorithms; Iterative decoding; Maximum likelihood linear regression; Parameter estimation; Speech analysis; Speech recognition; Vocabulary; MAP; MLLR; Unsupervised adaptation; computational cost; cross-validation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960599
  • Filename
    4960599