• DocumentCode
    294655
  • Title

    Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation

  • Author

    Tonomura, Masahiro ; Kosaka, Testuo ; Matsunaga, Shoichi

  • Author_Institution
    ATR Interpreting Telecommun. Res. Labs., Kyoto, Japan
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    688
  • Abstract
    The paper proposes a novel speech adaptation algorithm that enables adaptation even with a small amount of speech data. This is a unified algorithm of two efficient conventional speaker adaptation techniques, which are maximum a posteriori (MAP) estimation and transfer vector field smoothing (VFS). This algorithm is designed to avoid the weaknesses of both MAP and VFS. A higher phoneme recognition performance was obtained by using this algorithm than with individual methods, showing the superiority of the proposed algorithm. The phoneme recognition error rate was reduced from 22.0% to 19.1% using this algorithm for a speaker-independent model with seven adaptation phrases. Furthermore, a priori knowledge concerning speaker characteristics was obtained for this algorithm by generating an initial HMM with the speech of a selected speaker cluster based on speaker similarity. The adaptation using this initial model reduced the phoneme recognition error rate from 22.0% to 17.7%
  • Keywords
    error statistics; hidden Markov models; maximum likelihood estimation; probability; smoothing methods; speech recognition; HMM; a priori knowledge; adaptation phrases; error rate; maximum a posteriori probability estimation; phoneme recognition performance; speaker adaptation; speaker cluster; speaker similarity; speaker-independent model; transfer vector field smoothing; Adaptation model; Algorithm design and analysis; Character generation; Clustering algorithms; Error analysis; Hidden Markov models; Parameter estimation; Smoothing methods; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479787
  • Filename
    479787