Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation

Author

Tonomura, Masahiro ; Kosaka, Testuo ; Matsunaga, Shoichi

Author_Institution

ATR Interpreting Telecommun. Res. Labs., Kyoto, Japan

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

688

Abstract

The paper proposes a novel speech adaptation algorithm that enables adaptation even with a small amount of speech data. This is a unified algorithm of two efficient conventional speaker adaptation techniques, which are maximum a posteriori (MAP) estimation and transfer vector field smoothing (VFS). This algorithm is designed to avoid the weaknesses of both MAP and VFS. A higher phoneme recognition performance was obtained by using this algorithm than with individual methods, showing the superiority of the proposed algorithm. The phoneme recognition error rate was reduced from 22.0% to 19.1% using this algorithm for a speaker-independent model with seven adaptation phrases. Furthermore, a priori knowledge concerning speaker characteristics was obtained for this algorithm by generating an initial HMM with the speech of a selected speaker cluster based on speaker similarity. The adaptation using this initial model reduced the phoneme recognition error rate from 22.0% to 17.7%

Keywords

error statistics; hidden Markov models; maximum likelihood estimation; probability; smoothing methods; speech recognition; HMM; a priori knowledge; adaptation phrases; error rate; maximum a posteriori probability estimation; phoneme recognition performance; speaker adaptation; speaker cluster; speaker similarity; speaker-independent model; transfer vector field smoothing; Adaptation model; Algorithm design and analysis; Character generation; Clustering algorithms; Error analysis; Hidden Markov models; Parameter estimation; Smoothing methods; Speech recognition; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479787

Filename

479787