DocumentCode :
1882677
Title :
UBM-based incremental speaker adaptation
Author :
Wu, TingYao ; Lu, Lie ; Chen, Ke ; Zhang, Hong-Jiang
Author_Institution :
Center for Inf. Sci., Peking Univ., Beijing, China
Volume :
2
fYear :
2003
fDate :
6-9 July 2003
Abstract :
This paper presents a novel incremental speaker adaptation (ISA) algorithm based on a universal background model (UBM), designed to reduce storage and support real-time processing. The algorithm can be seen as an extension of traditional speaker adaptation and consists of two steps: adaptation and combination. It not only captures the speaker's characteristics from limited training data but also prevents over-fitting of the updated model. The incremental adaptation algorithm needs little storage and meets the requirements of real-time processing. To evaluate the efficiency and effectiveness of the proposed approach, a real-time speaker segmentation system for broadcast news is built. Experimental results demonstrate that the approach runs in real time and achieves satisfactory performance.
Keywords :
audio databases; broadcasting; speaker recognition; speech processing; broadcasting news; incremental speaker adaptation; real-time processing; real-time speaker segmentation system; training data; universal background model; Asia; Broadcasting; Information science; Instruction sets; Maximum likelihood linear regression; Real time systems; Speaker recognition; Speech; Training data; Vector quantization;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
Type :
conf
DOI :
10.1109/ICME.2003.1221718
Filename :
1221718