• DocumentCode
    1882677
  • Title

    UBM-based incremental speaker adaptation

  • Author

    Wu, TingYao ; Lu, Lie ; Chen, Ke ; Zhang, Hong-Jiang

  • Author_Institution
    Center for Inf. Sci., Peking Univ., Beijing, China
  • Volume
    2
  • fYear
    2003
  • fDate
    6-9 July 2003
  • Abstract
    This paper addresses a novel algorithm of incremental speaker adaptation (ISA) based on universal background model (UBM) for saving storage and real-time processing. This algorithm can be seen as an extension of traditional speaker adaptation. It consists of two steps, adaptation and combination. It not only considers the speaker´s characteristics in limited training data, but also prohibits over-fitting of the updated model. The incremental adaptation algorithm needs little storage and meets the requirement of real-time processing. In order to evaluate the efficiency and effectivity of the proposed approach, a real-time speaker segmentation system for broadcasting news is built. Experiment results demonstrate that our approach yields real time operation and achieves satisfactory performance.
  • Keywords
    audio databases; broadcasting; speaker recognition; speech processing; broadcasting news; incremental speaker adaptation; real-time processing; real-time speaker segmentation system; training data; universal background model; Asia; Broadcasting; Information science; Instruction sets; Maximum likelihood linear regression; Real time systems; Speaker recognition; Speech; Training data; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
  • Print_ISBN
    0-7803-7965-9
  • Type

    conf

  • DOI
    10.1109/ICME.2003.1221718
  • Filename
    1221718