• DocumentCode
    1793218
  • Title

    Randomization effect on iterative-based speaker diarization system for telephone conversations

  • Author

    Furmanov, Tal ; Aminov, Lidiya ; Moyal, Ami ; Lapidot, Itshak

  • Author_Institution
    Appl. Mater., Rehovot, Israel
  • fYear
    2014
  • fDate
    3-5 Dec. 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The primary objective of a speaker diarization system is to assign each speech segment to one of K speakers in the conversation. We use a hidden-distortion-model (HDM)-based system. HDM allows different emission models to be used as speaker models. We investigate the effect of randomization at two levels: stochastic versus deterministic training, and random model initialization versus preserving the initialization from the previous iteration. The emission models were codebooks (CBs) trained using the K-means algorithm, in both batch and stochastic versions, as well as a self-organizing map (SOM) in its stochastic version. The evaluation was performed on 108 telephone conversations from the LDC CallHome corpus. We show that randomization always outperforms deterministic training. Stochastic training demonstrated a relative improvement of 3.5%. Random initialization achieved a relative improvement of 7.28% compared with preserving the initialization from the previous iteration.
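    The two levels of randomization named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the HDM framework and the SOM variant are omitted, the 1/count step size in the stochastic version is an assumed choice, and all function names (nearest, batch_kmeans, stochastic_kmeans, initial_codebook) are hypothetical.

```python
import random

def nearest(codebook, x):
    """Index of the codeword closest to x (squared Euclidean distance)."""
    return min(range(len(codebook)),
               key=lambda k: sum((c - v) ** 2 for c, v in zip(codebook[k], x)))

def batch_kmeans(data, codebook, n_iter=10):
    """Batch K-means: reassign all vectors, then recompute every codeword."""
    cb = [list(c) for c in codebook]
    for _ in range(n_iter):
        groups = [[] for _ in cb]
        for x in data:
            groups[nearest(cb, x)].append(x)
        for k, members in enumerate(groups):
            if members:  # an empty codeword is left where it is
                cb[k] = [sum(col) / len(members) for col in zip(*members)]
    return cb

def stochastic_kmeans(data, codebook, n_epochs=10, rng=random):
    """Stochastic K-means: visit vectors in random order, move only the winner."""
    cb = [list(c) for c in codebook]
    counts = [0] * len(cb)
    for _ in range(n_epochs):
        for x in rng.sample(data, len(data)):
            k = nearest(cb, x)
            counts[k] += 1
            # Assumed step size 1/count: the codeword is a running mean of its wins.
            cb[k] = [c + (v - c) / counts[k] for c, v in zip(cb[k], x)]
    return cb

def initial_codebook(data, size, previous=None, rng=random):
    """Preserve the previous iteration's codebook, or draw a fresh random one."""
    if previous is not None:
        return [list(c) for c in previous]
    return [list(x) for x in rng.sample(data, size)]
```

    In an iterative diarization loop, calling initial_codebook with previous set to the last codebook corresponds to preserving the initialization, while omitting it corresponds to random re-initialization at every iteration.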
  • Keywords
    iterative methods; mobile radio; self-organising feature maps; speaker recognition; HDM; K-means algorithm; LDC CallHome corpus; SOM; batch versions; codebooks; deterministic training; emission models; hidden distortion model; iterative-based speaker diarization system; preserving initialization; random model initialization; randomization effect; self-organizing map; speaker models; speech segments; stochastic training; stochastic versions; telephone conversations; Adaptation models; Convergence; Density estimation robust algorithm; Hidden Markov models; Speech; Stochastic processes; Training; K-means; hidden-distortion model (HDM); initialization; self-organizing maps (SOM); speaker diarization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Electrical & Electronics Engineers in Israel (IEEEI), 2014 IEEE 28th Convention of
  • Conference_Location
    Eilat
  • Print_ISBN
    978-1-4799-5987-7
  • Type

    conf

  • DOI
    10.1109/EEEI.2014.7005738
  • Filename
    7005738