• DocumentCode
    2556202
  • Title

    Speech activity and speaker novelty detection methods for meeting processing

  • Author

    Sugiyama, Masahide ; Markov, Konstantin ; Ronzhin, Andrey ; Budkov, Victor ; Karpov, Alexey ; Prischepa, Maria

  • Author_Institution
    Human Interface Lab., Univ. of Aizu, Fukushima, Japan
  • fYear
    2009
  • fDate
    12-14 Oct. 2009
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Segmentation of multi-speaker meeting audio data recorded with several microphones into speech/silence frames is one of the first tasks at development of the speaker diarization system. Energy normalization techniques and signal correlation methods are used in order to avoid the crosstalk problem, in which participant´s speech appears on other participants´ microphones. A comparison of different types of microphones and a configuration of the recording devices implemented inside the intelligent meeting room are described. Special attention is paid to improvement of the novelty detection performance of the on-line speaker diarization system.
  • Keywords
    microphones; speech processing; energy normalization techniques; meeting processing; multi-speaker segmentation; speaker diarization system; speaker novelty detection methods; speech activity; Audio recording; Humans; Informatics; Laboratories; Loudspeakers; Microphones; NIST; Signal processing; Speech processing; Speech recognition; multimodal interfaces; sound source localization; speaker diarization; voice activity detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Ultra Modern Telecommunications & Workshops, 2009. ICUMT '09. International Conference on
  • Conference_Location
    St. Petersburg
  • Print_ISBN
    978-1-4244-3942-3
  • Electronic_ISBN
    978-1-4244-3941-6
  • Type

    conf

  • DOI
    10.1109/ICUMT.2009.5345325
  • Filename
    5345325