DocumentCode
2556202
Title
Speech activity and speaker novelty detection methods for meeting processing
Author
Sugiyama, Masahide ; Markov, Konstantin ; Ronzhin, Andrey ; Budkov, Victor ; Karpov, Alexey ; Prischepa, Maria
Author_Institution
Human Interface Lab., Univ. of Aizu, Fukushima, Japan
fYear
2009
fDate
12-14 Oct. 2009
Firstpage
1
Lastpage
6
Abstract
Segmentation of multi-speaker meeting audio data recorded with several microphones into speech/silence frames is one of the first tasks in developing a speaker diarization system. Energy normalization techniques and signal correlation methods are used to avoid the crosstalk problem, in which a participant's speech appears on other participants' microphones. A comparison of different types of microphones and the configuration of the recording devices implemented inside the intelligent meeting room are described. Special attention is paid to improving the novelty detection performance of the on-line speaker diarization system.
Keywords
microphones; speech processing; energy normalization techniques; meeting processing; multi-speaker segmentation; speaker diarization system; speaker novelty detection methods; speech activity; Audio recording; Humans; Informatics; Laboratories; Loudspeakers; Microphones; NIST; Signal processing; Speech processing; Speech recognition; multimodal interfaces; sound source localization; speaker diarization; voice activity detection
fLanguage
English
Publisher
IEEE
Conference_Titel
Ultra Modern Telecommunications & Workshops, 2009. ICUMT '09. International Conference on
Conference_Location
St. Petersburg
Print_ISBN
978-1-4244-3942-3
Electronic_ISBN
978-1-4244-3941-6
Type
conf
DOI
10.1109/ICUMT.2009.5345325
Filename
5345325