DocumentCode
1732897
Title
Design of audio-visual TV broadcast news transcription system prototype
Author
Chaloupka, Josef
Author_Institution
Lab. of Comput. Speech Process., Tech. Univ. of Liberec, Liberec, Czech Republic
fYear
2011
Firstpage
209
Lastpage
212
Abstract
This contribution focuses on the design of our automatic audio-visual TV broadcast news transcription system, in which we aim to extend our Czech transcription system to use information from the visual signal of TV news video recordings. The subsystems for visual signal segmentation, visual speaker identification, and visual voice activity detection are described. These subsystems are intended to support the development of our automatic audio-visual transcription system.
Keywords
audio-visual systems; image segmentation; speaker recognition; television broadcasting; video signal processing; Czech transcription system; TV news video recordings; automatic audio-visual TV broadcast news transcription system; visual signal segmentation; visual speaker identification; visual voice activity detection; Discrete cosine transforms; Hidden Markov models; Humans; Image color analysis; Image segmentation; Speech recognition; Visualization; audio-visual TV broadcast news transcription; visual signal segmentation; visual speaker identification; visual voice activity detector
fLanguage
English
Publisher
IEEE
Conference_Titel
ELMAR, 2011 Proceedings
Conference_Location
Zadar
ISSN
1334-2630
Print_ISBN
978-1-61284-949-2
Electronic_ISBN
1334-2630
Type
conf
Filename
6044293