DocumentCode
173000
Title
Automatic propagation of manual annotations for multimodal person identification in TV shows
Author
Budnik, Mateusz ; Poignant, Johann ; Besacier, Laurent ; Quenot, Georges
Author_Institution
LIG, Univ. Grenoble Alpes, Grenoble, France
fYear
2014
fDate
18-20 June 2014
Firstpage
1
Lastpage
4
Abstract
In this paper an approach to human annotation propagation for person identification in the multimodal context is proposed. A system is used, which combines speaker diarization and face clustering to produce multimodal clusters. The whole multimodal clusters are later annotated rather than just single tracks, which is done by propagation. Optical character recognition systems provides initial annotation. Four different strategies, which select candidates for annotation, are tested. The initial results of annotation propagation are promising. With the use of a proper active learning selection strategy the human annotator involvement could be reduced even further.
Keywords
learning (artificial intelligence); optical character recognition; pattern clustering; speaker recognition; TV shows; active learning selection strategy; face clustering; human annotation propagation; manual annotation automatic propagation; multimodal clusters; multimodal person identification; optical character recognition systems; speaker diarization; Character recognition; Face; Manuals; Multimedia communication; Optical character recognition software; TV; Videos;
fLanguage
English
Publisher
ieee
Conference_Titel
Content-Based Multimedia Indexing (CBMI), 2014 12th International Workshop on
Conference_Location
Klagenfurt
Type
conf
DOI
10.1109/CBMI.2014.6849849
Filename
6849849
Link To Document