Title :
Fusion of speaker and lexical information for topic segmentation: A co-segmentation approach
Author :
Charlet, Delphine ; Damnati, Geraldine ; Bouchekif, Abdessalam ; Douib, Ameur
Author_Institution :
Orange Labs., Lannion, France
Abstract :
In this work, we investigate how speaker-based information and lexical-based information can be fused efficiently for topic segmentation of spoken contents. While in recent work, we have proposed an early fusion scheme, so as to jointly model speaker and lexical distribution, we propose here a co-segmentation framework, between segmentations performed in the speaker space and in the lexical space. Experiments carried out on two distinct corpora (Radio talk show and TV Broadcast News) show that, even if performances of speaker information are contrasted and closely related to the content structure, its integration with lexical information, either by early fusion or by co-segmentation, is always effective. Absolute gains of 16% (Radio corpus) and 5% (TV corpus) are observed for topic boundary detection performance.
Keywords :
computational linguistics; speaker recognition; lexical information; lexical space; speaker space; speaker-based information; spoken contents; topic segmentation; Acoustics; Classification algorithms; Indexes; Legged locomotion; Speech; TV; Topic segmentation; co-segmentation; lexical cohesion; speaker cohesion;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178975