DocumentCode
2973336
Title
Audio data indexing: Use of second-order statistics for speaker-based segmentation
Author
Delacourt, Perrine ; Wellekens, Christian
Author_Institution
Inst. EURECOM, Sophia Antipolis, France
Volume
2
fYear
1999
fDate
36342
Firstpage
959
Abstract
The content-based indexing task considered in this paper consists of recognizing, from their voices, the speakers involved in a conversation. A new approach to speaker-based segmentation, which is the necessary first step for this indexing task, is described. Our study assumes that no prior information on the speakers is available, that the number of speakers is unknown, and that people do not speak simultaneously. Audio data indexing is commonly divided into two parts: the audio data is first segmented with respect to speaker utterances, and then the resulting segments associated with a given speaker are merged together. In this work, we focus on the first part and propose a new segmentation method based on second-order statistics. The practical significance of this study is illustrated by applying the new technique to real data to show its efficiency.
Keywords
content-based retrieval; database indexing; multimedia databases; speech recognition; audio data indexing; content-based indexing; second order statistics; second-order statistics; speaker-based segmentation; Audio databases; Bayesian methods; Context modeling; Indexing; Information retrieval; Multimedia databases; Speech recognition; Statistics;
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia Computing and Systems, 1999. IEEE International Conference on
Conference_Location
Florence
Print_ISBN
0-7695-0253-9
Type
conf
DOI
10.1109/MMCS.1999.778619
Filename
778619