DocumentCode :
3167342
Title :
Acoustic TextTiling for story segmentation of spoken documents
Author :
Zheng, Lilei ; Leung, Cheung-Chi ; Xie, Lei ; Ma, Bin ; Li, Haizhou
Author_Institution :
Shaanxi Provincial Key Lab. of Speech & Image Inf. Process., Northwestern Polytech. Univ., Xi'an, China
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
5121
Lastpage :
5124
Abstract :
We propose an acoustic TextTiling method based on segmental dynamic time warping for automatic story segmentation of spoken documents. Unlike most existing methods, which rely on LVCSR transcripts, this method detects story boundaries directly from audio streams. By analogy with the cosine-based lexical similarity between two text blocks in a transcript, we define an acoustic similarity measure between two pseudo-sentences in an audio stream. Experiments on the TDT2 Mandarin corpus show that acoustic TextTiling can achieve performance comparable to that of lexical TextTiling based on LVCSR transcripts. We use both MFCCs and Gaussian posteriorgrams as acoustic representations in our experiments, and the results show that Gaussian posteriorgrams are the more robust representation for segmenting stories that each contain multiple speakers.
Keywords :
Gaussian processes; audio streaming; speech processing; Gaussian posteriorgrams; LVCSR transcripts; TDT2 Mandarin corpus; acoustic TextTiling method; acoustic representations; audio streams; cosine-based lexical similarity; lexical TextTiling; segmental dynamic time warping; spoken document story segmentation; text blocks; Acoustic measurements; Acoustics; Glass; Heuristic algorithms; Speech; Speech processing; Vectors; TextTiling; segmental dynamic time warping; spoken document processing; story segmentation; topic segmentation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6289073
Filename :
6289073