DocumentCode :
1863559
Title :
Speech segmentation without speech recognition
Author :
Dong Wang ; Lu, Lie ; Hong-Jiang Zhang
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Volume :
1
fYear :
2003
fDate :
6-9 July 2003
Abstract :
In this paper, we presented a semantic speech segmentation approach, in particular sentence segmentation, without speech recognition. In order to get phoneme level information without word recognition information, a novel vowel/consonant/pause (V/C/P) classification is proposed. An adaptive pause detection method is also presented to adapt to various background and environment. Three feature sets, which include pause, rate of speech and prosody, are used to discriminate the sentence boundary. Experiments on broadcasting news indicate that the performance of proposed algorithm is satisfying.
Keywords :
speech processing; adaptive pause detection method; prosody; sentence boundary; sentence segmentation; speech segmentation; Acoustic applications; Acoustic noise; Asia; Broadcasting; Feature extraction; Indexing; Multimedia communication; Natural languages; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
Type :
conf
DOI :
10.1109/ICME.2003.1220940
Filename :
1220940
Link To Document :
بازگشت