Speech segmentation without speech recognition

Author

Wang, Dong ; Lu, Lie ; Zhang, Hong-Jiang

Author_Institution

Dept. of Electron. Eng., Tsinghua Univ., Beijing, China

Volume

1

fYear

2003

fDate

6-10 April 2003

Abstract

In this paper, we presented a semantic speech segmentation approach, in particular sentence segmentation, without speech recognition. In order to get phoneme level information without word recognition information, a novel vowel/consonant/pause (V/C/P) classification is proposed. An adaptive pause detection method is also presented to adapt to various backgrounds and environments. Three feature sets, which include pause, rate of speech and prosody, are used to discriminate the sentence boundary. Experiments on broadcasting news indicate that the performance of the proposed algorithm is satisfying.

Keywords

feature extraction; signal classification; speech processing; V/C/P classification; adaptive pause detection; broadcasting news; feature sets; performance; phoneme level information; prosody; rate of speech; semantic speech segmentation; sentence boundary discrimination; sentence segmentation; vowel/consonant/pause classification; Acoustic applications; Acoustic noise; Asia; Broadcasting; Feature extraction; Indexing; Multimedia communication; Natural languages; Speech recognition; Working environment noise;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-7663-3

Type

conf

DOI

10.1109/ICASSP.2003.1198819

Filename

1198819