• DocumentCode
    1863559
  • Title

    Speech segmentation without speech recognition

  • Author

    Dong Wang ; Lu, Lie ; Hong-Jiang Zhang

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • Volume
    1
  • fYear
    2003
  • fDate
    6-9 July 2003
  • Abstract
    In this paper, we presented a semantic speech segmentation approach, in particular sentence segmentation, without speech recognition. In order to get phoneme level information without word recognition information, a novel vowel/consonant/pause (V/C/P) classification is proposed. An adaptive pause detection method is also presented to adapt to various background and environment. Three feature sets, which include pause, rate of speech and prosody, are used to discriminate the sentence boundary. Experiments on broadcasting news indicate that the performance of proposed algorithm is satisfying.
  • Keywords
    speech processing; adaptive pause detection method; prosody; sentence boundary; sentence segmentation; speech segmentation; Acoustic applications; Acoustic noise; Asia; Broadcasting; Feature extraction; Indexing; Multimedia communication; Natural languages; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
  • Print_ISBN
    0-7803-7965-9
  • Type

    conf

  • DOI
    10.1109/ICME.2003.1220940
  • Filename
    1220940