• DocumentCode
    2695113
  • Title

    Interruption point detection of spontaneous speech using prior knowledge and multiple features

  • Author

    Liang, Wei-Bin ; Yeh, Jui-Feng ; Wu, Chung-Hsien ; Liou, Chi-Chiuan

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan
  • fYear
    2008
  • fDate
    June 23 2008-April 26 2008
  • Firstpage
    1457
  • Lastpage
    1460
  • Abstract
    This paper presents an approach to interruption point (IP) detection of spontaneous speech based on conditional random fields using prior knowledge and multiple features. The features adopted in this study consist of subsyllable boundaries and prosodic features. Conditional random fields (CRFs) and variable-length contextual features are employed for IP modeling. In order to apply the features with continuous values to the CRF models, the K-means clustering algorithm is adopted for the quantization of the prosodic features. In the experimental results, Mandarin Conversional Dialogue Corpus (MCDC) was used to evaluate the proposed method. The IP detection error rate achieved almost 20% reduction in Rt04 measure. The experimental results show that the proposed model can effectively detect the interruption point in spontaneous speech.
  • Keywords
    pattern clustering; speech processing; IP modeling; K-means clustering algorithm; Mandarin conversional dialogue corpus; conditional random field; interruption point detection; multiple features; prior knowledge; prosodic feature quantization; spontaneous speech; subsyllable boundary; variable-length contextual feature; Acoustic signal detection; Automatic speech recognition; Clustering algorithms; Computer science; Event detection; Hidden Markov models; Knowledge engineering; Natural languages; Speech recognition; Tagging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2008 IEEE International Conference on
  • Conference_Location
    Hannover
  • Print_ISBN
    978-1-4244-2570-9
  • Electronic_ISBN
    978-1-4244-2571-6
  • Type

    conf

  • DOI
    10.1109/ICME.2008.4607720
  • Filename
    4607720