• DocumentCode
    774733
  • Title

    Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system

  • Author

    Wu, Chung-Hsien ; Yan, Gwo-Lang

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • Volume
    13
  • Issue
    3
  • fYear
    2005
  • fDate
    5/1/2005 12:00:00 AM
  • Firstpage
    330
  • Lastpage
    344
  • Abstract
    This work presents an approach to modeling speech acts and verifying spontaneous speech with disfluency in a spoken dialogue system. According to this approach, semantic information, syntactic structure and fragment class of an input utterance are statistically encapsulated in a proposed speech act hidden Markov model (SAHMM) to characterize the speech act. An interpolation mechanism is exploited to re-estimate the state transition probability in SAHMM, to deal with the problem of disfluency in a sparse training corpus. Finally, a Bayesian belief model (BBM), based on latent semantic analysis (LSA), is adopted to verify the potential speech acts and output the final speech act. Experiments were conducted to evaluate the proposed approach using a spoken dialogue system for providing air travel information. A testing database from 25 speakers, with 480 dialogues that include 3038 sentences, was established and used for evaluation. Experimental results show that the proposed approach identifies 95.3% of speech act at a rejection rate of 5%, and the semantic accuracy is 4.2% better than that obtained using a keyword-based system. The proposed strategy also effectively alleviates the disfluency problem in spontaneous speech.
  • Keywords
    Bayes methods; hidden Markov models; interactive systems; interpolation; natural languages; probability; speech processing; state estimation; statistical analysis; Bayesian belief model; air travel information; disfluency problem; input utterance fragment class; input utterance syntactic structure; interpolation mechanism; keyword-based system; latent semantic analysis; semantic information; sparse training corpus; speech act hidden Markov model; speech act modeling; spoken dialogue system; spontaneous speech verification; state transition probability; Bayesian methods; Databases; Hidden Markov models; Indexing; Interpolation; Probability; Robustness; Speech analysis; Testing; Weather forecasting; Bayesian belief model; disfluency modeling; speech act modeling; spoken dialogue;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2005.845820
  • Filename
    1420368