• DocumentCode
    602042
  • Title

    Customizable cloud-healthcare dialogue system based on LVCSR with prosodic-contextual post-processing

  • Author

    Bo-Wei Chen ; Po-Yi Shih ; Bharanitharan, K. ; Po-Chuan Lin ; Jhing-Fa Wang ; Chia-Ming Chen

  • Author_Institution
    Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • fYear
    2013
  • fDate
    12-16 March 2013
  • Firstpage
    246
  • Lastpage
    249
  • Abstract
    This work presents a customized cloud-healthcare dialogue system design based on large vocabulary continuous speech recognition (LVCSR) with prosodic-contextual post-processing. The customized cloud-healthcare dialogue system includes two parts. The first part is the cloud dialogue management and strategy, which manage and provide the services on demand. The second part is a web-based reminder and a customizable interface, which offer settings of reminding events and the customizable dialogue system. Moreover, for higher accuracy of speech recognition, this work proposes prosodic-contextual post-processing mechanism, which can find the best sentence from potential recognition results by using syllable segmentation, pitch analysis, and contextual analysis. In the experiment, five healthcare scenarios for the elderly are designed for evaluation. The analysis indicates that the average mean opinion score (MOS) can reach as high as 4.23. Additionally, the word error rate (WER) of LVCSR with the proposed prosodic-contextual post-processing is improved by 9.21%. Such results show that the proposed system is suitable for the elderly in daily living and demonstrates feasibility of our idea.
  • Keywords
    biomedical communication; cloud computing; health care; interactive systems; speech recognition; LVCSR; MOS; WER; cloud dialogue management; contextual analysis; customizable dialogue system; customized cloud-healthcare dialogue system; large vocabulary continuous speech recognition; mean opinion score; pitch analysis; prosodic contextual post-processing; syllable segmentation; word error rate; Cloud computing; Educational institutions; Senior citizens; Servers; Speech; Speech recognition; Cloud computing; LVCSR; dialogue system; healthcare services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Orange Technologies (ICOT), 2013 International Conference on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4673-5934-4
  • Type

    conf

  • DOI
    10.1109/ICOT.2013.6521203
  • Filename
    6521203