DocumentCode
602042
Title
Customizable cloud-healthcare dialogue system based on LVCSR with prosodic-contextual post-processing
Author
Bo-Wei Chen ; Po-Yi Shih ; Bharanitharan, K. ; Po-Chuan Lin ; Jhing-Fa Wang ; Chia-Ming Chen
Author_Institution
Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
fYear
2013
fDate
12-16 March 2013
Firstpage
246
Lastpage
249
Abstract
This work presents a customized cloud-healthcare dialogue system based on large vocabulary continuous speech recognition (LVCSR) with prosodic-contextual post-processing. The system consists of two parts. The first is the cloud dialogue management and strategy, which manages and provides services on demand. The second is a web-based reminder and customizable interface, which offers settings for reminder events and for the customizable dialogue system. Moreover, to improve speech recognition accuracy, this work proposes a prosodic-contextual post-processing mechanism, which selects the best sentence from the candidate recognition results by using syllable segmentation, pitch analysis, and contextual analysis. In the experiment, five healthcare scenarios for the elderly are designed for evaluation. The analysis indicates that the average mean opinion score (MOS) can reach as high as 4.23. Additionally, the word error rate (WER) of LVCSR with the proposed prosodic-contextual post-processing is improved by 9.21%. These results show that the proposed system is suitable for the elderly in daily living and demonstrate the feasibility of the approach.
Keywords
biomedical communication; cloud computing; health care; interactive systems; speech recognition; LVCSR; MOS; WER; cloud dialogue management; contextual analysis; customizable dialogue system; customized cloud-healthcare dialogue system; large vocabulary continuous speech recognition; mean opinion score; pitch analysis; prosodic contextual post-processing; syllable segmentation; word error rate; Educational institutions; Senior citizens; Servers; Speech; dialogue system; healthcare services
fLanguage
English
Publisher
IEEE
Conference_Titel
Orange Technologies (ICOT), 2013 International Conference on
Conference_Location
Tainan
Print_ISBN
978-1-4673-5934-4
Type
conf
DOI
10.1109/ICOT.2013.6521203
Filename
6521203