DocumentCode
3077829
Title
Double ended speech enabled system in Indian travel & tourism industry
Author
Mohanty, S. ; Swain, Basanta Kumar
Author_Institution
Dept. of Comput. Sc. & Applic., Utkal Univ., Bhubaneswar, India
fYear
2013
fDate
26-28 Dec. 2013
Firstpage
1
Lastpage
7
Abstract
In this research paper we emphasized on development of double ended voice enabled system in order to receive the voice query and convey voiced output message related to travel and tourism domain in Indian language. The voice enable system was developed using multiple components such as automatic speech recognizer (ASR) engine, query classifier and speech synthesis engine. The speech recognition engine plays very crucial role in speech based system which we have evaluated using multiple pattern recognition algorithms namely Hidden Markov Model (HMM), Support Vector Machine (SVM), ontology based feed forward back propagation neural network (OFFBPNN), dynamic time warping (DTW). The performance of SVM AND HMM were seen superior with respect to OFFBPNN, DTW which were measured in terms of word accuracy and word error rate. The output of ASR is fed to k-nearest neighbour (KNN) query classifier and the end result of classifier is finally passed to Odia speech synthesizer to deliver the response in voice mode. We have employed voice transformation technique in speech synthesis system to produce the spoken output in male, female, child and robotic voice. The developed double ended voice enabled system is operational over Odia spoken query and delivered the response in synthesized Odia voice.
Keywords
audio databases; backpropagation; error statistics; feedforward neural nets; hidden Markov models; natural language processing; ontologies (artificial intelligence); query processing; signal classification; speech recognition; speech synthesis; support vector machines; travel industry; ASR engine; DTW; HMM; Indian language; Indian travel & tourism industry; KNN query classifier; OFFBPNN; Odia speech synthesizer; Odia spoken query; Odia voice; SVM; automatic speech recognizer; double ended voice enabled system; dynamic time warping; hidden Markov model; k-nearest neighbour query classifier; ontology based feed forward back propagation neural network; pattern recognition algorithms; robotic voice; speech based system; speech recognition engine; speech synthesis engine; speech synthesis system; support vector machine; voice mode; voice query; voice transformation technique; voiced output message; word accuracy; word error rate; Databases; Hidden Markov models; Industries; Speech; Speech recognition; Support vector machines; Vectors; ANN; ASR; DTW; SVM; speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Computing Research (ICCIC), 2013 IEEE International Conference on
Conference_Location
Enathi
Print_ISBN
978-1-4799-1594-1
Type
conf
DOI
10.1109/ICCIC.2013.6724164
Filename
6724164
Link To Document