Double ended speech enabled system in Indian travel & tourism industry

Author

Mohanty, S. ; Swain, Basanta Kumar

Author_Institution

Dept. of Comput. Sc. & Applic., Utkal Univ., Bhubaneswar, India

fYear

2013

fDate

26-28 Dec. 2013

Firstpage

1

Lastpage

7

Abstract

In this research paper we emphasized on development of double ended voice enabled system in order to receive the voice query and convey voiced output message related to travel and tourism domain in Indian language. The voice enable system was developed using multiple components such as automatic speech recognizer (ASR) engine, query classifier and speech synthesis engine. The speech recognition engine plays very crucial role in speech based system which we have evaluated using multiple pattern recognition algorithms namely Hidden Markov Model (HMM), Support Vector Machine (SVM), ontology based feed forward back propagation neural network (OFFBPNN), dynamic time warping (DTW). The performance of SVM AND HMM were seen superior with respect to OFFBPNN, DTW which were measured in terms of word accuracy and word error rate. The output of ASR is fed to k-nearest neighbour (KNN) query classifier and the end result of classifier is finally passed to Odia speech synthesizer to deliver the response in voice mode. We have employed voice transformation technique in speech synthesis system to produce the spoken output in male, female, child and robotic voice. The developed double ended voice enabled system is operational over Odia spoken query and delivered the response in synthesized Odia voice.

Keywords

audio databases; backpropagation; error statistics; feedforward neural nets; hidden Markov models; natural language processing; ontologies (artificial intelligence); query processing; signal classification; speech recognition; speech synthesis; support vector machines; travel industry; ASR engine; DTW; HMM; Indian language; Indian travel & tourism industry; KNN query classifier; OFFBPNN; Odia speech synthesizer; Odia spoken query; Odia voice; SVM; automatic speech recognizer; double ended voice enabled system; dynamic time warping; hidden Markov model; k-nearest neighbour query classifier; ontology based feed forward back propagation neural network; pattern recognition algorithms; robotic voice; speech based system; speech recognition engine; speech synthesis engine; speech synthesis system; support vector machine; voice mode; voice query; voice transformation technique; voiced output message; word accuracy; word error rate; Databases; Hidden Markov models; Industries; Speech; Speech recognition; Support vector machines; Vectors; ANN; ASR; DTW; SVM; speech synthesis;

fLanguage

English

Publisher

ieee

Conference_Titel

Computational Intelligence and Computing Research (ICCIC), 2013 IEEE International Conference on

Conference_Location

Enathi

Print_ISBN

978-1-4799-1594-1

Type

conf

DOI

10.1109/ICCIC.2013.6724164

Filename

6724164