• DocumentCode
    3077829
  • Title

    Double ended speech enabled system in Indian travel & tourism industry

  • Author

    Mohanty, S. ; Swain, Basanta Kumar

  • Author_Institution
    Dept. of Comput. Sc. & Applic., Utkal Univ., Bhubaneswar, India
  • fYear
    2013
  • fDate
    26-28 Dec. 2013
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    In this research paper we emphasized on development of double ended voice enabled system in order to receive the voice query and convey voiced output message related to travel and tourism domain in Indian language. The voice enable system was developed using multiple components such as automatic speech recognizer (ASR) engine, query classifier and speech synthesis engine. The speech recognition engine plays very crucial role in speech based system which we have evaluated using multiple pattern recognition algorithms namely Hidden Markov Model (HMM), Support Vector Machine (SVM), ontology based feed forward back propagation neural network (OFFBPNN), dynamic time warping (DTW). The performance of SVM AND HMM were seen superior with respect to OFFBPNN, DTW which were measured in terms of word accuracy and word error rate. The output of ASR is fed to k-nearest neighbour (KNN) query classifier and the end result of classifier is finally passed to Odia speech synthesizer to deliver the response in voice mode. We have employed voice transformation technique in speech synthesis system to produce the spoken output in male, female, child and robotic voice. The developed double ended voice enabled system is operational over Odia spoken query and delivered the response in synthesized Odia voice.
  • Keywords
    audio databases; backpropagation; error statistics; feedforward neural nets; hidden Markov models; natural language processing; ontologies (artificial intelligence); query processing; signal classification; speech recognition; speech synthesis; support vector machines; travel industry; ASR engine; DTW; HMM; Indian language; Indian travel & tourism industry; KNN query classifier; OFFBPNN; Odia speech synthesizer; Odia spoken query; Odia voice; SVM; automatic speech recognizer; double ended voice enabled system; dynamic time warping; hidden Markov model; k-nearest neighbour query classifier; ontology based feed forward back propagation neural network; pattern recognition algorithms; robotic voice; speech based system; speech recognition engine; speech synthesis engine; speech synthesis system; support vector machine; voice mode; voice query; voice transformation technique; voiced output message; word accuracy; word error rate; Databases; Hidden Markov models; Industries; Speech; Speech recognition; Support vector machines; Vectors; ANN; ASR; DTW; SVM; speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Computing Research (ICCIC), 2013 IEEE International Conference on
  • Conference_Location
    Enathi
  • Print_ISBN
    978-1-4799-1594-1
  • Type

    conf

  • DOI
    10.1109/ICCIC.2013.6724164
  • Filename
    6724164