• DocumentCode
    672283
  • Title

    A MFCC based Hindi speech recognition technique using HTK Toolkit

  • Author

    Tripathy, Somanath ; Baranwal, Neha ; Nandi, Gora Chand

  • Author_Institution
    Robot. & AI Lab., Indian Inst. of Inf. Technol., Dewghat, India
  • fYear
    2013
  • fDate
    9-11 Dec. 2013
  • Firstpage
    539
  • Lastpage
    544
  • Abstract
    To utilize robots' capabilities, we must be able to communicate with them efficiently, which is why human-robot interaction is attracting the attention of many researchers. In this paper, a speech recognition system is developed using two feature extraction techniques, MFCC (mel frequency cepstral coefficients) and LPC (linear predictive coding), with an HMM (hidden Markov model) as the classifier. Little work in this field has been done for the Hindi language, and what exists uses vocabularies that are not very large. The work in this paper therefore uses a Hindi database with a somewhat extended vocabulary. The HMM is implemented using the HTK Toolkit, and the performance of the two feature extraction techniques is then compared. Audacity was used for the sound recordings, and Cygwin was used to execute the HTK commands in a Linux-like environment on the Windows platform. The system was tested in both speaker-dependent and speaker-independent settings; the performance results and the comparison graph show that MFCC performs better than LPC in every condition.
  • Keywords
    feature extraction; hidden Markov models; human-robot interaction; natural language processing; speech recognition; HMM; HTK toolkit; LPC; Linux type environment; MFCC based Hindi speech recognition technique; feature extraction techniques; human robot interaction; linear predictive coding; mel frequency cepstral coefficient; sound recordings; speaker dependent; speaker independent; vocabulary size; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Robots; Speech; Speech recognition; Vocabulary; HMM; LPC; MFCC; Speech recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Image Information Processing (ICIIP), 2013 IEEE Second International Conference on
  • Conference_Location
    Shimla
  • Print_ISBN
    978-1-4673-6099-9
  • Type

    conf

  • DOI
    10.1109/ICIIP.2013.6707650
  • Filename
    6707650