• DocumentCode
    2480927
  • Title

    Role of Synthetically Generated Samples on Speech Recognition in a Resource-Scarce Language

  • Author

    Chakraborty, Rupayan ; Garain, Utpal

  • Author_Institution
    St. Thomas´´ Coll. of Eng. & Technol., Kolkata, India
  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    1618
  • Lastpage
    1621
  • Abstract
    Speech recognition systems that make use of statistical classifiers require a large number of training samples. However, collection of real samples has always been a difficult problem due to the involvement of substantial amount of human intervention and cost. Considering this problem, this paper presents a novel method for generating synthetic samples from a handful of real samples and investigates the role of these samples in designing a speech recognition system. Speaker dependent limited vocabulary isolated word recognition in an Indian language (i.e. Bengali) has been taken a reference to demonstrate the potential of the proposed framework. The role of synthetic samples is demonstrated by showing a significant improvement in recognition accuracy. A maximum improvement of 10% is achieved using the proposed approach.
  • Keywords
    speech recognition; statistical analysis; Indian language; resource scarce language; speech recognition; statistical classifiers; synthetically generated samples; Artificial neural networks; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; Vocabulary; Indian langauge; Synthetic sample; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.400
  • Filename
    5595954