• DocumentCode
    3431018
  • Title

    Subspace based for Indian languages

  • Author

    Mohan, Aanchan ; Umesh, S. ; Rose, Richard

  • Author_Institution
    Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
  • fYear
    2012
  • fDate
    2-5 July 2012
  • Firstpage
    35
  • Lastpage
    39
  • Abstract
    The interest in this paper is in efficient configuration of automatic speech recognition (ASR) systems for use by under-served speaker populations. A task domain involving Indian farmers accessing information on agricultural commodities through a spoken dialog system in multiple languages is presented. To facilitate the development of ASR system for this domain, a speech corpus was collected in rural areas from speakers of four languages over wireless cellular channels. This paper investigates the problem of ASR acoustic modelling for this task domain. Continuous density hidden Markov model (CDHMM) and subspace Gaussian mixture model (SGMM) [1] based techniques are used to train acoustic models in four languages: Assamese, Bengali, Hindi and Marathi. Issues relating to limited linguistic resources with their impact on ASR word accuracy for these languages are addressed.
  • Keywords
    Gaussian processes; hidden Markov models; natural language processing; speech recognition; ASR acoustic modelling; ASR system; ASR word accuracy; Assamese; Bengali; CDHMM; Hindi; Indian farmer; Indian language; Marathi; SGMM; agricultural commodity; automatic speech recognition; continuous density hidden Markov model; linguistic resources; speech corpus; spoken dialog system; subspace Gaussian mixture model; subspace based acoustic modelling; under-served speaker population; wireless cellular channel; Acoustics; Speech; Training; Vectors; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
  • Conference_Location
    Montreal, QC
  • Print_ISBN
    978-1-4673-0381-1
  • Electronic_ISBN
    978-1-4673-0380-4
  • Type

    conf

  • DOI
    10.1109/ISSPA.2012.6310575
  • Filename
    6310575