DocumentCode
3431018
Title
Subspace based for Indian languages
Author
Mohan, Aanchan ; Umesh, S. ; Rose, Richard
Author_Institution
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
fYear
2012
fDate
2-5 July 2012
Firstpage
35
Lastpage
39
Abstract
The interest in this paper is in efficient configuration of automatic speech recognition (ASR) systems for use by under-served speaker populations. A task domain involving Indian farmers accessing information on agricultural commodities through a spoken dialog system in multiple languages is presented. To facilitate the development of ASR system for this domain, a speech corpus was collected in rural areas from speakers of four languages over wireless cellular channels. This paper investigates the problem of ASR acoustic modelling for this task domain. Continuous density hidden Markov model (CDHMM) and subspace Gaussian mixture model (SGMM) [1] based techniques are used to train acoustic models in four languages: Assamese, Bengali, Hindi and Marathi. Issues relating to limited linguistic resources with their impact on ASR word accuracy for these languages are addressed.
Keywords
Gaussian processes; hidden Markov models; natural language processing; speech recognition; ASR acoustic modelling; ASR system; ASR word accuracy; Assamese; Bengali; CDHMM; Hindi; Indian farmer; Indian language; Marathi; SGMM; agricultural commodity; automatic speech recognition; continuous density hidden Markov model; linguistic resources; speech corpus; spoken dialog system; subspace Gaussian mixture model; subspace based acoustic modelling; under-served speaker population; wireless cellular channel; Acoustics; Speech; Training; Vectors; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location
Montreal, QC
Print_ISBN
978-1-4673-0381-1
Electronic_ISBN
978-1-4673-0380-4
Type
conf
DOI
10.1109/ISSPA.2012.6310575
Filename
6310575
Link To Document