Subspace based for Indian languages

Author

Mohan, Aanchan ; Umesh, S. ; Rose, Richard

Author_Institution

Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada

fYear

2012

fDate

2-5 July 2012

Firstpage

35

Lastpage

39

Abstract

The interest in this paper is in efficient configuration of automatic speech recognition (ASR) systems for use by under-served speaker populations. A task domain involving Indian farmers accessing information on agricultural commodities through a spoken dialog system in multiple languages is presented. To facilitate the development of ASR system for this domain, a speech corpus was collected in rural areas from speakers of four languages over wireless cellular channels. This paper investigates the problem of ASR acoustic modelling for this task domain. Continuous density hidden Markov model (CDHMM) and subspace Gaussian mixture model (SGMM) [1] based techniques are used to train acoustic models in four languages: Assamese, Bengali, Hindi and Marathi. Issues relating to limited linguistic resources with their impact on ASR word accuracy for these languages are addressed.

Keywords

Gaussian processes; hidden Markov models; natural language processing; speech recognition; ASR acoustic modelling; ASR system; ASR word accuracy; Assamese; Bengali; CDHMM; Hindi; Indian farmer; Indian language; Marathi; SGMM; agricultural commodity; automatic speech recognition; continuous density hidden Markov model; linguistic resources; speech corpus; spoken dialog system; subspace Gaussian mixture model; subspace based acoustic modelling; under-served speaker population; wireless cellular channel; Acoustics; Speech; Training; Vectors; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on

Conference_Location

Montreal, QC

Print_ISBN

978-1-4673-0381-1

Electronic_ISBN

978-1-4673-0380-4

Type

conf

DOI

10.1109/ISSPA.2012.6310575

Filename

6310575