Title :
Investigation of different acoustic modeling techniques for low resource Indian language data
Author :
Sriranjani, R. ; Murali Karthick, B. ; Umesh, S.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Madras, Chennai, India
fDate :
Feb. 27 2015-March 1 2015
Abstract :
In this paper, we investigate the performance of deep neural network (DNN) and Subspace Gaussian mixture model (SGMM) in low-resource condition. Even though DNN outperforms SGMM and continuous density hidden Markov models (CDHMM) for high-resource data, it degrades in performance while modeling low-resource data. Our experimental results show that SGMM outperforms DNN for limited transcribed data. To resolve this problem in DNN, we propose to train DNN containing bottleneck layer in two stages: First stage involves extraction of bottleneck features. In second stage, the extracted bottleneck features from first stage are used to train DNN having bottleneck layer. All our experiments are performed using two Indian languages (Tamil & Hindi) in Mandi database. Our proposed method shows improved performance when compared to baseline SGMM and DNN models for limited training data.
Keywords :
Gaussian processes; hidden Markov models; mixture models; natural language processing; neural nets; CDHMM; DNN; Mandi database; SGMM; continuous density hidden Markov models; deep neural network; different acoustic modeling techniques; low resource Indian language data; subspace Gaussian mixture model; Acoustics; Data models; Databases; Feature extraction; Hidden Markov models; Training; Training data; DNN; Hindi; Indian languages; SGMM; Tamil; bottleneck; low resource data;
Conference_Titel :
Communications (NCC), 2015 Twenty First National Conference on
Conference_Location :
Mumbai
DOI :
10.1109/NCC.2015.7084860