DocumentCode :
701782
Title :
Investigation of different acoustic modeling techniques for low resource Indian language data
Author :
Sriranjani, R. ; Murali Karthick, B. ; Umesh, S.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Madras, Chennai, India
fYear :
2015
fDate :
Feb. 27 2015-March 1 2015
Firstpage :
1
Lastpage :
5
Abstract :
In this paper, we investigate the performance of deep neural network (DNN) and Subspace Gaussian mixture model (SGMM) in low-resource condition. Even though DNN outperforms SGMM and continuous density hidden Markov models (CDHMM) for high-resource data, it degrades in performance while modeling low-resource data. Our experimental results show that SGMM outperforms DNN for limited transcribed data. To resolve this problem in DNN, we propose to train DNN containing bottleneck layer in two stages: First stage involves extraction of bottleneck features. In second stage, the extracted bottleneck features from first stage are used to train DNN having bottleneck layer. All our experiments are performed using two Indian languages (Tamil & Hindi) in Mandi database. Our proposed method shows improved performance when compared to baseline SGMM and DNN models for limited training data.
Keywords :
Gaussian processes; hidden Markov models; mixture models; natural language processing; neural nets; CDHMM; DNN; Mandi database; SGMM; continuous density hidden Markov models; deep neural network; different acoustic modeling techniques; low resource Indian language data; subspace Gaussian mixture model; Acoustics; Data models; Databases; Feature extraction; Hidden Markov models; Training; Training data; DNN; Hindi; Indian languages; SGMM; Tamil; bottleneck; low resource data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications (NCC), 2015 Twenty First National Conference on
Conference_Location :
Mumbai
Type :
conf
DOI :
10.1109/NCC.2015.7084860
Filename :
7084860
Link To Document :
بازگشت