Title :
Cross-lingual acoustic modeling for Indian languages based on Subspace Gaussian Mixture Models
Author :
Joy, Neethu Mariam ; Abraham, Bassam ; Navneeth, K. ; Umesh, S.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Madras, Chennai, India
fDate :
Feb. 28 2014-March 2 2014
Abstract :
Cross-lingual acoustic modeling using Subspace Gaussian Mixture Model for low-resource languages of Indian origin is investigated. Building acoustic model for a low-resource language with limited vocabulary by leveraging resources from another language with comparatively larger resources was focused upon. Experiments were done on Bengali and Tamil corpus from MANDI database, with Tamil having greater resources than Bengali. We observed that the word accuracy of cross-lingual acoustic model of Bengali was approximately 2.5% above it´s CDHMM model and gave equivalent performance as it´s monolingual SGMM model.
Keywords :
Gaussian processes; acoustic signal processing; mixture models; speech processing; speech recognition; vocabulary; Bengali corpus; CDHMM model; Indian languages; MANDI database; Tamil corpus; cross-lingual acoustic modeling; limited vocabulary; low-resource languages; monolingual SGMM model; subspace Gaussian mixture models; Acoustics; Data models; Gaussian mixture model; Hidden Markov models; Speech; Speech recognition; Vectors; Cross-lingual acoustic model; SGMM;
Conference_Titel :
Communications (NCC), 2014 Twentieth National Conference on
Conference_Location :
Kanpur
DOI :
10.1109/NCC.2014.6811282