Title :
Optimizing the acoustic modeling from an unbalanced bi-lingual corpus
Author :
Lyu, Dau-Cheng ; Lyu, Ren-Yuan
Author_Institution :
Dept. of Electr. Eng., Chang Gung Univ., Taoyuan
fDate :
March 31 2008-April 4 2008
Abstract :
Phoneme set clustering of accurate modeling is important in the task of multilingual speech recognition, especially when each of the available language training corpora is mismatched, such as is the case between a major language, like Mandarin, and a minor language, like Taiwanese. In this paper, we present a data-driven approach for not only acquiring a proper phoneme set but optimizing the acoustic modeling in this situation. In order to obtain the phoneme set that is suitable for the unbalanced corpus, we use an agglomerative hierarchical clustering with delta Bayesian information criteria. Then for training each of the acoustic models, we choose a parametric modeling technique, model complexity selection, to adjust the number of mixtures for optimizing the acoustic model between the new phoneme set and the available training data. The experimental results are very encouraging in that the proposed approach reduces relative syllable error rate by 7.8% over the best result of the knowledge-based approach.
Keywords :
Bayes methods; speech recognition; acoustic modeling; agglomerative hierarchical clustering; delta Bayesian information criteria; model complexity selection; multilingual speech recognition; parametric modeling technique; phoneme set clustering; unbalanced bilingual corpus; Acoustic measurements; Acoustical engineering; Bayesian methods; Computer science; Error analysis; Large-scale systems; Natural languages; Parametric statistics; Speech recognition; Training data; delta-BIC; multilingual speech recognition; phoneme set clustering;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518606