DocumentCode :
417176
Title :
Language boundary detection and identification of mixed-language speech based on MAP estimation
Author :
Shia, Chi-Jiun ; Chiu, Yu-Hsien ; Hsieh, Jia-Hsin ; Wu, Chung-Hsien
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
The paper proposes a maximum a posteriori (MAP) based approach to segment and identify jointly an utterance with mixed languages. A statistical framework for language boundary detection and language identification is proposed. First, the MAP estimation is used to determine the boundary number and positions. Further, an LSA-based GMM and a VQ-based bigram language model are proposed to characterize a language and used for language identification. Finally, a likelihood ratio test approach is used to determine the optimal number of language boundaries. Experimental results show that the proposed approach exhibits encouraging potential in mixed-language segmentation and identification.
Keywords :
Gaussian processes; least squares approximations; maximum likelihood estimation; natural languages; speech recognition; statistical analysis; vector quantisation; GMM; LSA; MAP estimation; VQ; bigram language model; language boundary detection; language identification; likelihood ratio test; maximum a posteriori estimation; mixed-language speech; Acoustic signal detection; Application software; Computer science; Humans; Matrix converters; Natural languages; Probability; Robustness; Speech; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326002
Filename :
1326002
Link To Document :
بازگشت