Semantics-based language modeling for Cantonese-English code-mixing speech recognition

Author

Cao, Houwei ; Ching, P.C. ; Lee, Tan ; Yeung, Yu Ting

Author_Institution

Dept. of Electron. Eng., Chinese Univ. of Hong Hong, Hong Kong, China

fYear

2010

fDate

Nov. 29 2010-Dec. 3 2010

Firstpage

246

Lastpage

250

Abstract

This paper addresses the problem of language modeling for LVCSR of Cantonese-English code-mixing utterances spoken in daily communications. In the absence of sufficient amount of code-mixing text data, translation-based and semantics-based mapping are applied on n-grams to better estimate the probability of low-frequency and unseen mixed-language n-grams events. In translation-based mapping scheme, the Cantonese-to-English translation dictionary is adopted to transcribe monolingual Cantonese n-grams to mixed-language n-grams. In semantics-based mapping scheme, n-gram mapping is based on the meaning and syntactic function of the English words in the lexicon. Different semantics-based language models are trained with different mapping schemes. They are evaluated in terms of perplexity and in the task of LVCSR. Experimental results confirm that, the more the observed mixed-language n-grams after mapping, the better the language model perplexity as well as the recognition performance. The proposed language models show significant improvement on recognition performance on embedded English words when they are compared with the baseline 3-gram LM. The best recognition accuracy attained is 63.9% and 74.7% respectively for the English words and Cantonese characters in code-mixing utterances.

Keywords

speech recognition; Cantonese English code mixing speech recognition; Cantonese characters; LVCSR; code mixing text data; code mixing utterances; english characters; semantics based language modeling; semantics based mapping; translation based mapping; Accuracy; Data models; Dictionaries; Hidden Markov models; Speech; Speech recognition; Training; ASR; Cantonese-English code-mixing; language modeling; semantics;

fLanguage

English

Publisher

ieee

Conference_Titel

Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on

Conference_Location

Tainan

Print_ISBN

978-1-4244-6244-5

Type

conf

DOI

10.1109/ISCSLP.2010.5684900

Filename

5684900