Title :
Phone set construction based on context-sensitive articulatory attributes for code-switching speech recognition
Author :
Wu, Chung-Hsien ; Shen, Han-Ping ; Yang, Yan-Ting
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Abstract :
Bilingual speakers are known for their ability to code-switch or mix their languages during communication. This phenomenon occurs when bilinguals substitute a word or phrase from one language with a phrase or word from another language. For code-switching speech recognition, it is essential to collect a large-scale code-switching speech database for model training. In order to ease the negative effect caused by the data sparseness problem in training code-switching speech recognizers, this study proposes a data-driven approach to phone set construction by integrating acoustic features and cross-lingual context-sensitive articulatory features into distance measure between phone units. KL-divergence and a hierarchical phone unit clustering algorithm are used in this study to cluster similar phone units to reduce the need of the training data for model construction. The experimental results show that the proposed method outperforms other traditional phone set construction methods.
Keywords :
speech coding; speech recognition; KL-divergence; bilingual speakers; code-switching speech database; code-switching speech recognition; context-sensitive articulatory; cross-lingual context-sensitive articulatory; data sparseness problem; data-driven approach; hierarchical phone unit clustering algorithm; phone set construction; phone set construction methods; Accuracy; Acoustics; Feature extraction; Hidden Markov models; Speech; Speech recognition; Training; articulatory attribute; code-switching; phone set construction; speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289009