DocumentCode :
1689809
Title :
Language diarization for code-switch conversational speech
Author :
Dau-Cheng Lyu ; Eng-Siong Chng ; Haizhou Li
Author_Institution :
Temasek Labs., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2013
Firstpage :
7314
Lastpage :
7318
Abstract :
This paper examines the process of language diarization, the process to perform language segmentation and recognition, in a code-switched speech. Towards this task, we have developed a 63 hours conversational code-switch corpus recorded from Singapore/Malaysia speakers. We show that code-switching can occur frequently and the average language interval may be as short as one second. As such, language diarization is challenging task. To process such short segments, we propose a language diarization system using long term context feature across several phone-based segments and the combination of acoustics and phonotactic information. We achieved a frame error rate of 14.7% for language diarization performance on a Mandarin-English code-switch corpus. To evaluate our system, we measured the language recognition performance on monolingual segments extracted from the code-switch corpus against published techniques of LID systems - we obtained a relative equal error rate reduction of 5.2%, 13.8%, 15.1% and 17.9% on speech durations of 0.1 to 0.5 sec., 0.5 to 1 sec., 1 to 3 sec. and 3 to 9 sec respectively.
Keywords :
error statistics; natural language processing; speaker recognition; speech coding; LID system; Malaysia speaker; Mandarin-English code switch corpus; Singapore speaker; acoustics information; code-switch conversational speech; frame error rate; language diarization process; language diarization system; language recognition; language segmentation; monolingual segment; phone-based segment; phonotactic information; time 0.1 s to 9 s; time 63 hour; Acoustics; Error analysis; Feature extraction; Speech; Speech coding; Speech processing; Speech recognition; code-switch; conversational speech; language diarization; language recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639083
Filename :
6639083
Link To Document :
بازگشت