DocumentCode
2696861
Title
Language identification using lattice-based phonotactic and syllabotactic approaches
Author
Zhu, Dong ; Adda-Decker, Martine
Author_Institution
LIMSI-CNRS, Paris Univ., Orsay
fYear
2006
fDate
28-30 June 2006
Firstpage
1
Lastpage
4
Abstract
Recent results have shown that phone lattices may significantly improve language recognition in a PPRLM (parallel phone recognition followed by language dependent modeling) framework. In this contribution we investigate the effectiveness of lattices to language identification (LID) when using different sets of multilingual phone and syllable inventories and corresponding multilingual acoustic phone models. The LID system that achieves best results is a PPRLM structure composed of four acoustic recognizers using both phone- and syllable-based decoders. A 7-language broadcast news corpus of approximately 150 hours of speech is used for training, development and test of our LID systems. Experimental results show that the use of lattices consistently improves LID results for all multilingual acoustic model sets and for both phonotactic and syllabotactic approaches
Keywords
acoustic signal processing; decoding; linguistics; natural languages; speech processing; speech recognition; 7-language broadcast news corpus; LID; PPRLM; language identification; lattice-based phonotactic approach; multilingual acoustic phone model; parallel phone recognition; syllable-based decoder; Acoustic testing; Broadcasting; Decoding; Lattices; NIST; Natural languages; Speech; Support vector machine classification; Support vector machines; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
Conference_Location
San Juan
Print_ISBN
1-424400471-1
Electronic_ISBN
1-4244-0472-X
Type
conf
DOI
10.1109/ODYSSEY.2006.248102
Filename
4013519
Link To Document