DocumentCode
2964171
Title
Spoken term detection from bilingual spontaneous speech using code-switched lattice-based structures for words and subword units
Author
Lee, Hung-yi ; Tang, Yueh-Lien ; Tang, Hao ; Lee, Lin-shan
Author_Institution
Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear
2009
fDate
Nov. 13 2009-Dec. 17 2009
Firstpage
410
Lastpage
415
Abstract
This paper presents the first work known publicly on spoken term detection from bilingual spontaneous speech using code-switched lattice-based structures for word and subword units. The corpus used is the lectures with Chinese as the host language and English as the guest language recorded for a real course offered in National Taiwan University. The techniques reported here have been successfully implemented and tested in a real lecture system now available on-line over the Internet. We also present the approaches of using word fragment as the subword unit for English, and analyse the difficult issues when code-switched lattice-based structures for subword units are used for tasks involving languages of quite different natures.
Keywords
Internet; speech recognition; English host language; Internet; National Taiwan University; bilingual spontaneous speech; code switched lattice based structures; quite different natures; real course offered; real lecture system; spoken term detection; subword units; tasks involving languages; work known publicly; Audio recording; Computer science; Globalization; Information retrieval; Internet; Lattices; Natural languages; Speech processing; Speech recognition; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location
Merano
Print_ISBN
978-1-4244-5478-5
Electronic_ISBN
978-1-4244-5479-2
Type
conf
DOI
10.1109/ASRU.2009.5372901
Filename
5372901
Link To Document