• DocumentCode
    2964171
  • Title
    Spoken term detection from bilingual spontaneous speech using code-switched lattice-based structures for words and subword units
  • Author
    Lee, Hung-yi ; Tang, Yueh-Lien ; Tang, Hao ; Lee, Lin-shan
  • Author_Institution
    Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2009
  • fDate
    Dec. 13 2009-Dec. 17 2009
  • Firstpage
    410
  • Lastpage
    415
  • Abstract
    This paper presents the first publicly known work on spoken term detection from bilingual spontaneous speech using code-switched lattice-based structures for word and subword units. The corpus consists of lectures, with Chinese as the host language and English as the guest language, recorded for a real course offered at National Taiwan University. The techniques reported here have been successfully implemented and tested in a real lecture system that is now available online. We also present approaches that use word fragments as the subword unit for English, and analyse the difficulties that arise when code-switched lattice-based structures for subword units are applied to tasks involving languages of very different natures.
  • Keywords
    Internet; speech recognition; English host language; National Taiwan University; bilingual spontaneous speech; code switched lattice based structures; quite different natures; real course offered; real lecture system; spoken term detection; subword units; tasks involving languages; work known publicly; Audio recording; Computer science; Globalization; Information retrieval; Lattices; Natural languages; Speech processing; System testing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
  • Conference_Location
    Merano
  • Print_ISBN
    978-1-4244-5478-5
  • Electronic_ISBN
    978-1-4244-5479-2
  • Type
    conf
  • DOI
    10.1109/ASRU.2009.5372901
  • Filename
    5372901