• DocumentCode
    792316
  • Title

    Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese

  • Author

    Chen, Berlin ; Wang, Hsin-Min ; Lee, Lin-shan

  • Author_Institution
    Inst. of Inf. Sci., Acad. Sinica, Taipei, Taiwan
  • Volume
    10
  • Issue
    5
  • fYear
    2002
  • fDate
    7/1/2002 12:00:00 AM
  • Firstpage
    303
  • Lastpage
    314
  • Abstract
    With the rapidly growing use of the audio and multimedia information over the Internet, the technology for retrieving speech information using voice queries is becoming more and more important. In this paper, considering the monosyllabic structure of the Chinese language, a whole class of syllable-based indexing features, including overlapping segments of syllables and syllable pairs separated by a few syllables, is extensively investigated based on a Mandarin broadcast news database. The strong discriminating capabilities of such syllable-based features were verified by comparing with the word- or character-based features. Good approaches for better utilizing such capabilities, including fusion with the word- and character-level information and improved approaches to obtain better syllable-based features and query expressions, were extensively investigated. Very encouraging experimental results were obtained.
  • Keywords
    feature extraction; information retrieval; natural languages; speech processing; Chinese language; Internet; Mandarin Chinese; Mandarin broadcast news database; audio information; character-based features; discriminating capabilities; monosyllabic structure; multimedia information; overlapping segments; query expressions; speech information retrieval; syllable-based approaches; syllable-based indexing features; voice retrieval; word-based features; Digital multimedia broadcasting; Indexing; Information retrieval; Information science; Internet; Multimedia communication; Natural languages; Spatial databases; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2002.802541
  • Filename
    1021073