• DocumentCode
    699232
  • Title
    Combination of phone N-grams for a MPEG-7-based spoken document retrieval system
  • Author
    Moreau, Nicolas ; Hyoung-Gook Kim ; Sikora, Thomas
  • Author_Institution
    Dept. of Commun. Syst., Tech. Univ. of Berlin, Berlin, Germany
  • fYear
    2004
  • fDate
    6-10 Sept. 2004
  • Firstpage
    549
  • Lastpage
    552
  • Abstract
    In this paper, we present a phone-based approach to spoken document retrieval (SDR), developed in the framework of the emerging MPEG-7 standard. The audio part of MPEG-7 aims at standardizing the indexing of audio documents. It includes a SpokenContent tool that provides a framework for describing the semantic content of speech signals. In the context of MPEG-7, we propose an indexing and retrieval method that uses only phonetic information and a vector space IR model. Different strategies based on phone N-gram indexing terms are evaluated.
  • Keywords
    information retrieval; speech processing; MPEG-7-based spoken document retrieval system; SpokenContent tool; audio documents; phone N-gram indexing terms; phone N-grams combination; phonetic information; semantic content; vector space IR model; speech signals; Abstracts; Films; Indexing; Lattices; Transform coding
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing Conference, 2004 12th European
  • Conference_Location
    Vienna
  • Print_ISBN
    978-320-0001-65-7
  • Type
    conf
  • Filename
    7079762
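
The indexing idea summarized in the abstract (phone N-grams used as indexing terms in a vector space IR model) can be illustrated with a minimal sketch. The abstract does not specify term weighting, the N-gram orders used, or how N-grams of different orders are combined, so everything here is an assumption made for illustration: raw counts as weights, orders 1 to 3 pooled into a single vector, cosine similarity for ranking, and hypothetical phone transcriptions. It is not the authors' method.

```python
from collections import Counter
from math import sqrt


def phone_ngrams(phones, n):
    """Return all overlapping phone N-grams of length n as tuples."""
    return [tuple(phones[i:i + n]) for i in range(len(phones) - n + 1)]


def index_terms(phones, orders=(1, 2, 3)):
    """Pool N-gram counts of several orders into one term vector.

    Assumption: simple pooling of raw counts; the record does not
    describe the combination strategy actually used in the paper.
    """
    counts = Counter()
    for n in orders:
        counts.update(phone_ngrams(phones, n))
    return counts


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm = sqrt(sum(w * w for w in a.values())) * sqrt(sum(w * w for w in b.values()))
    return dot / norm if norm else 0.0


def rank(query_phones, documents, orders=(1, 2, 3)):
    """Rank documents (name -> phone list) by similarity to the query."""
    query_vec = index_terms(query_phones, orders)
    scored = [
        (cosine(query_vec, index_terms(phones, orders)), name)
        for name, phones in documents.items()
    ]
    return sorted(scored, reverse=True)


if __name__ == "__main__":
    # Hypothetical phone transcriptions, for illustration only.
    docs = {
        "doc_berlin": ["b", "er", "l", "ih", "n"],
        "doc_vienna": ["v", "iy", "eh", "n", "ah"],
    }
    for score, name in rank(["b", "er", "l"], docs):
        print(f"{name}: {score:.3f}")
```

Restricting `orders` to a single value, for example `orders=(3,)`, gives a single-order index for comparison against the pooled one.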