• DocumentCode
    2180493
  • Title

    Handling verbose queries for spoken document retrieval

  • Author

    Lin, Shih-Hsiang ; Jan, Ea-Ee ; Chen, Berlin

  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5552
  • Lastpage
    5555
  • Abstract
    Query-by-example information retrieval provides users a flexible but efficient way to accurately describe their information needs. The query exemplars are usually long and in the form of either a partial or even a full document. However, they may contain extraneous terms that would have potential negative impacts on the retrieval performance. In order to alleviate those negative impacts, we propose a novel term-based query reduction mechanism so as to improve the informativeness of verbose query exemplars. We also explore the notion of term discrimination power to select a salient subset of query terms automatically. Experiments on the TDT Chinese collection show that the proposed approach is indeed effective and promising.
  • Keywords
    document handling; natural language processing; query processing; speech recognition; TDT Chinese collection; query by example information retrieval; spoken document retrieval; term based query reduction mechanism; verbose queries handling; verbose query exemplars; Entropy; Hidden Markov models; Information retrieval; Markov processes; Semantics; Supervised learning; Training; Query-by-example; information retrieval; term-based query reduction; verbose query;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947617
  • Filename
    5947617