• DocumentCode
    3132163
  • Title

    Improved semantic retrieval of spoken content by language models enhanced with acoustic similarity graph

  • Author

    Hung-yi Lee ; Tsung-Hsien Wen ; Lin-Shan Lee

  • Author_Institution
    Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2012
  • fDate
    2-5 Dec. 2012
  • Firstpage
    182
  • Lastpage
    187
  • Abstract
    Retrieving objects semantically related to the query has been widely studied in text information retrieval. However, when applying the text-based techniques on spoken content, the inevitable recognition errors may seriously degrade the performance. In this paper, we propose to enhance the expected term frequencies estimated from spoken content by acoustic similarity graphs. For each word in the lexicon, a graph is constructed describing acoustic similarity among spoken segments in the archive. Score propagation over the graph helps in estimating the expected term frequencies. The enhanced expected term frequencies can be used in the language modeling retrieval approach, as well as semantic retrieval techniques such as the document expansion based on latent semantic analysis, and query expansion considering both words and latent topic information. Preliminary experiments performed on Mandarin broadcast news indicated that improved performance were achievable under different conditions.
  • Keywords
    graph theory; information retrieval; natural language processing; text analysis; Mandarin broadcast news; acoustic similarity graph; enhanced expected term frequencies; expected term frequencies; improved semantic retrieval; inevitable recognition errors; language modeling retrieval approach; language models; latent semantic analysis; lexicon; query expansion; score propagation; semantic retrieval techniques; spoken content; text information retrieval; text-based techniques; Acoustics; Computational modeling; Information retrieval; Interpolation; Lattices; Manuals; Semantics; Document Expansion; Latent Semantic Analysis; Query Expansion; Random Walk;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2012 IEEE
  • Conference_Location
    Miami, FL
  • Print_ISBN
    978-1-4673-5125-6
  • Electronic_ISBN
    978-1-4673-5124-9
  • Type

    conf

  • DOI
    10.1109/SLT.2012.6424219
  • Filename
    6424219