• DocumentCode
    2932431
  • Title

    Web-based topic language modeling for audio indexing

  • Author

    Iso, Ken-ichi

  • Author_Institution
    Yahoo! JAPAN Res., Yahoo Japan Corp., Tokyo, Japan
  • fYear
    2009
  • fDate
    June 28 2009-July 3 2009
  • Firstpage
    826
  • Lastpage
    829
  • Abstract
    We describe the implementation of a scalable architecture for audio indexing, in which topic-dependent language models (LMs) were trained on Web pages categorized in a portal Web directory and stored on distributed servers. Input speech was decoded in parallel on servers that each had an individual topic LM. From the decoders´ outputs, an optimal hypothesis was chosen for each utterance by a topic-selection criterion minimizing an energy function with three terms: likelihood scores for the utterances; keyword co-occurrence statistics to measure the long-distance correlation; and Web-based hypothesis verification scores, which penalize misrecognized trigrams through web search results. Experimental results showed that the proposed approach outperformed the baseline topic-independent system by 6.0% absolutely (20.0% relatively) in character accuracy.
  • Keywords
    Internet; information analysis; natural language processing; speech processing; Web-based hypothesis verification scores; Web-based topic language modeling; audio indexing; keyword cooccurrence statistics; optimal hypothesis; portal Web directory; topic-selection criterion; Decoding; Indexing; Linear discriminant analysis; Poles and towers; Scalability; Service oriented architecture; Speech recognition; System testing; Web pages; Web search; audio indexing; topic language models; utterance verification; web search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
  • Conference_Location
    New York, NY
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4244-4290-4
  • Electronic_ISBN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2009.5202622
  • Filename
    5202622