Title :
Web-based topic language modeling for audio indexing
Author_Institution :
Yahoo! JAPAN Res., Yahoo Japan Corp., Tokyo, Japan
fDate :
June 28 2009-July 3 2009
Abstract :
We describe the implementation of a scalable architecture for audio indexing, in which topic-dependent language models (LMs) were trained on Web pages categorized in a portal Web directory and stored on distributed servers. Input speech was decoded in parallel on servers that each had an individual topic LM. From the decoders´ outputs, an optimal hypothesis was chosen for each utterance by a topic-selection criterion minimizing an energy function with three terms: likelihood scores for the utterances; keyword co-occurrence statistics to measure the long-distance correlation; and Web-based hypothesis verification scores, which penalize misrecognized trigrams through web search results. Experimental results showed that the proposed approach outperformed the baseline topic-independent system by 6.0% absolutely (20.0% relatively) in character accuracy.
Keywords :
Internet; information analysis; natural language processing; speech processing; Web-based hypothesis verification scores; Web-based topic language modeling; audio indexing; keyword cooccurrence statistics; optimal hypothesis; portal Web directory; topic-selection criterion; Decoding; Indexing; Linear discriminant analysis; Poles and towers; Scalability; Service oriented architecture; Speech recognition; System testing; Web pages; Web search; audio indexing; topic language models; utterance verification; web search;
Conference_Titel :
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
978-1-4244-4290-4
Electronic_ISBN :
1945-7871
DOI :
10.1109/ICME.2009.5202622