DocumentCode :
2932431
Title :
Web-based topic language modeling for audio indexing
Author :
Iso, Ken-ichi
Author_Institution :
Yahoo! JAPAN Res., Yahoo Japan Corp., Tokyo, Japan
fYear :
2009
fDate :
June 28 2009-July 3 2009
Firstpage :
826
Lastpage :
829
Abstract :
We describe the implementation of a scalable architecture for audio indexing, in which topic-dependent language models (LMs) were trained on Web pages categorized in a portal Web directory and stored on distributed servers. Input speech was decoded in parallel on servers that each had an individual topic LM. From the decoders´ outputs, an optimal hypothesis was chosen for each utterance by a topic-selection criterion minimizing an energy function with three terms: likelihood scores for the utterances; keyword co-occurrence statistics to measure the long-distance correlation; and Web-based hypothesis verification scores, which penalize misrecognized trigrams through web search results. Experimental results showed that the proposed approach outperformed the baseline topic-independent system by 6.0% absolutely (20.0% relatively) in character accuracy.
Keywords :
Internet; information analysis; natural language processing; speech processing; Web-based hypothesis verification scores; Web-based topic language modeling; audio indexing; keyword cooccurrence statistics; optimal hypothesis; portal Web directory; topic-selection criterion; Decoding; Indexing; Linear discriminant analysis; Poles and towers; Scalability; Service oriented architecture; Speech recognition; System testing; Web pages; Web search; audio indexing; topic language models; utterance verification; web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location :
New York, NY
ISSN :
1945-7871
Print_ISBN :
978-1-4244-4290-4
Electronic_ISBN :
1945-7871
Type :
conf
DOI :
10.1109/ICME.2009.5202622
Filename :
5202622
Link To Document :
بازگشت