DocumentCode :
3132163
Title :
Improved semantic retrieval of spoken content by language models enhanced with acoustic similarity graph
Author :
Hung-yi Lee ; Tsung-Hsien Wen ; Lin-Shan Lee
Author_Institution :
Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear :
2012
fDate :
2-5 Dec. 2012
Firstpage :
182
Lastpage :
187
Abstract :
Retrieving objects semantically related to the query has been widely studied in text information retrieval. However, when applying the text-based techniques on spoken content, the inevitable recognition errors may seriously degrade the performance. In this paper, we propose to enhance the expected term frequencies estimated from spoken content by acoustic similarity graphs. For each word in the lexicon, a graph is constructed describing acoustic similarity among spoken segments in the archive. Score propagation over the graph helps in estimating the expected term frequencies. The enhanced expected term frequencies can be used in the language modeling retrieval approach, as well as semantic retrieval techniques such as the document expansion based on latent semantic analysis, and query expansion considering both words and latent topic information. Preliminary experiments performed on Mandarin broadcast news indicated that improved performance were achievable under different conditions.
Keywords :
graph theory; information retrieval; natural language processing; text analysis; Mandarin broadcast news; acoustic similarity graph; enhanced expected term frequencies; expected term frequencies; improved semantic retrieval; inevitable recognition errors; language modeling retrieval approach; language models; latent semantic analysis; lexicon; query expansion; score propagation; semantic retrieval techniques; spoken content; text information retrieval; text-based techniques; Acoustics; Computational modeling; Information retrieval; Interpolation; Lattices; Manuals; Semantics; Document Expansion; Latent Semantic Analysis; Query Expansion; Random Walk;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4673-5125-6
Electronic_ISBN :
978-1-4673-5124-9
Type :
conf
DOI :
10.1109/SLT.2012.6424219
Filename :
6424219
Link To Document :
بازگشت