Title :
Retrieval and browsing of spoken content
Author :
Chelba, C. ; Hazen, Timothy J. ; Saraclar, Murat
Author_Institution :
Johns Hopkins Univ., Baltimore
fDate :
5/1/2008 12:00:00 AM
Abstract :
Ever-increasing computing power and connectivity bandwidth, together with falling storage costs, are resulting in an overwhelming amount of data of various types being produced, exchanged, and stored. Consequently, information search and retrieval has emerged as a key application area. Text-based search is the most active area, with applications that range from Web and local network search to searching for personal information residing on one´s own hard-drive. Speech search has received less attention perhaps because large collections of spoken material have previously not been available. However, with cheaper storage and increased broadband access, there has been a subsequent increase in the availability of online spoken audio content such as news broadcasts, podcasts, and academic lectures. A variety of personal and commercial uses also exist. As data availability increases, the lack of adequate technology for processing spoken documents becomes the limiting factor to large-scale access to spoken content. In this article, we strive to discuss the technical issues involved in the development of information retrieval systems for spoken audio documents, concentrating on the issue of handling the errorful or incomplete output provided by ASR systems. We focus on the usage case where a user enters search terms into a search engine and is returned a collection of spoken document hits.
Keywords :
audio signal processing; content-based retrieval; search engines; speech recognition; ASR systems; information retrieval systems; search engine; speech search; spoken audio documents; spoken content browsing; spoken content retrieval; text-based search; Automatic speech recognition; Bandwidth; Computer errors; Content based retrieval; Costs; Digital audio broadcasting; Document handling; Information retrieval; Large-scale systems; Search engines;
Journal_Title :
Signal Processing Magazine, IEEE
DOI :
10.1109/MSP.2008.917992