DocumentCode :
1961069
Title :
A content-based Chinese speech document retrieval system design and implementation
Author :
Zhong, Cencen ; Miao, Zhenjiang ; Zhang, Jie ; Du, Luyan ; Kang, Dandan
Author_Institution :
Inst. of Inf. Sci., Beijing Jiaotong Univ., Beijing, China
fYear :
2009
fDate :
23-26 Aug. 2009
Firstpage :
117
Lastpage :
122
Abstract :
The rapid development of speech processing technology provides a potential for speech retrieval. This paper designs and implements a content-based Chinese speech document retrieval system using keyword spotting and text classification. In this system, a segment of unknown spontaneous speech will be converted into a series of keywords and then classified into a certain category, called topic, hoping to establish a retrieval model with two-level semantic information, which enables users to search for desired speech by keyword or topic query. Besides, based on the theory of mutual information, text classification is also used to react on the keywords to remove some false alarms. This paper mainly describes the structure, principle and completion situation of this retrieval system, finally gives the experimental results and discussions.
Keywords :
content-based retrieval; natural languages; speech processing; text analysis; content-based Chinese speech document retrieval system; keyword spotting; speech processing technology; text classification; Automatic speech recognition; Content based retrieval; Explosions; Information retrieval; Information science; Mutual information; Sampling methods; Signal processing; Speech processing; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications, Computers and Signal Processing, 2009. PacRim 2009. IEEE Pacific Rim Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
978-1-4244-4560-8
Electronic_ISBN :
978-1-4244-4561-5
Type :
conf
DOI :
10.1109/PACRIM.2009.5291386
Filename :
5291386
Link To Document :
بازگشت