DocumentCode :
2330035
Title :
Efficient data selection for spoken document retrieval based on prior confidence estimation using speech and context independent models
Author :
Kobashikawa, S. ; Asami, Takuya ; Yamaguchi, Yoshio ; Masataki, Hirokazu ; Takahashi, Satoshi
Author_Institution :
NTT Cyber Space Labs., NTT Corp., Tokyo, Japan
fYear :
2010
fDate :
12-15 Dec. 2010
Firstpage :
200
Lastpage :
205
Abstract :
This paper proposes an efficient speech sample selection technique that can identify those samples that will be well recognized. Conventional confidence measures can identify well-recognized speech samples, but they require speech recognition to estimate confidence scores. Speech samples with low confidence should not undergo recognition since they yield speech documents that will eventually be rejected. The proposed technique can select the samples that will justify the application of speech recognition. It is based on rapid prior confidence estimation by using speech and context independent models to calculate acoustic likelihood values on a frame-by-frame basis. Tests show that the proposed confidence estimation technique is over 50 times faster than the conventional posterior confidence measure while maintaining equivalent data selection performance for speech recognition and spoken document retrieval.
Keywords :
document handling; information retrieval; speech recognition; acoustic likelihood values; confidence estimation; context independent model; data selection; speech independent model; speech recognition; speech sample selection technique; spoken document retrieval; confidence measure; data selection; speech recognition; spoken document retrieval;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2010 IEEE
Conference_Location :
Berkeley, CA
Print_ISBN :
978-1-4244-7904-7
Electronic_ISBN :
978-1-4244-7902-3
Type :
conf
DOI :
10.1109/SLT.2010.5700851
Filename :
5700851
Link To Document :
بازگشت