مرکز منطقه ای اطلاع رساني علوم و فناوري - Efficient data selection for spoken document retrieval based on prior confidence estimation using speech and context independent models

DocumentCode :

2330035

Title :

Efficient data selection for spoken document retrieval based on prior confidence estimation using speech and context independent models

Author :

Kobashikawa, S. ; Asami, Takuya ; Yamaguchi, Yoshio ; Masataki, Hirokazu ; Takahashi, Satoshi

Author_Institution :

NTT Cyber Space Labs., NTT Corp., Tokyo, Japan

fYear :

2010

fDate :

12-15 Dec. 2010

Firstpage :

200

Lastpage :

205

Abstract :

This paper proposes an efficient speech sample selection technique that can identify those samples that will be well recognized. Conventional confidence measures can identify well-recognized speech samples, but they require speech recognition to estimate confidence scores. Speech samples with low confidence should not undergo recognition since they yield speech documents that will eventually be rejected. The proposed technique can select the samples that will justify the application of speech recognition. It is based on rapid prior confidence estimation by using speech and context independent models to calculate acoustic likelihood values on a frame-by-frame basis. Tests show that the proposed confidence estimation technique is over 50 times faster than the conventional posterior confidence measure while maintaining equivalent data selection performance for speech recognition and spoken document retrieval.

Keywords :

document handling; information retrieval; speech recognition; acoustic likelihood values; confidence estimation; context independent model; data selection; speech independent model; speech recognition; speech sample selection technique; spoken document retrieval; confidence measure; data selection; speech recognition; spoken document retrieval;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Spoken Language Technology Workshop (SLT), 2010 IEEE

Conference_Location :

Berkeley, CA

Print_ISBN :

978-1-4244-7904-7

Electronic_ISBN :

978-1-4244-7902-3

Type :

conf

DOI :

10.1109/SLT.2010.5700851

Filename :

5700851

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2330035