DocumentCode :
294581
Title :
Improved topic spotting through statistical modelling of keyword dependencies
Author :
Wright, Jerry H. ; Carey, Michael J. ; Parris, Eluned S.
Author_Institution :
Ensigma Ltd., Chepstow, UK
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
313
Abstract :
Keywords are chosen on the basis of their usefulness for discriminating a topic from background speech. Good topic recognition can be achieved with a small set of well-chosen keywords, but particular combinations of keywords often achieve better discrimination than can be accounted for by regarding them as independent. This paper describes a higher-order statistical approach involving models of keyword-topic interdependence. A linear-logistic model brings some improvement in performance, but better results are obtained using log-linear contingency table models. Although the potential number of these is very large, good models tend to be simple and are suggested by heuristic measures inferred from the training data. The approach is tested using a broadcast radio database
Keywords :
higher order statistics; radio broadcasting; speech processing; speech recognition; background speech; broadcast radio database; discrimination; heuristic measures; higher-order statistical approach; keyword dependencies; keyword-topic interdependence; linear-logistic mode; log-linear contingency table models; performance; statistical modelling; topic recognition; topic spotting; training data; Character generation; Databases; Frequency measurement; Logistics; Radio broadcasting; Smoothing methods; Speech; Testing; Training data; Yield estimation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479536
Filename :
479536
Link To Document :
بازگشت