مرکز منطقه ای اطلاع رساني علوم و فناوري - Direct sub-word confidence estimation with hidden-state conditional random fields

DocumentCode :

178315

Title :

Direct sub-word confidence estimation with hidden-state conditional random fields

Author :

Seigel, M.S. ; Woodland, Philip C.

Author_Institution :

Eng. Dept., Cambridge Univ., Cambridge, UK

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

2307

Lastpage :

2311

Abstract :

The estimation of accurate confidence scores for sub-word-level units within automatic speech recognition (ASR) system transcriptions is investigated in this work. This is achieved through the application of linear-chain and hidden-state conditional random field (CRF) models to the task. A method for evaluating the significance of results quoted in terms of the normalised cross entropy (NCE) is also introduced. Instead of using sub-word-level information to improve wordlevel confidence scores, sub-word and word-level predictor features are combined to improve the accuracy of confidence scores in each sub-word being correct. The use of CRFs to model transitions between consecutive correct/incorrect sub-words yields large performance improvements. The scale of these gains is shown to increase further with the application of hidden-state CRFs. This is attributed to the fact that the hidden states make it possible for longer-span runs of consecutive correct/incorrect sub-words to be modelled, with these runs also not being constrained by word-level boundaries.

Keywords :

feature extraction; speech recognition; statistical analysis; ASR system; NCE; automatic speech recognition; confidence scores; direct sub-word confidence estimation; hidden-state CRF model; hidden-state conditional random fields; linear-chain CRF model; normalised cross entropy; sub-word predictor features; word-level boundaries; word-level predictor features; Entropy; Error analysis; Estimation; Feature extraction; Hidden Markov models; Lattices; Speech recognition; Hidden-state conditional random fields; confidence estimation; sub-words;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6854011

Filename :

6854011

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=178315