DocumentCode :
119764
Title :
Hypothesis combination for Slovak dictation speech recognition
Author :
Lojka, Martin ; Juhar, Jozef
Author_Institution :
Tech. Univ. of Kosice, Kosice, Slovakia
fYear :
2014
fDate :
10-12 Sept. 2014
Firstpage :
1
Lastpage :
4
Abstract :
Combination of multiple speech recognition systems is the most used method for improving speech recognition accuracy. The combination is performed at the feature level or the systems are exchanging informations between each other during decoding process or are combined afterwards using their outputs in form of N-best hypothesis or lattices. This paper provides initial experiments with system combination for Slovak language speech recognition using well known combination tool, the Recognition Output Voting Error Reduction (ROVER) from National Institute of Standards and Technology (NIST). Two kinds of scores provided to ROVER are here explored. The first one is based on normalized posterior probabilities and the second one on confidence scores of words in recognized sentence. Also new method for improving the efficiency of combination by smoothing the scores is presented.
Keywords :
natural language processing; probability; smoothing methods; speech coding; speech recognition; N-best hypothesis; N-best lattices; NIST; National Institute of Standards and Technology; ROVER; Slovak dictation speech recognition; Slovak language speech recognition; confidence scores; decoding process; hypothesis combination; normalized posterior probabilities; recognition output voting error reduction; score smoothing; speech recognition accuracy improvement; Decoding; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Smoothing methods; Speech recognition; Hypothesis Combination; ROVER; Speech Recognition; System Combination;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
ELMAR (ELMAR), 2014 56th International Symposium
Conference_Location :
Zadar
Type :
conf
DOI :
10.1109/ELMAR.2014.6923311
Filename :
6923311
Link To Document :
بازگشت