Title :
N-best decision for Thai stressed speech recognition with parallel hidden Makov model
Author :
Amomkul, P. ; Kumhom, P. ; Chanmongthai, K.
Author_Institution :
Fac. of Eng., King Mongkut´´s Univ. of Technol., Bangkok, Thailand
Abstract :
In integrating multi-isolated-word recognizers into a speech recognition for various stressed speeches, the best likelihood scopes as outputs of each recognizer are not guaranteed a correct recognition result. Since training sometimes does not cover all speakers, likelihood score of the correct recognition result is not the best and causes misrecognition. Moreover, the difference among recognizers also leads to mis-understanding. This paper proposes a decision-making method for Thai stressed speech recognition with parallel hidden Markov model. In this method, a voting scheme is applied on the words with the N-best likelihood score. Firstly, if the score margin between the first and the second-best is greater than a threshold, the voting is applied on the words with highest scores from each recognizer. If there is no clear winner, decided by considering the voting score, the next best score are included into the voting scheme. The process goes on until a winner is found or there is a tied score, its which case the average of the likelihood score of each tied word is used to decide the winner. The experiments were conducted with 4-stress speeches including angry, toward, loud, and neutral. It showed that the proposed method helped increase the recognition rate to 96.545% comparing with previous decision making techniques.
Keywords :
decision making; hidden Markov models; natural languages; speech recognition; N-best decision; Thai stressed speech recognition; angry speech; decision-making method; lombard speech; loud speech; multiisolated-word recognizers; neutral speech; parallel hidden Makov model; voting scheme; Continuous time systems; Decision making; Hidden Markov models; Lattices; Minimization methods; Speech recognition; Topology; Voting; Waste materials;
Conference_Titel :
Intelligent Signal Processing and Communication Systems, 2005. ISPACS 2005. Proceedings of 2005 International Symposium on
Print_ISBN :
0-7803-9266-3
DOI :
10.1109/ISPACS.2005.1595337