DocumentCode :
2704066
Title :
High-Accuracy Phone Recognition By Combining High-Performance Lattice Generation and Knowledge Based Rescoring
Author :
Siniscalchi, Sabato M. ; Schwarz, Petr ; Lee, Chin-Hui
Author_Institution :
Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
This study is a result of a collaboration project between two groups, one from Brno University of Technology and the other from Georgia Institute of Technology (GT). Recently the Brno recognizer is known to outperform many state-of-the-art systems on phone recognition, while the GT knowledge-based lattice rescoring module has been shown to improve system performance on a number of speech recognition tasks. We believe a combination of the two system results in high-accuracy phone recognition. To integrate the two very different modules, we modify Brno´s phone recognizer into a phone lattice hypothesizer to produce high-quality phone lattices, and feed them directly into the knowledge-based module to rescore the lattices. We test the combined system on the TIMIT continuous phone recognition task without retraining the individual subsystems, and we observe that the phone error rate was effectively reduced to 19.78% from 24.41% produced by the Brno phone recognizer. To the best of the authors´ knowledge this result represents the lowest ever error rate reported on the TIMIT continuous phone recognition task.
Keywords :
knowledge based systems; speech recognition; Brno University of Technology; Brno recognizer; Georgia Institute of Technology; TIMIT continuous phone recognition task; collaboration project; high-accuracy phone recognition; high-performance lattice generation; high-quality phone lattices; knowledge based rescoring; phone error rate; phone lattice hypothesizer; state-of-the-art systems; Automatic speech recognition; Collaboration; Decoding; Error analysis; Feeds; Hidden Markov models; Knowledge engineering; Lattices; Speech recognition; System performance; Knowledge based system; hidden Markov models; neural networks; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2007.367208
Filename :
4218239
Link To Document :
بازگشت