DocumentCode
3485423
Title
The IBM 2011 GALE Arabic speech transcription system
Author
Mangu, Lidia ; Kuo, Hong-Kwang ; Chu, Stephen ; Kingsbury, Brian ; Saon, George ; Soltau, Hagen ; Biadsy, Fadi
Author_Institution
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
fYear
2011
fDate
11-15 Dec. 2011
Firstpage
272
Lastpage
277
Abstract
We describe the Arabic broadcast transcription system fielded by IBM in the GALE Phase 5 machine translation evaluation. Key advances over our Phase 4 system include a new Bayesian Sensing HMM acoustic model, multistream neural network features, a MADA vowelized acoustic model, and a variety of language model techniques that yield significant additive gains. These advances were instrumental in achieving a word error rate of 7.4% on the Phase 5 evaluation set and an absolute word error rate reduction of 0.9% over our 2009 system on the unsequestered Phase 4 evaluation data.
Keywords
Bayes methods; hidden Markov models; neural nets; speech processing; Bayesian sensing HMM acoustic model; GALE Phase 5 machine translation evaluation; IBM 2011 GALE Arabic speech transcription system; MADA vowelized acoustic model; language model techniques; multistream neural network features; Phase 4 system; unsequestered Phase 4 evaluation data; Acoustics; Computational modeling; Dictionaries; Hidden Markov models; Lattices; Training; Transforms; large vocabulary speech recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location
Waikoloa, HI
Print_ISBN
978-1-4673-0365-1
Electronic_ISBN
978-1-4673-0366-8
Type
conf
DOI
10.1109/ASRU.2011.6163943
Filename
6163943