DocumentCode
2789650
Title
Morphological and syntactic features for Arabic speech recognition
Author
Kuo, Hong-Kwang Jeff ; Mangu, Lidia ; Emami, Ahmad ; Zitouni, Imed
Author_Institution
IBM T.J.Watson Res. Center, Yorktown Heights, NY, USA
fYear
2010
fDate
14-19 March 2010
Firstpage
5190
Lastpage
5193
Abstract
In this paper, we study the use of morphological and syntactic context features to improve speech recognition of a morphologically rich language like Arabic. We examine a variety of syntactic features, including part-of-speech tags, shallow parse tags, and exposed head words and their non-terminal labels both before and after the word to be predicted. Neural network LMs are used to model these features since they generalize better to unseen events by modeling words and other context features in continuous space. Using morphological and syntactic features, we can improve the word error rate (WER) significantly on various test sets, including EVAL´08U, the unsequestered portion of the DARPA GALE Phase 3 evaluation test set.
Keywords
neural nets; speech recognition; Arabic speech recognition; exposed head words; morphological feature; neural network language model; part-of-speech tags; shallow parse tags; syntactic feature; word error rate; Context modeling; Error analysis; Feature extraction; Lattices; Morphology; Natural languages; Neural networks; Speech recognition; Testing; Vocabulary; Syntax; morphology; neural network language model; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495010
Filename
5495010
Link To Document