DocumentCode
2180638
Title
Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition
Author
Kurata, Gakuto ; Itoh, Nobuyasu ; Nishimura, Masafumi ; Sethy, Abhinav ; Ramabhadran, Bhuvana
Author_Institution
IBM Res. - Tokyo, Yamato, Japan
fYear
2011
fDate
22-27 May 2011
Firstpage
5572
Lastpage
5575
Abstract
Named Entity (NE) recognition from the results of Automatic Speech Recognition (ASR) is challenging because of ASR errors. To detect NEs, one of the options is to use a statistical NE model that is usually trained with ASR one-best results. In order to make NE recognition more robust to ASR errors, we propose using Word Confusion Networks (WCNs), sequences of bundled words, for both NE modeling and recognition by regarding the word bundles as units instead of the independent words. This is done by clustering similar word bundles that may originate from the same word. We trained the NE models with the maximum entropy principle and evaluated the performance using real-life call-center data. The results showed that by using the WCNs, the error of NE recognition was relatively reduced by up to 33.0%.
Keywords
call centres; maximum entropy methods; speech recognition; ASR; NE recognition; WCN; automatic speech recognition; conversational telephone speech leveraging word confusion networks; maximum entropy principle; named entity recognition; real-life call-center data; word confusion network; Companies; Context; Speech recognition; Training; Conversational Telephone Speech; Maximum Entropy Model; Named Entity Recognition; Word Confusion Network;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947622
Filename
5947622
Link To Document