• DocumentCode
    2180638
  • Title

    Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition

  • Author

    Kurata, Gakuto ; Itoh, Nobuyasu ; Nishimura, Masafumi ; Sethy, Abhinav ; Ramabhadran, Bhuvana

  • Author_Institution
    IBM Res. - Tokyo, Yamato, Japan
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5572
  • Lastpage
    5575
  • Abstract
    Named Entity (NE) recognition from the results of Automatic Speech Recognition (ASR) is challenging because of ASR errors. To detect NEs, one of the options is to use a statistical NE model that is usually trained with ASR one-best results. In order to make NE recognition more robust to ASR errors, we propose using Word Confusion Networks (WCNs), sequences of bundled words, for both NE modeling and recognition by regarding the word bundles as units instead of the independent words. This is done by clustering similar word bundles that may originate from the same word. We trained the NE models with the maximum entropy principle and evaluated the performance using real-life call-center data. The results showed that by using the WCNs, the error of NE recognition was relatively reduced by up to 33.0%.
  • Keywords
    call centres; maximum entropy methods; speech recognition; ASR; NE recognition; WCN; automatic speech recognition; conversational telephone speech leveraging word confusion networks; maximum entropy principle; named entity recognition; real-life call-center data; word confusion network; Companies; Context; Speech recognition; Training; Conversational Telephone Speech; Maximum Entropy Model; Named Entity Recognition; Word Confusion Network;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947622
  • Filename
    5947622