• DocumentCode
    2052980
  • Title

    Named Entity Recognition of Spoken Documents Using Subword Units

  • Author

    Paaß, Gerhard ; Pilz, Anja ; Schwenninger, Jochen

  • Author_Institution
    Fraunhofer Inst. Intell. Anal. & Inf. Syst. (IAIS), St. Augustin, Germany
  • fYear
    2009
  • fDate
    14-16 Sept. 2009
  • Firstpage
    529
  • Lastpage
    534
  • Abstract
    The output of a speech recognition system is a stream of text features that is overlayed by noise resulting from errors in the system´s statistical classification of the audio input. Conditional random fields (CRFs), which have already proven themselves to be efficient, high-performance named entity recognizers (NERs) for named entities from text, offer the promise to compensate part of these errors. In this paper, we use CRFs to extract named entities from spoken audio documents. We consider a real-world audio information extraction scenario under which CRFs are trained to recognize named entities in unedited radio audio documents that have been converted into a stream of text features by a speech recognition system. The automatic speech recognition system (ASR) is able to produce word transcriptions as well as syllables. It uses general speaker-independent acoustic models and a domain-independent statistical language model, insuring that recognizer performance is not specific to the experimental domain. Using an additional syllable model increases the generality of the spoken document classification system, giving it the flexibility to handle words that are not present in the vocabulary. In this paper we apply for the first time CRFs to different features produced by German ASR. The experiments confirm that using transcribed syllables together with words can compensate for part of the NER errors caused by ASR transcription.
  • Keywords
    document handling; random processes; speech recognition; audio information extraction; automatic speech recognition system; conditional random field; named entity recognition; spoken document classification system; statistical language model; subword units; syllable model; Automatic speech recognition; Data mining; Hidden Markov models; Information analysis; Information systems; Speech analysis; Speech recognition; Streaming media; Text recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing, 2009. ICSC '09. IEEE International Conference on
  • Conference_Location
    Berkeley, CA
  • Print_ISBN
    978-1-4244-4962-0
  • Electronic_ISBN
    978-0-7695-3800-6
  • Type

    conf

  • DOI
    10.1109/ICSC.2009.78
  • Filename
    5298608