• DocumentCode
    1857748
  • Title

    Impact of automatic comma prediction on pos/name tagging of speech

  • Author

    Hillard, D. ; Huang, Z. ; Ji, Hong ; Grishman, R. ; Hakkani-Tur, D. ; Harper, M. ; Ostendorf, M. ; Wang, W.

  • Author_Institution
    Electr. Eng. Dept, Univ. of Washington, Seattle, WA
  • fYear
    2006
  • fDate
    10-13 Dec. 2006
  • Firstpage
    58
  • Lastpage
    61
  • Abstract
    This work looks at the impact of automatically predicted commas on part-of-speech (POS) and name tagging of speech recognition transcripts of Mandarin broadcast news. There is a significant gain in both POS and name tagging accuracy due to using automatically predicted commas over sentence boundary prediction alone. One difference between Mandarin and English is that there are two types of commas, and experiments here show that, while they can be reliably distinguished in automatic prediction, the distinction does not give a clear benefit for POS or name tagging.
  • Keywords
    natural language processing; speech recognition; Mandarin broadcast news; automatic comma prediction; name tagging; part-of-speech; speech recognition transcripts; Automatic speech recognition; Broadcasting; Data mining; Error analysis; Natural languages; Performance loss; Speech processing; Speech recognition; Tagging; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2006. IEEE
  • Conference_Location
    Palm Beach
  • Print_ISBN
    1-4244-0872-5
  • Type

    conf

  • DOI
    10.1109/SLT.2006.326816
  • Filename
    4123361