DocumentCode
1857748
Title
Impact of automatic comma prediction on pos/name tagging of speech
Author
Hillard, D. ; Huang, Z. ; Ji, Hong ; Grishman, R. ; Hakkani-Tur, D. ; Harper, M. ; Ostendorf, M. ; Wang, W.
Author_Institution
Electr. Eng. Dept, Univ. of Washington, Seattle, WA
fYear
2006
fDate
10-13 Dec. 2006
Firstpage
58
Lastpage
61
Abstract
This work looks at the impact of automatically predicted commas on part-of-speech (POS) and name tagging of speech recognition transcripts of Mandarin broadcast news. There is a significant gain in both POS and name tagging accuracy due to using automatically predicted commas over sentence boundary prediction alone. One difference between Mandarin and English is that there are two types of commas, and experiments here show that, while they can be reliably distinguished in automatic prediction, the distinction does not give a clear benefit for POS or name tagging.
Keywords
natural language processing; speech recognition; Mandarin broadcast news; automatic comma prediction; name tagging; part-of-speech; speech recognition transcripts; Automatic speech recognition; Broadcasting; Data mining; Error analysis; Natural languages; Performance loss; Speech processing; Speech recognition; Tagging; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location
Palm Beach
Print_ISBN
1-4244-0872-5
Type
conf
DOI
10.1109/SLT.2006.326816
Filename
4123361
Link To Document