DocumentCode
3136743
Title
Japanese Ellipsis Resolution in "A NO B" Noun Phrases for Colloquial Inquiry Text Using Latent Topic Models
Author
Harada, Tatsuya ; Suzuki, Nobuhiro ; Tsuda, Kazuhiko
Author_Institution
Grad. Sch. of Syst. & Inf. Eng., Univ. of Tsukuba, Tokyo, Japan
fYear
2013
fDate
2-5 Dec. 2013
Firstpage
901
Lastpage
908
Abstract
Generally inquiries through Web forms and e-mails are increasing. These inquiry texts usually include many informal expressions use of the colloquial style, such as a spoken language, and many omitted words. An omitted word causes the meaning of a sentence to become ambiguous and may make the reader misread and misunderstand the context. In this paper we focus on the frequently omitted noun ``B´´ in the noun phrase ``A NO1 B´´ (usually meaning B of A) seen in the colloquial style inquiry text and propose a method to predict omitted noun ``B´´ from context and knowledge using topic information. From the results of the evaluation experiment, we have confirmed that our method improved 11.34 points from the conventional method, and predicted the omitted word with an accuracy rate of more than 75% using ``Latent Dirichlet Allocation´´ (LDA.).
Keywords
natural language processing; text analysis; A NO1 B noun phrases; Japanese ellipsis resolution; LDA; colloquial style inquiry text; informal expressions; latent Dirichlet allocation; latent topic models; omitted noun prediction; omitted words; spoken language; topic information; Accuracy; Context; DVD; Educational institutions; Electronic mail; Mathematical model; Probability; Colloquial expressions; Ellipsis; Gibbs sampling; LDA; Statistical topic models;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal-Image Technology & Internet-Based Systems (SITIS), 2013 International Conference on
Conference_Location
Kyoto
Type
conf
DOI
10.1109/SITIS.2013.147
Filename
6727297
Link To Document