Title :
Japanese sentence compression using Simple English Wikipedia
Author :
Shunsuke Takeno;Kazuhide Yamamoto
Author_Institution :
Nagaoka University of Technology, 1603-1 Kamitomioka, Niigata, 940-2188 Japan
Abstract :
We describe a cross-lingual approach for sentence compression of articles of Japanese Wikipedia using the correspondence of articles of Simple English Wikipedia. Taking advantages of the nature of the corpus, we can find essential parts from encyclopedic description without highly depending on the statistical information which are noisy. We manually explored the correspondences between the articles of Japanese Wikipedia and those of Simple English Wikipedia and then proposed a cross-lingual alignment method using simple matching algorithm. We provide an analysis of the abovementioned correspondence and the preliminary result of sentence compression using Simple English Wikipedia.
Keywords :
"Encyclopedias","Electronic publishing","Internet","Indexes"
Conference_Titel :
Asian Language Processing (IALP), 2015 International Conference on
Print_ISBN :
978-1-4673-9595-3
DOI :
10.1109/IALP.2015.7451533