Title :
KANSHIN: A Cross-Lingual Concern Analysis System Using Multilingual Blog Articles
Author :
Fukuhara, Tomohiro ; Kimura, Akifumi ; Arai, Yoshiaki ; Yoshinaka, Takayuki ; Masuda, Hidetaka ; Utsuro, Takehito ; Nakagawa, Hiroshi
Author_Institution :
RACE, Univ. of Tokyo, Kashiwa
Abstract :
This paper describes an architecture for cross-lingual concern analysis (CLCA) using multilingual blog articles, together with a prototype system. Because people living in many countries use the Web, cross-lingual information retrieval (CLIR) plays an important role in next-generation search. We propose CLCA as a CLIR application that helps users find the concerns of people across languages. We propose a layered architecture for CLCA and a prototype system called KANSHIN. The system collects Japanese, Chinese, Korean, and English blog articles and analyzes concerns across languages. Users can explore concerns from several viewpoints, such as temporal and geographical views and a network of blog sites. The system also helps users browse multilingual keywords using Wikipedia and find spam blogs. An overview of the CLCA architecture and the system is presented.
Keywords :
Web sites; information retrieval; natural language processing; KANSHIN; Web; Wikipedia; cross-lingual concern analysis system; cross-lingual information retrieval; multilingual blog articles; next generation search; spam blogs; Electronic mail; Information analysis; Information services; Internet; Natural languages; Prototypes; Social network services; concern analysis using multilingual weblogs; cross-lingual blog analysis
Conference_Title :
Information-Explosion and Next Generation Search, 2008. INGS '08. International Workshop on
Conference_Location :
Shenyang
Print_ISBN :
978-0-7695-3300-1
DOI :
10.1109/INGS.2008.20