DocumentCode :
1931404
Title :
Mining Multilingual Texts using Growing Hierarchical Self-Organizing Maps
Author :
Yang, Hsin-Chang ; Chen, Ding-Wen ; Lee, Chung-Hong
Author_Institution :
Nat. Univ. of Kaohsiung, Kaohsiung
Volume :
4
fYear :
2007
fDate :
19-22 Aug. 2007
Firstpage :
2263
Lastpage :
2268
Abstract :
The WWW provides a vast source of information covering all kinds of knowledge in many languages. The emerging need to search documents in different languages has recently made multilingual information retrieval an active research topic. The performance of such a task depends on how well the relationships between different languages are understood. Multilingual text mining aims at discovering interesting relationships between different languages. In this work, we apply the growing hierarchical self-organizing map model to cluster multilingual text documents and find the relationships between two languages. We use a set of parallel corpora to train the map and apply a discovery process to identify the semantic groups and hierarchical structures of keywords for these languages. The discovered knowledge can then be applied to tasks such as multilingual information retrieval and automatic multilingual thesaurus construction.
Keywords :
Internet; computational linguistics; data mining; information retrieval; pattern clustering; self-organising feature maps; text analysis; thesauri; WWW; automatic multilingual thesaurus construction; document searching; growing hierarchical self-organizing maps; interesting relationship discovery; multilingual information retrieval; multilingual text document clustering; multilingual text mining; Cybernetics; Data mining; Information management; Information retrieval; Machine learning; Natural languages; Self organizing feature maps; Text mining; Thesauri; World Wide Web; Growing hierarchical self-organizing Map; Multilingual information retrieval; Multilingual text mining;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
Type :
conf
DOI :
10.1109/ICMLC.2007.4370522
Filename :
4370522