Title :
On the abstraction and presentation of multi-source knowledge
Author :
Wang, Hsien-chang ; Chan, Yueh-chin
Author_Institution :
Dept. of Inf. Manage., Chang Jung Christian Univ., Tainan
Abstract :
This paper proposed a knowledge abstraction and presentation system by information gathered Internet web pages. Documents gathered from different Websites are first segmented into different paragraphs according to their topics. The linguistic processing such as word segmentation, word tagging and word frequency evaluation are applied to these corpora first. Then two types of similarities are calculated in our study: the paragraph-based and sentence-based similarity.
Keywords :
Internet; abstracting; information retrieval; natural language processing; word processing; Internet web pages; knowledge abstraction; knowledge presentation; linguistic processing; mean opinion score evaluation; multi-source knowledge; paragraph-based similarity; sentence-based similarity; word frequency evaluation; word segmentation; word tagging; Birds; Cybernetics; Data mining; Frequency; Information filtering; Information filters; Intelligent systems; Internet; Machine learning; Web pages;
Conference_Titel :
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2095-7
Electronic_ISBN :
978-1-4244-2096-4
DOI :
10.1109/ICMLC.2008.4620976