DocumentCode :
3362138
Title :
Extracting Web table information in cooperative learning activities based on abstract semantic model
Author :
Gu, Ning ; Wu, Guowen ; Wu, Xiaoyuan ; Shi, Baile
Author_Institution :
Dept. of Comput. Sci. & Eng., Fudan Univ., Shanghai, China
fYear :
2001
fDate :
2001
Firstpage :
492
Lastpage :
497
Abstract :
A great deal of Web table information exists in cooperative learning activities. This paper presents a new method for extracting information from tables in Web documents. By using a tabled abstract semantic model to describe complicated tables and to interpret tables semantically, the method makes the extraction process less dependent on differences in how tables are constructed. It also uses the characteristics of HTML and natural language processing techniques to design heuristic rules that aid the identification of table items. On this basis, we design a prototype, “EXTable”, which achieves better results in experiments.
Keywords :
computational linguistics; educational computing; groupware; hypermedia markup languages; information resources; natural languages; teaching; EXTable; HTML; Web documents; Web table information extraction; abstract semantic model; complicated tables; cooperative learning activities; extraction process; heuristic rules; natural language processing; semantics; table constructions; table items; Computer science; Context modeling; Data mining; HTML; Internet; Natural languages; Process design; Prototypes; Search engines; Web pages;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Supported Cooperative Work in Design, The Sixth International Conference on, 2001
Conference_Location :
London, Ont.
Print_ISBN :
0-660-18493-1
Type :
conf
DOI :
10.1109/CSCWD.2001.942309
Filename :
942309