DocumentCode :
2247293
Title :
An agent-based system framework for multi-slot Web information extraction
Author :
Zhang, Shudong ; Qin, Ye ; Yao, Naiming
Author_Institution :
Coll. of Inf. Eng., Capital Normal Univ., Beijing, China
Volume :
3
fYear :
2010
fDate :
6-7 March 2010
Firstpage :
200
Lastpage :
203
Abstract :
At present, the scale and diversity of Web information are immense. Acquiring Web information simply relies on search engine which is increasingly unable to meet user needs, thus Web information extraction (WebIE) technology attracts widely attentions. In this paper, a framework of distributed multi-slot WebIE system based on agent is proposed. It includes user agent, mediator agent, wrapper agent, data store agent, page preprocessing agent and corresponding knowledge base. The agents communicate each other and cooperate together to carry out the general goal of the system. Moreover, aiming at multi-slot extraction, the approaches of extraction rule learning and repair are presented, which enable to enhance adaptability of the system.
Keywords :
Internet; information retrieval; knowledge based systems; multi-agent systems; agent-based system framework; data store agent; distributed multislot WebIE system; knowledge-base system; mediator agent; multislot Web information extraction; page preprocessing agent; search engine; user agent; wrapper agent; Artificial intelligence; Asia; Automatic control; Data mining; Informatics; Internet; Robot control; Robotics and automation; Search engines; Web pages; Web information extraction; agent; distributed; extraction rule;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Informatics in Control, Automation and Robotics (CAR), 2010 2nd International Asia Conference on
Conference_Location :
Wuhan
ISSN :
1948-3414
Print_ISBN :
978-1-4244-5192-0
Electronic_ISBN :
1948-3414
Type :
conf
DOI :
10.1109/CAR.2010.5456664
Filename :
5456664
Link To Document :
بازگشت