Title :
An agent-based system framework for multi-slot Web information extraction
Author :
Zhang, Shudong ; Qin, Ye ; Yao, Naiming
Author_Institution :
Coll. of Inf. Eng., Capital Normal Univ., Beijing, China
Abstract :
At present, the scale and diversity of Web information are immense. Acquiring Web information by relying on search engines alone is increasingly unable to meet user needs, so Web information extraction (WebIE) technology has attracted wide attention. In this paper, a framework for a distributed, agent-based multi-slot WebIE system is proposed. It includes a user agent, a mediator agent, a wrapper agent, a data store agent, a page preprocessing agent, and a corresponding knowledge base. The agents communicate with each other and cooperate to achieve the overall goal of the system. Moreover, for multi-slot extraction, approaches to extraction rule learning and repair are presented, which enhance the adaptability of the system.
Keywords :
Internet; information retrieval; knowledge based systems; multi-agent systems; agent-based system framework; data store agent; distributed multislot WebIE system; knowledge-base system; mediator agent; multislot Web information extraction; page preprocessing agent; search engine; user agent; wrapper agent; Artificial intelligence; Asia; Automatic control; Data mining; Informatics; Internet; Robot control; Robotics and automation; Search engines; Web pages; Web information extraction; agent; distributed; extraction rule;
Conference_Title :
Informatics in Control, Automation and Robotics (CAR), 2010 2nd International Asia Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5192-0
Electronic_ISBN :
1948-3414
DOI :
10.1109/CAR.2010.5456664