Title :
Distributed service-oriented architecture for information extraction system "Semanta"
Author :
Jastrzebski, Lukasz ; Piasecki, Maciej ; Strzelecki, Grzegorz ; Wilkosz, Krzysztof
Author_Institution :
Inst. of Appl. Informatics, Wroclaw Univ. of Technol.
Abstract :
Our objective is to provide a flexible, scalable, distributed architecture that ensures high performance for information extraction (IE) systems operating on the Internet. The architecture combines the general paradigm of service-oriented architecture, the client-server approach, and a strong separation of concerns between storage and processing components. An experimental IE system, named Semanta, that uses the proposed architecture is also presented. In this paper, we describe the five main Semanta services: the Web user interface (WebUI), the Web crawler service (WCS), the parsing service (PS), the IE service and manager
Keywords :
Internet; information retrieval systems; user interfaces; Semanta services; Web crawler service; Web user interface; client-server approach; distributed service-oriented architecture; parsing service; Computer architecture; Crawlers; Data mining; Distributed computing; Informatics; Java; Service oriented architecture; User interfaces; Web and internet services; Web services;
Conference_Title :
Intelligent Systems Design and Applications, 2005. ISDA '05. Proceedings. 5th International Conference on
Conference_Location :
Warsaw
Print_ISBN :
0-7695-2286-6
DOI :
10.1109/ISDA.2005.39