DocumentCode
3059518
Title
Distributed service-oriented architecture for information extraction system "Semanta"
Author
Jastrzebski, Lukasz ; Piasecki, Meciej ; Strzelecki, Grzegorz ; Wilkosz, Krzysztof
Author_Institution
Inst. of Appl. Informatics, Wroclaw Univ. of Technol.
fYear
2005
fDate
2005
Firstpage
61
Lastpage
66
Abstract
Our objective is to provide a flexible, scalable, distributed architecture that assures a high performance for information extraction (IE) systems working in Internet. The architecture is based on both the general paradigm of the service-oriented architecture, client-server approach and strong separation of concerns between storage and processing components. An experimental IE system, named Semanta, utilising the proposed architecture is also presented. In the following document, we describe five main Semanta services, which are Web user interface (WebUI), Web crawler service (WCS), parsing service (PS), IE service and manager
Keywords
Internet; information retrieval systems; user interfaces; Internet; Semanta services; Web crawler service; Web user interface; client-server approach; distributed service-oriented architecture; parsing service; Computer architecture; Crawlers; Data mining; Distributed computing; Informatics; Java; Service oriented architecture; User interfaces; Web and internet services; Web services;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems Design and Applications, 2005. ISDA '05. Proceedings. 5th International Conference on
Conference_Location
Warsaw
Print_ISBN
0-7695-2286-6
Type
conf
DOI
10.1109/ISDA.2005.39
Filename
1578761
Link To Document