• DocumentCode
    3059518
  • Title

    Distributed service-oriented architecture for information extraction system "Semanta"

  • Author

    Jastrzebski, Lukasz ; Piasecki, Meciej ; Strzelecki, Grzegorz ; Wilkosz, Krzysztof

  • Author_Institution
    Inst. of Appl. Informatics, Wroclaw Univ. of Technol.
  • fYear
    2005
  • fDate
    2005
  • Firstpage
    61
  • Lastpage
    66
  • Abstract
    Our objective is to provide a flexible, scalable, distributed architecture that assures a high performance for information extraction (IE) systems working in Internet. The architecture is based on both the general paradigm of the service-oriented architecture, client-server approach and strong separation of concerns between storage and processing components. An experimental IE system, named Semanta, utilising the proposed architecture is also presented. In the following document, we describe five main Semanta services, which are Web user interface (WebUI), Web crawler service (WCS), parsing service (PS), IE service and manager
  • Keywords
    Internet; information retrieval systems; user interfaces; Internet; Semanta services; Web crawler service; Web user interface; client-server approach; distributed service-oriented architecture; parsing service; Computer architecture; Crawlers; Data mining; Distributed computing; Informatics; Java; Service oriented architecture; User interfaces; Web and internet services; Web services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications, 2005. ISDA '05. Proceedings. 5th International Conference on
  • Conference_Location
    Warsaw
  • Print_ISBN
    0-7695-2286-6
  • Type

    conf

  • DOI
    10.1109/ISDA.2005.39
  • Filename
    1578761