DocumentCode
2143199
Title
Sem@ntica: A system for semantic extraction and logical querying of text corpora
Author
McMichael, Daniel W. ; Fu, Raymond ; Williams, Simon ; Jarrad, Geoff A.
Author_Institution
Commonwealth Sci. & Ind. Res. Organ. of Australia (CSIRO), Clayton, VIC
fYear
2008
fDate
17-20 June 2008
Firstpage
277
Lastpage
278
Abstract
Sem@ntica is a system for extracting the information contained in collections of documents into a knowledge base. It combines high quality conventional named entity analysis with an ontology class labeling capability for open class words. The ontology comprises an upper ontology and one or more domain ontologies. The system has tools for rapidly designing the ontology and mapping segments of Word Net on to ontology classes. The extracted knowledge base can be queried directly using the KM language and tools are provided for rapid construction of dynamic web page reports and for portal design. The system is open and its capabilities can be controlled and viewed via an extended suite of dynamic Web page visualizations. Its component capabilities are available via TCP/IP remote procedure calls and via SOAP Web services. Sem@ntica is designed to provide the front-end processing for national and commercial intelligence processing; it has a wide field of potential application.
Keywords
Web services; document handling; knowledge acquisition; KM language; SOAP Web services; Sem@ntica; Word Net; dynamic Web page; entity analysis; information extraction; knowledge extraction; logical querying; ontology class labeling capability; semantic extraction; text corpora; Control systems; Data mining; Labeling; Ontologies; Portals; Simple object access protocol; TCPIP; Visualization; Web pages; Web services; artificial intelligence; information extraction; knowledge base; named entity analysis; ontology;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligence and Security Informatics, 2008. ISI 2008. IEEE International Conference on
Conference_Location
Taipei
Print_ISBN
978-1-4244-2414-6
Electronic_ISBN
978-1-4244-2415-3
Type
conf
DOI
10.1109/ISI.2008.4565083
Filename
4565083
Link To Document