DocumentCode
3302903
Title
Semantic Relation Extraction by Analysis of Terms Correlation in Documents
Author
Botero, Sergio William ; Ricarte, Ivan L M
Author_Institution
Dept. de Eng. de Comput. e Automacao Ind., Univ. Estadual de Campinas (UNICAMP), Campinas, Brazil
fYear
2009
fDate
8-11 Sept. 2009
Firstpage
17
Lastpage
26
Abstract
Ontologies are important to organize and describe information, but are hard to create and maintain, which motivates the development of tools to help in this task. This article presents a strategy to extract, from a corpora of documents in a given domain, semantic elements expressing proximity relations between terms and concepts to help the construction of domain ontologies. The technique presented here, ACT, is based on linguistic processing, machine learning, and biclustering. Results show that concepts obtained by ACT are at least as good as those from similar techniques, such as LSI and NMF. In relation to those techniques, it additionally has the advantage of allowing the supervision by a domain expert.
Keywords
Computer industry; Data mining; Humans; Indexing; Large scale integration; Machine learning; Matrix decomposition; Ontologies; Single event transient; Information retrieval; Information retrieval system; Ontology; Semantic; Text Processing(Computation);
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Human Language Technology (STIL), 2009 Seventh Brazilian Symposium in
Conference_Location
Sao Carlos, TBD, Brazil
Print_ISBN
978-1-4244-6008-3
Type
conf
DOI
10.1109/STIL.2009.18
Filename
5532434
Link To Document