• DocumentCode
    3302903
  • Title

    Semantic Relation Extraction by Analysis of Terms Correlation in Documents

  • Author

    Botero, Sergio William ; Ricarte, Ivan L M

  • Author_Institution
    Dept. de Eng. de Comput. e Automacao Ind., Univ. Estadual de Campinas (UNICAMP), Campinas, Brazil
  • fYear
    2009
  • fDate
    8-11 Sept. 2009
  • Firstpage
    17
  • Lastpage
    26
  • Abstract
    Ontologies are important for organizing and describing information, but they are hard to create and maintain, which motivates the development of tools to support this task. This article presents a strategy to extract, from a corpus of documents in a given domain, semantic elements expressing proximity relations between terms and concepts, to help in the construction of domain ontologies. The technique presented here, ACT, is based on linguistic processing, machine learning, and biclustering. Results show that concepts obtained by ACT are at least as good as those from similar techniques, such as LSI and NMF. Compared with those techniques, it has the additional advantage of allowing supervision by a domain expert.
  • Keywords
    Computer industry; Data mining; Humans; Indexing; Large scale integration; Machine learning; Matrix decomposition; Ontologies; Single event transient; Information retrieval; Information retrieval system; Ontology; Semantic; Text Processing (Computation)
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information and Human Language Technology (STIL), 2009 Seventh Brazilian Symposium in
  • Conference_Location
    Sao Carlos, Brazil
  • Print_ISBN
    978-1-4244-6008-3
  • Type

    conf

  • DOI
    10.1109/STIL.2009.18
  • Filename
    5532434