• DocumentCode
    2536194
  • Title

    Discovering Informative Syntactic Relationships between Named Entities in Biomedical Literature

  • Author

    Appice, Annalisa ; Ceci, Michelangelo ; Loglisci, Corrado

  • Author_Institution
    Dipt. di Inf., Univ. degli Studi di Bari, Bari, Italy
  • fYear
    2010
  • fDate
    11-16 April 2010
  • Firstpage
    120
  • Lastpage
    125
  • Abstract
    The discovery of new and potentially meaningful relationships between named entities in biomedical literature can take great advantage from the application of multirelational data mining approaches in text mining. This is motivated by the peculiarity of multi-relational data mining to be able to express and manipulate relationships between entities. We investigate the application of such an approach to address the task of identifying informative syntactic structures, which are frequent in biomedical abstract corpora. Initially, named entities are annotated in text corpora according to some biomedical dictionary (e.g. MeSH taxonomy). Tagged entities are then integrated in syntactic structures with the role of subject and/or object of the corresponding verb. These structures are represented in a first-order language. Multi-relational approach to frequent pattern discovery allows to identify the verb-based relationships between the named entities which frequently occur in the corpora. Preliminary experiments with a collection of abstracts obtained by querying Medline on a specific disease are reported.
  • Keywords
    data mining; dictionaries; medical administrative data processing; medical computing; text analysis; MeSH taxonomy; Medline querying; biomedical abstract corpora; biomedical dictionary; biomedical literature; frequent pattern discovery; informative syntactic relationships; multirelational data mining; named entities; text corpora; text mining; verb-based relationships; Abstracts; Data mining; Databases; Dictionaries; Diseases; Proteins; Search methods; Speech; Taxonomy; Text mining; Biomedical Literature; Frequent Pattern Discovery; Multi-Relational Data Mining; Syntactic Text Structure;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Databases Knowledge and Data Applications (DBKDA), 2010 Second International Conference on
  • Conference_Location
    Menuires
  • Print_ISBN
    978-1-4244-6081-6
  • Type

    conf

  • DOI
    10.1109/DBKDA.2010.14
  • Filename
    5477135