• DocumentCode
    1034461
  • Title

    Combining information extraction with genetic algorithms for text mining

  • Author

    Atkinson-Abutridy, John ; Mellish, Chris ; Aitken, Stuart

  • Author_Institution
    Edinburgh Univ., UK
  • Volume
    19
  • Issue
    3
  • fYear
    2004
  • Firstpage
    22
  • Lastpage
    30
  • Abstract
    An evolutionary approach that combines information extraction technology and genetic algorithms can produce a new, integrated model for text mining. Text mining discovers unseen patterns in textual databases. We´ve brought together the benefits of GAs for data mining and IE technology to propose a new approach for high-level knowledge discovery. Unlike previous KDT approaches, our model doesn´t rely on external resources or conceptual descriptions. Instead, it performs the discovery using only information from the original corpus of text documents and from training data computed from them. The GA that produces the hypotheses is strongly guided by semantic constraints, which means that several specifically defined metrics evaluate the quality and plausibility.
  • Keywords
    data mining; genetic algorithms; KDT; data mining; genetic algorithms; information extraction; knowledge discovery; text mining; textual database; Data mining; Databases; Genetic algorithms; Information analysis; Natural languages; Pattern analysis; Performance analysis; Robustness; Text mining; Training data; genetic algorithms; knowledge discovery from texts; multiobjective optimization; semantic analysis; text mining;
  • fLanguage
    English
  • Journal_Title
    Intelligent Systems, IEEE
  • Publisher
    ieee
  • ISSN
    1541-1672
  • Type

    jour

  • DOI
    10.1109/MIS.2004.4
  • Filename
    1315537