• DocumentCode
    1798351
  • Title

    Ensemble learning for keyword extraction from event descriptions

  • Author

    Geadas, Pedro ; Alves, Ana ; Ribeiro, Bernardete

  • Author_Institution
    Center of Inf. & Syst., Univ. of Coimbra, Coimbra, Portugal
  • fYear
    2014
  • fDate
    6-11 July 2014
  • Firstpage
    2669
  • Lastpage
    2676
  • Abstract
    Automatic keyword extraction (AKE) from textual sources took a valuable step towards harnessing the problem of efficient scanning of large document collections. Particularly in the context of urban mobility, where the most relevant events in the city are advertised on-line, it becomes difficult to know exactly what is happening in a place. In this paper we tackle this problem by extracting a set of keywords from different kinds of textual sources, focusing on the urban events context. We propose an ensemble of automatic keyword extraction systems KEA (Keyphrase Extraction Algorithm) and KUSCO (Knowledge Unsupervised Search for instantiating Concepts on lightweight Ontologies) and Conditional Random Fields (CRF). Unlike KEA and KUSCO which are well-known tools for automatic keyword extraction, CRF needs further preprocessing. Therefore, a tool for handling AKE from the documents using CRF is developed. The architecture for the AKE ensemble system is designed and efficient integration of component applications is achieved. Finally, we empirically show that our AKE ensemble system significantly succeeds on baseline sources and urban events collections.
  • Keywords
    feature extraction; information retrieval; ontologies (artificial intelligence); random processes; text analysis; unsupervised learning; AKE ensemble system; CRF; KEA; KUSCO; automatic keyword extraction; conditional random fields; document collection; ensemble learning; event description; keyphrase extraction algorithm; knowledge unsupervised search for instantiating concepts on lightweight ontologies; textual sources; urban event collection; urban mobility; Abstracts; Data mining; Feature extraction; Focusing; Neural networks; Pragmatics; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), 2014 International Joint Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-6627-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2014.6889911
  • Filename
    6889911