• DocumentCode
    1798022
  • Title

    Traffic information extraction from a blogging platform using knowledge-based approaches and bootstrapping

  • Author

    Aching S, Jorge L. ; de Oliveira, Thiago B. F. ; Bazzan, Ana L. C.

  • Author_Institution
    Inst. de Inf., Univ. Fed. do Rio Grande do Sul (UFRGS), Porto Alegre, Brazil
  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    6
  • Lastpage
    13
  • Abstract
    In this paper we propose a strategy to use messages posted in a blogging platform for real-time sensing of traffic-related information. Specifically, we use the data that appear in a blog, in Portuguese language, which is managed by a Brazilian daily newspaper on its online edition. We propose a framework based on two modules to infer the location and traffic condition from unstructured, non georeferenced short post in Portuguese. The first module relates to name-entity recognition (NER). It automatically recognizes three classes of named-entities (NEs) from the input post (LOCATION, STATUS and DATE). Here, a bootstrapping approach is used to expand the initially given list of locations, identifying new locations as well as locations corresponding to spelling variants and typographical errors of the known locations. The second module relates to relation extraction (RE). It extracts binary and ternary relations between such entities to obtain relevant traffic information. In our experiments, the NER module has yielded a F-measure of 96%, while the RE module resulted in 87%. Also, results show that our bootstrapping approach identifies 1;058 new locations when 10;000 short posts are analyzed.
  • Keywords
    Web sites; knowledge acquisition; knowledge based systems; natural language processing; statistical analysis; traffic information systems; F-measure; Portuguese language; blogging platform; bootstrapping; knowledge-based approach; name-entity recognition; nongeoreferenced short post; real-time sensing; relation extraction; traffic information extraction; Blogs; Context; Data mining; Knowledge based systems; Training; Vectors; Bootstrapping; Named-Entity Recognition; Relation Extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Vehicles and Transportation Systems (CIVTS), 2014 IEEE Symposium on
  • Conference_Location
    Orlando, FL
  • Type

    conf

  • DOI
    10.1109/CIVTS.2014.7009471
  • Filename
    7009471