DocumentCode
1798022
Title
Traffic information extraction from a blogging platform using knowledge-based approaches and bootstrapping
Author
Aching S, Jorge L. ; de Oliveira, Thiago B. F. ; Bazzan, Ana L. C.
Author_Institution
Inst. de Inf., Univ. Fed. do Rio Grande do Sul (UFRGS), Porto Alegre, Brazil
fYear
2014
fDate
9-12 Dec. 2014
Firstpage
6
Lastpage
13
Abstract
In this paper we propose a strategy to use messages posted in a blogging platform for real-time sensing of traffic-related information. Specifically, we use the data that appear in a blog, in Portuguese language, which is managed by a Brazilian daily newspaper on its online edition. We propose a framework based on two modules to infer the location and traffic condition from unstructured, non georeferenced short post in Portuguese. The first module relates to name-entity recognition (NER). It automatically recognizes three classes of named-entities (NEs) from the input post (LOCATION, STATUS and DATE). Here, a bootstrapping approach is used to expand the initially given list of locations, identifying new locations as well as locations corresponding to spelling variants and typographical errors of the known locations. The second module relates to relation extraction (RE). It extracts binary and ternary relations between such entities to obtain relevant traffic information. In our experiments, the NER module has yielded a F-measure of 96%, while the RE module resulted in 87%. Also, results show that our bootstrapping approach identifies 1;058 new locations when 10;000 short posts are analyzed.
Keywords
Web sites; knowledge acquisition; knowledge based systems; natural language processing; statistical analysis; traffic information systems; F-measure; Portuguese language; blogging platform; bootstrapping; knowledge-based approach; name-entity recognition; nongeoreferenced short post; real-time sensing; relation extraction; traffic information extraction; Blogs; Context; Data mining; Knowledge based systems; Training; Vectors; Bootstrapping; Named-Entity Recognition; Relation Extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence in Vehicles and Transportation Systems (CIVTS), 2014 IEEE Symposium on
Conference_Location
Orlando, FL
Type
conf
DOI
10.1109/CIVTS.2014.7009471
Filename
7009471
Link To Document