DocumentCode :
1798022
Title :
Traffic information extraction from a blogging platform using knowledge-based approaches and bootstrapping
Author :
Aching S, Jorge L. ; de Oliveira, Thiago B. F. ; Bazzan, Ana L. C.
Author_Institution :
Inst. de Inf., Univ. Fed. do Rio Grande do Sul (UFRGS), Porto Alegre, Brazil
fYear :
2014
fDate :
9-12 Dec. 2014
Firstpage :
6
Lastpage :
13
Abstract :
In this paper we propose a strategy to use messages posted in a blogging platform for real-time sensing of traffic-related information. Specifically, we use the data that appear in a blog, in Portuguese language, which is managed by a Brazilian daily newspaper on its online edition. We propose a framework based on two modules to infer the location and traffic condition from unstructured, non georeferenced short post in Portuguese. The first module relates to name-entity recognition (NER). It automatically recognizes three classes of named-entities (NEs) from the input post (LOCATION, STATUS and DATE). Here, a bootstrapping approach is used to expand the initially given list of locations, identifying new locations as well as locations corresponding to spelling variants and typographical errors of the known locations. The second module relates to relation extraction (RE). It extracts binary and ternary relations between such entities to obtain relevant traffic information. In our experiments, the NER module has yielded a F-measure of 96%, while the RE module resulted in 87%. Also, results show that our bootstrapping approach identifies 1;058 new locations when 10;000 short posts are analyzed.
Keywords :
Web sites; knowledge acquisition; knowledge based systems; natural language processing; statistical analysis; traffic information systems; F-measure; Portuguese language; blogging platform; bootstrapping; knowledge-based approach; name-entity recognition; nongeoreferenced short post; real-time sensing; relation extraction; traffic information extraction; Blogs; Context; Data mining; Knowledge based systems; Training; Vectors; Bootstrapping; Named-Entity Recognition; Relation Extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence in Vehicles and Transportation Systems (CIVTS), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
Type :
conf
DOI :
10.1109/CIVTS.2014.7009471
Filename :
7009471
Link To Document :
بازگشت