DocumentCode :
169969
Title :
Exploratory Information Extraction from a Historical Dictionary
Author :
De Paiva, Valeria ; Borges Oliveira, Dario Augusto ; Higuchi, Suemi ; Rademaker, Alexandre ; De Melo, Gerard
Author_Institution :
Nuance Commun., USA
Volume :
2
fYear :
2014
fDate :
20-24 Oct. 2014
Firstpage :
11
Lastpage :
18
Abstract :
We describe a preliminary project of extracting information from an extant dictionary of historical biographies, the "Dicionário Histórico-Biográfico Brasileiro" (the Brazilian Historical and Biographical Dictionary, shortened as DHBB), a longstanding project at the 'Centro de Pesquisa e Documentação de História Contemporânea do Brasil' (CPDOC) of the Fundação Getulio Vargas (FGV). For information extraction, we rely on Natural Language Processing tools such as FreeLing as well as our resources NomLex-PT, a lexicon of nominalizations, and OpenWN-PT, a Portuguese version of Princeton's WordNet database. While our project currently highlights the potential of information extraction in a fun exploratory manner, we also discuss the engaging of historians interested in the affordances of digital tools.
Keywords :
biographies; dictionaries; history; natural language processing; Brazilian Historical and Biographical Dictionary; DHBB; Dicionário Histórico-Biográfico Brasileiro; FreeLing; NomLex-PT; OpenWN-PT; Princeton WordNet database; digital tools; exploratory information extraction; historical biographies; historical dictionary; natural language processing tools; Biographies; Cities and towns; Dictionaries; History; Organizations; Semantics; Biographical Dictionary; information extraction; nlp; nominalization; nomlex; wordnet
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
e-Science (e-Science), 2014 IEEE 10th International Conference on
Conference_Location :
Sao Paulo
Print_ISBN :
978-1-4799-4288-6
Type :
conf
DOI :
10.1109/eScience.2014.50
Filename :
6972090