DocumentCode :
3228705
Title :
TM-Gen: A Topic Map Generator from Text Documents
Author :
Garrido, Angel L. ; Buey, Maria G. ; Escudero, Sandra ; Ilarri, Sergio ; Mena, Eduardo ; Silveira, Sara B.
Author_Institution :
IIS Dept., Univ. of Zaragoza, Zaragoza, Spain
fYear :
2013
fDate :
4-6 Nov. 2013
Firstpage :
735
Lastpage :
740
Abstract :
The vast amount of text documents stored in digital format is growing at a frantic rhythm each day. Therefore, tools able to find accurate information searching in natural language information repositories are gaining great interest in recent years. In this context, there are especially interesting tools capable of dealing with large amounts of text information and deriving human-readable summaries. However, one step further is to be able not only to summarize, but to extract the knowledge stored in those texts, and even represent it graphically. In this paper we present an architecture to generate automatically a conceptual representation of knowledge stored in a set of text-based documents. For this purpose we have used the topic maps standard and we have developed a method that combines text mining, statistics, linguistic tools, and semantics to obtain a graphical representation of the information contained therein, which can be coded using a knowledge representation language such as RDF or OWL. The procedure is language-independent, fully automatic, self-adjusting, and it does not need manual configuration by the user. Although the validation of a graphic knowledge representation system is very subjective, we have been able to take advantage of an intermediate product of the process to make a experimental validation of our proposals.
Keywords :
data mining; knowledge representation languages; natural language processing; text analysis; OWL; RDF; TM-Gen; digital format; frantic rhythm; graphic knowledge representation system; graphical representation; human-readable summaries; information searching; knowledge representation language; linguistic tools; natural language information repositories; semantics; statistics; text information; text mining; text-based documents; topic map generator; topic maps standard; Context; Data mining; Ontologies; Proposals; Redundancy; Semantics; Syntactics; Knowledge acquisition; Linguistics; Ontologies; Semantics; Text mining; Topic maps;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2013 IEEE 25th International Conference on
Conference_Location :
Herndon, VA
ISSN :
1082-3409
Print_ISBN :
978-1-4799-2971-9
Type :
conf
DOI :
10.1109/ICTAI.2013.113
Filename :
6735324
Link To Document :
بازگشت