DocumentCode :
538555
Title :
Obtaining term similarities on concept extraction study
Author :
Balkan, Kerime ; Takçi, Hidayet
Author_Institution :
Gebze Yuksek Teknoloji Enstitusu, Gebze, Turkey
fYear :
2010
fDate :
2-5 Dec. 2010
Firstpage :
578
Lastpage :
582
Abstract :
Concept extraction work, promises to improve the performance of the term-based text mining which has high complexity. The first phase of the concept extraction is to detect the terms have notable frequency to represent the documents. With grouping these terms an important function will be implemented on the way conception. Transition from terms to concepts; by clustering the terms according to similarities between terms, and then by labeling these clusters with an expert. The parameters of clustering algorithm and the quality of the data set will affect the success of this process. In this study, the three methods for term similarity are examined and the the most successful one is tried to find. Study is performed on Turkish language.
Keywords :
data mining; natural language processing; pattern clustering; text analysis; Turkish language; clustering algorithm; concept extraction study; term-based text mining; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Digital signal processing; Information retrieval; Semantics; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical, Electronics and Computer Engineering (ELECO), 2010 National Conference on
Conference_Location :
Bursa
Print_ISBN :
978-1-4244-9588-7
Electronic_ISBN :
978-605-01-0013-6
Type :
conf
Filename :
5698108
Link To Document :
بازگشت