DocumentCode :
1955725
Title :
Automatic keyword extraction with relational clustering and Levenshtein distances
Author :
Runkler, Thomas A. ; Bezdek, James C.
Author_Institution :
Corp. Technol., Siemens AG, Munich, Germany
Volume :
2
fYear :
2000
fDate :
2000
Firstpage :
636
Abstract :
Alternating cluster estimation (ACE) is a generalized clustering model. Relational ACE is a modification of ACE that can be used to cluster data which do not possess a clear numerical representation, but for which a meaningful relation matrix can be defined. For text data sets we define (pairwise) relation matrices based on the Levenshtein string distance (1966). Relational ACE with Levenshtein distances is applied to four different texts. The cluster centers represent typical words in the texts, so this algorithm can be used to automatically determine keywords
Keywords :
matrix algebra; pattern clustering; text analysis; ACE; Levenshtein distances; Levenshtein string distance; alternating cluster estimation; automatic keyword extraction; generalized clustering model; numerical representation; pairwise relation matrices; relation matrix; relational clustering; Clustering algorithms; Communications technology; Computer science; Data analysis; Data mining; Image sensors; Image sequence analysis; Optimization methods; Pixel; Prototypes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems, 2000. FUZZ IEEE 2000. The Ninth IEEE International Conference on
Conference_Location :
San Antonio, TX
ISSN :
1098-7584
Print_ISBN :
0-7803-5877-5
Type :
conf
DOI :
10.1109/FUZZY.2000.839067
Filename :
839067
Link To Document :
بازگشت