DocumentCode
3739238
Title
Paradigmatic Clustering for NLP
Author
Julio Santisteban; Tejada-C?rcamo
Author_Institution
Univ. Catolica San Pablo, Arequipa, Peru
fYear
2015
Firstpage
814
Lastpage
820
Abstract
How can we retrieve meaningful information from a large and sparse graph?. Traditional approaches focus on generic clustering techniques and discovering dense cumulus in a network graph, however, they tend to omit interesting patterns such as the paradigmatic relations. In this paper, we propose a novel graph clustering technique modelling the relations of a node using the paradigmatic analysis. We exploit node´s relations to extract its existing sets of signifiers. The newly found clusters represent a different view of a graph, which provides interesting insights into the structure of a sparse network graph. Our proposed algorithm PaC (Paradigmatic Clustering) for clustering graphs uses paradigmatic analysis supported by a asymmetric similarity, in contrast to traditional graph clustering methods, our algorithm yields worthy results in tasks of word-sense disambiguation. In addition we propose a novel paradigmatic similarity measure. Extensive experiments and empirical analysis are used to evaluate our algorithm on synthetic and real data.
Keywords
"Clustering algorithms","Algorithm design and analysis","Bipartite graph","Dolphins","Mutual information","Benchmark testing","Conferences"
Publisher
ieee
Conference_Titel
Data Mining Workshop (ICDMW), 2015 IEEE International Conference on
Electronic_ISBN
2375-9259
Type
conf
DOI
10.1109/ICDMW.2015.233
Filename
7395752
Link To Document