• DocumentCode
    3739238
  • Title

    Paradigmatic Clustering for NLP

  • Author

    Julio Santisteban; Tejada-C?rcamo

  • Author_Institution
    Univ. Catolica San Pablo, Arequipa, Peru
  • fYear
    2015
  • Firstpage
    814
  • Lastpage
    820
  • Abstract
    How can we retrieve meaningful information from a large and sparse graph?. Traditional approaches focus on generic clustering techniques and discovering dense cumulus in a network graph, however, they tend to omit interesting patterns such as the paradigmatic relations. In this paper, we propose a novel graph clustering technique modelling the relations of a node using the paradigmatic analysis. We exploit node´s relations to extract its existing sets of signifiers. The newly found clusters represent a different view of a graph, which provides interesting insights into the structure of a sparse network graph. Our proposed algorithm PaC (Paradigmatic Clustering) for clustering graphs uses paradigmatic analysis supported by a asymmetric similarity, in contrast to traditional graph clustering methods, our algorithm yields worthy results in tasks of word-sense disambiguation. In addition we propose a novel paradigmatic similarity measure. Extensive experiments and empirical analysis are used to evaluate our algorithm on synthetic and real data.
  • Keywords
    "Clustering algorithms","Algorithm design and analysis","Bipartite graph","Dolphins","Mutual information","Benchmark testing","Conferences"
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshop (ICDMW), 2015 IEEE International Conference on
  • Electronic_ISBN
    2375-9259
  • Type

    conf

  • DOI
    10.1109/ICDMW.2015.233
  • Filename
    7395752