• DocumentCode
    3489482
  • Title

    Word Sense Induction Using Correlated Topic Model

  • Author

    Hoang, T.T. ; Nguyen, P.T.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Eng. & Technol., Hanoi, Vietnam
  • fYear
    2012
  • fDate
    13-15 Nov. 2012
  • Firstpage
    41
  • Lastpage
    44
  • Abstract
    Word sense induction (WSI) is the problem of automatic identification of word senses given the corpus. This paper presents a method for solving WSI problem based on the context clustering approach. The idea behind this approach is that similar contexts indicate similar meanings. Specifically, we have successfully applied Correlated Topic Model (CTM) to partition contexts of a word into clusters, each representing a sense of that word. Different from some previous systems where a single model is built for all words, in our system, each word has its own model. Experimental result on the SemEval-2010 dataset shows that CTM is a strong tool for modelling the word´s contexts. Our system has significantly better performance than all systems participated in the SemEval-2010 workshop. In comparison to the use of other topic models for WSI, our system can explore additional useful information which is the relationship between senses of a word. The prospect of using CTM for discovering the correlation between senses of multiple words is also discussed at the end of this paper.
  • Keywords
    natural language processing; pattern clustering; CTM; WSI; context clustering approach; correlated topic model; word sense automatic identification; word sense induction; Buildings; Conferences; Context; Context modeling; Correlation; System performance; Testing; context clustering; correlated topic model; word sense induction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2012 International Conference on
  • Conference_Location
    Hanoi
  • Print_ISBN
    978-1-4673-6113-2
  • Electronic_ISBN
    978-0-7695-4886-9
  • Type

    conf

  • DOI
    10.1109/IALP.2012.73
  • Filename
    6473691