• DocumentCode
    3724347
  • Title

    Vector Similarity of Related Words in the Japanese Word Net

  • Author

    Takuya Hirao;Nao Wariishi;Takahiko Suzuki;Sachio Hirokawa

  • Author_Institution
    Grad. Sch. of Inf. Sci. &
  • fYear
    2015
  • fDate
    7/1/2015 12:00:00 AM
  • Firstpage
    142
  • Lastpage
    147
  • Abstract
    Word2vec is a tool that produces vector representation of words from a large amount of text data. In this paper, we show that only a part of the vector space produced by word2vec is enough to represent the collective sense of a set of related words in the Japanese WordNet. Further, we will show that there is a subspace in the vector space which do not relate to the collective sense. We construct a compact decision tree by using the vectors in order to distinguish whether a given word belongs to the set of related words.
  • Keywords
    "Decision trees","Encyclopedias","Electronic publishing","Internet","Dairy products","Information science"
  • Publisher
    ieee
  • Conference_Titel
    Advanced Applied Informatics (IIAI-AAI), 2015 IIAI 4th International Congress on
  • Print_ISBN
    978-1-4799-9957-6
  • Type

    conf

  • DOI
    10.1109/IIAI-AAI.2015.254
  • Filename
    7373891