DocumentCode
3724347
Title
Vector Similarity of Related Words in the Japanese Word Net
Author
Takuya Hirao;Nao Wariishi;Takahiko Suzuki;Sachio Hirokawa
Author_Institution
Grad. Sch. of Inf. Sci. &
fYear
2015
fDate
7/1/2015 12:00:00 AM
Firstpage
142
Lastpage
147
Abstract
Word2vec is a tool that produces vector representation of words from a large amount of text data. In this paper, we show that only a part of the vector space produced by word2vec is enough to represent the collective sense of a set of related words in the Japanese WordNet. Further, we will show that there is a subspace in the vector space which do not relate to the collective sense. We construct a compact decision tree by using the vectors in order to distinguish whether a given word belongs to the set of related words.
Keywords
"Decision trees","Encyclopedias","Electronic publishing","Internet","Dairy products","Information science"
Publisher
ieee
Conference_Titel
Advanced Applied Informatics (IIAI-AAI), 2015 IIAI 4th International Congress on
Print_ISBN
978-1-4799-9957-6
Type
conf
DOI
10.1109/IIAI-AAI.2015.254
Filename
7373891
Link To Document