Title :
The extraction method of the word meaning class
Author :
Tsuda, Kazuhiko ; Nakamura, Masami
Author_Institution :
Graduate Sch. of Syst. Manage., Tsukuba Univ., Tokyo, Japan
Abstract :
In natural language processing, the semantic class of a word is an important piece of knowledge. A thesaurus dictionary, which records semantic information relating words to one another, is normally edited by hand, and this editing requires a great many man-hours. This paper proposes a method for extracting the semantic class information of a word from a set of documents. The method exploits the characteristic that abstract words occur with high frequency while concrete words occur with low frequency. An experiment confirmed that about 20% of the extracted words should be registered in the thesaurus dictionary.
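The frequency characteristic described in the abstract can be illustrated with a minimal sketch. This is not the authors' procedure, which the abstract does not specify. It assumes documents are already tokenized, that raw occurrence counts are a usable frequency measure, and that a single cutoff separates the two groups. The function name `split_by_frequency` and the `threshold` parameter are hypothetical.

```python
from collections import Counter


def split_by_frequency(documents, threshold):
    """Split the vocabulary of a document set by occurrence count.

    documents: list of token lists (tokenization is assumed done upstream).
    threshold: hypothetical cutoff; words occurring at least this often are
               treated as candidate abstract (class-level) words, the rest
               as candidate concrete words.
    """
    # Count every token across the whole document set, case-insensitively.
    counts = Counter(token.lower() for doc in documents for token in doc)

    # High-frequency words: candidates for abstract, class-level terms.
    abstract_candidates = sorted(w for w, c in counts.items() if c >= threshold)
    # Low-frequency words: candidates for concrete terms.
    concrete_candidates = sorted(w for w, c in counts.items() if c < threshold)
    return abstract_candidates, concrete_candidates


# Toy document set, invented purely for illustration.
docs = [
    ["animal", "dog", "runs"],
    ["animal", "cat", "sleeps"],
    ["animal", "bird", "flies"],
]
abstract_words, concrete_words = split_by_frequency(docs, threshold=3)
print(abstract_words)
print(concrete_words)
```

In this toy input, "animal" appears in every document and lands in the high-frequency group, while the more specific words fall into the low-frequency group. Extracted candidates would still need manual review before registration in a thesaurus, consistent with the roughly 20% acceptance rate the abstract reports.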
Keywords :
dictionaries; natural languages; thesauri; abstract word frequency; concrete word frequency; document set; natural language processing; semantic class information; thesaurus dictionary; word meaning class extraction method; Concrete; Data mining; Dictionaries; Frequency; Intelligent structures; Intelligent systems; Knowledge management; Natural language processing; Natural languages; Thesauri;
Conference_Title :
Knowledge-Based Intelligent Information Engineering Systems, 1999. Third International Conference
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-5578-4
DOI :
10.1109/KES.1999.820241