Title :
Constructing Field Association words using declinable words and concurrent words
Author :
Atlam, El-Sayed ; Morita, Kazuhiro ; Fuketa, Masoa ; Aoe, Jun-Ichi
Author_Institution :
Dept. of Inf. Sci. & Intell. Syst., Univ. of Tokushima, Tokushima
Abstract :
Readers can know the subject of many document fields by reading only some specific field association (FA) words. Document fields can be decided efficiently if there are many rank 1 FA words (words that direct connect to terminal fields) and if the frequency rate is high. This paper proposes a new method for increasing rank 1 FA words using declinable words and concurrent words which relate to narrow association categories and eliminate FA word ambiguity. Concurrent words become concurrent field association words (CFA words) if there is a little field overlap. Usually, efficient CFA words are difficult to extract using only frequency, so this paper proposes weighting according to degree of importance of concurrent words. The new weighting method causes Precision and Recall to be higher by 40% and 30% than by using frequency alone.
Keywords :
information retrieval; word processing; concurrent field association word; declinable word; document field association word ranking; field association word ambiguity; information retrieval; Data mining; Dictionaries; Frequency; Humans; Information retrieval; Information science; Intelligent systems; Nominations and elections; Shape; Tree data structures;
Conference_Titel :
Innovations in Information Technology, 2008. IIT 2008. International Conference on
Conference_Location :
Al Ain
Print_ISBN :
978-1-4244-3396-4
Electronic_ISBN :
978-1-4244-3397-1
DOI :
10.1109/INNOVATIONS.2008.4781658