DocumentCode
2716943
Title
Constructing Field Association words using declinable words and concurrent words
Author
Atlam, El-Sayed ; Morita, Kazuhiro ; Fuketa, Masao ; Aoe, Jun-Ichi
Author_Institution
Dept. of Inf. Sci. & Intell. Syst., Univ. of Tokushima, Tokushima
fYear
2008
fDate
16-18 Dec. 2008
Firstpage
322
Lastpage
326
Abstract
Readers can recognize the subject of many document fields by reading only a few specific field association (FA) words. Document fields can be determined efficiently when there are many rank 1 FA words (words that connect directly to terminal fields) and their frequency rate is high. This paper proposes a new method for increasing the number of rank 1 FA words using declinable words and concurrent words, which relate to narrow association categories and eliminate FA word ambiguity. Concurrent words become concurrent field association (CFA) words if there is little field overlap. Because efficient CFA words are usually difficult to extract using frequency alone, this paper proposes weighting concurrent words according to their degree of importance. The new weighting method raises precision and recall by 40% and 30%, respectively, compared with using frequency alone.
Keywords
information retrieval; word processing; concurrent field association word; declinable word; document field association word ranking; field association word ambiguity; Data mining; Dictionaries; Frequency; Humans; Information science; Intelligent systems; Nominations and elections; Shape; Tree data structures
fLanguage
English
Publisher
IEEE
Conference_Titel
Innovations in Information Technology, 2008. IIT 2008. International Conference on
Conference_Location
Al Ain
Print_ISBN
978-1-4244-3396-4
Electronic_ISBN
978-1-4244-3397-1
Type
conf
DOI
10.1109/INNOVATIONS.2008.4781658
Filename
4781658