DocumentCode :
2865151
Title :
A random walk through human associations
Author :
Tamir, Raz
Author_Institution :
The Hebrew Univ. of Jerusalem, Israel
fYear :
2005
fDate :
27-30 Nov. 2005
Abstract :
Letting one\´s thoughts wander is not simply an arbitrary or rambling process. It can better be described as "associative thinking", where a complex chain of associative thoughts and ideas are linked. It is our contention that this seemingly chaotic process can be modeled by a random walk in a weighted directed graph. Furthermore, is it possible to predict mathematically the "steady state" of such a process, to determine where such wandering is leading. The random walk process uses rules of association, defined by the Local Confidence Gain (LCG) interestingness measure. Extracted concepts are used as nodes of a directed graph. The associative "forces" between any two concepts (measured by LCG) are used to weigh the edges connecting the nodes that create a graph of associations. It is common, yet not trivial, for people to look for data about a subject without knowing its exact nomenclature (for example, finding the name of a disease just by knowing its symptoms). Random walk in association graphs can discover highly informative phrases that can be used for query expansion in a way that better expresses the user\´s initial search goals. A different usage is to create a user profile representing his current interests. We used a modified version of the Turing Test to show that the random walk process discovers association rules that conform to a human associations generating process. By constructing the user associations we were able to build a profile representing the user\´s "line of thoughts". The suggested algorithm can be used in any database and can implement the ranking measures of other association rules.
Keywords :
data mining; directed graphs; association graph; association rule discovery; associative thinking; human associations; interestingness measure; local confidence gain; random walk; user associations; user profile; weighted directed graph; Association rules; Chaos; Data mining; Diseases; Force measurement; Gain measurement; Humans; Joining processes; Steady-state; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
ISSN :
1550-4786
Print_ISBN :
0-7695-2278-5
Type :
conf
DOI :
10.1109/ICDM.2005.12
Filename :
1565710
Link To Document :
بازگشت