Title :
Semi-supervised OWA aggregation for link-based similarity evaluation and alias detection
Author :
Boongoen, Tossapon ; Shen, Qiang
Author_Institution :
Dept. of Comput. Sci., Aberystwyth Univ., Aberystwyth, UK
Abstract :
Within the past decades, many fuzzy aggregation techniques, ordered weighted averaging (OWA) in particular, have proven effective for a wide range of information processing tasks, such as decision making, image analysis, database and machine learning. Despite reported successes, their potentials have yet to be explored for the emerging problem of link analysis, which aims to discover similarity and relations amongst objects through their associations. Recently, several link-based similarity methods have been put forward to identifying similar objects in the Internet and publication domains. However, these techniques only take into account the cardinality property of a link structure that is highly sensitive to noise and causes a great number of false positives. In light of such challenge, this paper presents a novel OWA aggregation model that is capable of efficiently deriving a similarity measure through the integration of multiple link properties. The underlying approach is based on the methodology of stress function by which the aggregation behavior can be easily interpreted and modeled. In addition, a semi-supervised method is introduced to assist a user in designing a stress function, i.e. the weighting scheme of link properties, appropriate for a particular link network. The application of the OWA aggregation approach to alias detection is demonstrated and evaluated, against state-of-art link-based techniques, over datasets specifically related to terrorism, publication and email domains.
Keywords :
Internet; data mining; fuzzy set theory; learning (artificial intelligence); mathematical operators; search engines; Internet; alias detection; cardinality property; data mining; database; decision making; email domain; fuzzy aggregation technique; image analysis; information processing task; link analysis; link-based similarity evaluation; machine learning; ordered weighted averaging technique; publication domain; search engine; semisupervised OWA aggregation; stress function; terrorism; Data analysis; Decision making; Image analysis; Information processing; Noise measurement; Open wireless architecture; Particle measurements; Pattern recognition; Stress; Terrorism;
Conference_Titel :
Fuzzy Systems, 2009. FUZZ-IEEE 2009. IEEE International Conference on
Conference_Location :
Jeju Island
Print_ISBN :
978-1-4244-3596-8
Electronic_ISBN :
1098-7584
DOI :
10.1109/FUZZY.2009.5277168