• DocumentCode
    1624711
  • Title

    Semi-supervised OWA aggregation for link-based similarity evaluation and alias detection

  • Author

    Boongoen, Tossapon ; Shen, Qiang

  • Author_Institution
    Dept. of Comput. Sci., Aberystwyth Univ., Aberystwyth, UK
  • fYear
    2009
  • Firstpage
    288
  • Lastpage
    293
  • Abstract
    Within the past decades, many fuzzy aggregation techniques, ordered weighted averaging (OWA) in particular, have proven effective for a wide range of information processing tasks, such as decision making, image analysis, database and machine learning. Despite reported successes, their potentials have yet to be explored for the emerging problem of link analysis, which aims to discover similarity and relations amongst objects through their associations. Recently, several link-based similarity methods have been put forward to identifying similar objects in the Internet and publication domains. However, these techniques only take into account the cardinality property of a link structure that is highly sensitive to noise and causes a great number of false positives. In light of such challenge, this paper presents a novel OWA aggregation model that is capable of efficiently deriving a similarity measure through the integration of multiple link properties. The underlying approach is based on the methodology of stress function by which the aggregation behavior can be easily interpreted and modeled. In addition, a semi-supervised method is introduced to assist a user in designing a stress function, i.e. the weighting scheme of link properties, appropriate for a particular link network. The application of the OWA aggregation approach to alias detection is demonstrated and evaluated, against state-of-art link-based techniques, over datasets specifically related to terrorism, publication and email domains.
  • Keywords
    Internet; data mining; fuzzy set theory; learning (artificial intelligence); mathematical operators; search engines; Internet; alias detection; cardinality property; data mining; database; decision making; email domain; fuzzy aggregation technique; image analysis; information processing task; link analysis; link-based similarity evaluation; machine learning; ordered weighted averaging technique; publication domain; search engine; semisupervised OWA aggregation; stress function; terrorism; Data analysis; Decision making; Image analysis; Information processing; Noise measurement; Open wireless architecture; Particle measurements; Pattern recognition; Stress; Terrorism;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems, 2009. FUZZ-IEEE 2009. IEEE International Conference on
  • Conference_Location
    Jeju Island
  • ISSN
    1098-7584
  • Print_ISBN
    978-1-4244-3596-8
  • Electronic_ISBN
    1098-7584
  • Type

    conf

  • DOI
    10.1109/FUZZY.2009.5277168
  • Filename
    5277168