• DocumentCode
    561110
  • Title

    Data management support via spectrum perturbation-based subspace classification in collaborative environments

  • Author

    Chen, Chao ; Shyu, Mei-Ling ; Chen, Shu-Ching

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Miami, Coral Gables, FL, USA
  • fYear
    2011
  • fDate
    15-18 Oct. 2011
  • Firstpage
    67
  • Lastpage
    76
  • Abstract
    Data management support to enable effective and efficient information sharing in collaborative environments is critical, especially in semantics based search and retrieval. In this paper, a novel spectrum perturbation-based subspace classification is proposed to mine semantics and other useful information from a large-scale dataset by utilizing a lower-dimensional subspace to discriminate different classes of the dataset. Among the existing subspace-based approaches, the principal component (PC) subspace is the most prevailing one and has been well studied. After investigating previous work related to PC subspace, we found that none of them had considered the perturbation on spectrum when building the subspace learning models. However, such perturbation is of certain importance and is able to provide discriminant information that helps improve classification performance by measuring the closeness of each testing data instance towards a subspace model by a closeness score based on the spectrum perturbation. Each testing data instance is assigned to its closest class by searching the smallest closeness score. Experiments are conducted to evaluate our proposed subspace classifier using data sets from three different sources, and the experimental results show that it achieves promising results and outperforms comparative subspace classifiers as well as some other commonly used classifiers.
  • Keywords
    data mining; groupware; information retrieval; learning (artificial intelligence); pattern classification; principal component analysis; PC subspace; closeness score; collaborative environment; data management support; discriminant information; information mining; information sharing; large-scale dataset; lower dimensional subspace; principal component subspace; semantic based retrieval; semantic based search; semantic mining; spectrum perturbation based subspace classification; subspace based approach; subspace classifier; subspace learning models; testing data; Collaborative environment; Principal component (PC) subspace; classification; closeness score; spectrum perturbation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom), 2011 7th International Conference on
  • Conference_Location
    Orlando, FL
  • Print_ISBN
    978-1-4673-0683-6
  • Electronic_ISBN
    978-1-936968-32-9
  • Type

    conf

  • Filename
    6144790