Title :
Data management support via spectrum perturbation-based subspace classification in collaborative environments
Author :
Chen, Chao ; Shyu, Mei-Ling ; Chen, Shu-Ching
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Miami, Coral Gables, FL, USA
Abstract :
Data management support to enable effective and efficient information sharing in collaborative environments is critical, especially in semantics based search and retrieval. In this paper, a novel spectrum perturbation-based subspace classification is proposed to mine semantics and other useful information from a large-scale dataset by utilizing a lower-dimensional subspace to discriminate different classes of the dataset. Among the existing subspace-based approaches, the principal component (PC) subspace is the most prevailing one and has been well studied. After investigating previous work related to PC subspace, we found that none of them had considered the perturbation on spectrum when building the subspace learning models. However, such perturbation is of certain importance and is able to provide discriminant information that helps improve classification performance by measuring the closeness of each testing data instance towards a subspace model by a closeness score based on the spectrum perturbation. Each testing data instance is assigned to its closest class by searching the smallest closeness score. Experiments are conducted to evaluate our proposed subspace classifier using data sets from three different sources, and the experimental results show that it achieves promising results and outperforms comparative subspace classifiers as well as some other commonly used classifiers.
Keywords :
data mining; groupware; information retrieval; learning (artificial intelligence); pattern classification; principal component analysis; PC subspace; closeness score; collaborative environment; data management support; discriminant information; information mining; information sharing; large-scale dataset; lower dimensional subspace; principal component subspace; semantic based retrieval; semantic based search; semantic mining; spectrum perturbation based subspace classification; subspace based approach; subspace classifier; subspace learning models; testing data; Collaborative environment; Principal component (PC) subspace; classification; closeness score; spectrum perturbation;
Conference_Titel :
Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom), 2011 7th International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4673-0683-6
Electronic_ISBN :
978-1-936968-32-9