DocumentCode
561110
Title
Data management support via spectrum perturbation-based subspace classification in collaborative environments
Author
Chen, Chao ; Shyu, Mei-Ling ; Chen, Shu-Ching
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of Miami, Coral Gables, FL, USA
fYear
2011
fDate
15-18 Oct. 2011
Firstpage
67
Lastpage
76
Abstract
Data management support to enable effective and efficient information sharing in collaborative environments is critical, especially in semantics based search and retrieval. In this paper, a novel spectrum perturbation-based subspace classification is proposed to mine semantics and other useful information from a large-scale dataset by utilizing a lower-dimensional subspace to discriminate different classes of the dataset. Among the existing subspace-based approaches, the principal component (PC) subspace is the most prevailing one and has been well studied. After investigating previous work related to PC subspace, we found that none of them had considered the perturbation on spectrum when building the subspace learning models. However, such perturbation is of certain importance and is able to provide discriminant information that helps improve classification performance by measuring the closeness of each testing data instance towards a subspace model by a closeness score based on the spectrum perturbation. Each testing data instance is assigned to its closest class by searching the smallest closeness score. Experiments are conducted to evaluate our proposed subspace classifier using data sets from three different sources, and the experimental results show that it achieves promising results and outperforms comparative subspace classifiers as well as some other commonly used classifiers.
Keywords
data mining; groupware; information retrieval; learning (artificial intelligence); pattern classification; principal component analysis; PC subspace; closeness score; collaborative environment; data management support; discriminant information; information mining; information sharing; large-scale dataset; lower dimensional subspace; principal component subspace; semantic based retrieval; semantic based search; semantic mining; spectrum perturbation based subspace classification; subspace based approach; subspace classifier; subspace learning models; testing data; Collaborative environment; Principal component (PC) subspace; classification; closeness score; spectrum perturbation;
fLanguage
English
Publisher
ieee
Conference_Titel
Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom), 2011 7th International Conference on
Conference_Location
Orlando, FL
Print_ISBN
978-1-4673-0683-6
Electronic_ISBN
978-1-936968-32-9
Type
conf
Filename
6144790
Link To Document