DocumentCode
1923068
Title
Learning Out-Of Sample Mapping in Non-Vectorial Data Reduction using Constrained Twin Kernel Embedding
Author
Guo, Yi ; Gao, Junbin ; Kwan, Paul W.
Author_Institution
Univ. of New England, Armidale
Volume
1
fYear
2007
fDate
19-22 Aug. 2007
Firstpage
19
Lastpage
24
Abstract
Twin kernel embedding (TKE) is a powerful non-vectorial data reduction algorithm proposed for advanced applications in clustering and visualization, manifold learning, etc. Due to the requirement of online processing in many cutting edge research problems involving highly structured data like DNA, protein sequences and biometric features that are non-vectorial in nature, learning the out-of-sample (OOS) mapping becomes a necessity. To address this, we propose constrained TKE, which is an OOS extension of TKE capable of learning such a mapping function. This is achieved by including the mapping in the objective function optimized by the TKE algorithm. More broadly, this mapping function can be applied in other data reduction methods as an OOS extension. Furthermore, to improve the accuracy of predictions in case where new samples are presented in batch, a refinement strategy is introduced by exploiting the similarity between new samples which is often ignored by other methods. Experimental results on the Reuters-21578 text collection confirmed the usefulness of the proposed method.
Keywords
data reduction; data visualisation; learning (artificial intelligence); pattern clustering; DNA; Reuters-21578 text collection; biometric features; constrained twin kernel embedding; data clustering; data visualization; manifold learning; nonvectorial data reduction; objective function; out-of sample mapping learning; protein sequences; Cybernetics; Kernel; Machine learning; Dimensionality reduction; Out-Of-Sample; TKE;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location
Hong Kong
Print_ISBN
978-1-4244-0973-0
Electronic_ISBN
978-1-4244-0973-0
Type
conf
DOI
10.1109/ICMLC.2007.4370108
Filename
4370108
Link To Document