DocumentCode :
939233
Title :
On FastMap and the convex hull of multivariate data: toward fast and robust dimension reduction
Author :
Ostrouchov, George ; Samatova, Nagiza F.
Author_Institution :
Div. of Comput. Sci. & Math., Oak Ridge Nat. Lab., TN, USA
Volume :
27
Issue :
8
fYear :
2005
Firstpage :
1340
Lastpage :
1343
Abstract :
FastMap is a dimension reduction technique that operates on distances between objects. Although only distances are used, implicitly the technique assumes that the objects are points in a p-dimensional Euclidean space. It selects a sequence of k ≤ p orthogonal axes defined by distant pairs of points (called pivots) and computes the projection of the points onto the orthogonal axes. We show that FastMap uses only the outer envelope of a data set. Pivots are taken from the faces, usually vertices, of the convex hull of the data points in the original implicit Euclidean space. This provides a bridge to results in robust statistics, where the convex hull is used as a tool in multivariate outlier detection and in robust estimation methods. The connection sheds new light on the properties of FastMap, particularly its sensitivity to outliers, and provides an opportunity for a new class of dimension reduction algorithms, RobustMaps, that retain the speed of FastMap and exploit ideas in robust statistics.
Keywords :
computational geometry; principal component analysis; Euclidean space; FastMap; convex hull; dimension reduction algorithms; dimension reduction technique; multivariate data; multivariate outlier detection; principal component analysis; Bridges; Data visualization; Euclidean distance; Extraterrestrial measurements; Face detection; Multidimensional systems; Principal component analysis; Robustness; Statistical distributions; Statistics; Euclidean distance.; FastMap; Index Terms- Dimension reduction; RobustMap; convex hull; multidimensional scaling; principal components; robust statistics; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Models, Statistical; Multivariate Analysis; Pattern Recognition, Automated; Signal Processing, Computer-Assisted; Subtraction Technique;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2005.164
Filename :
1453521
Link To Document :
بازگشت