Title :
Dimensionality reduction in unsupervised learning of conditional Gaussian networks
Author :
Peña, Jose Manuel ; Lozano, Jose Antonio ; Larrañaga, Pedro ; Inza, Iñaki
Author_Institution :
Dept. of Comput. Sci. & Artificial Intelligence, Univ. of the Basque Country, Spain
fDate :
6/1/2001
Abstract :
This paper introduces a novel enhancement for unsupervised learning of conditional Gaussian networks that benefits from feature selection. Our proposal is based on the assumption that, in the absence of labels reflecting the cluster membership of each case of the database, those features that exhibit low correlation with the rest of the features can be considered irrelevant for the learning process. Thus, we suggest performing this process using only the relevant features. Then, every irrelevant feature is added to the learned model to obtain an explanatory model for the original database, which is our primary goal. A simple and, thus, efficient measure to assess the relevance of the features for the learning process is presented. Additionally, the form of this measure allows us to calculate a relevance threshold to automatically identify the relevant features. The experimental results reported for synthetic and real-world databases show the ability of our proposal to distinguish between relevant and irrelevant features and to accelerate learning, while still obtaining good explanatory models for the original database.
Keywords :
Gaussian distribution; correlation methods; directed graphs; reduced order systems; unsupervised learning; cluster membership; conditional Gaussian networks; dimensionality reduction; directed acyclic graphs; Acceleration; Intelligent networks; Machine learning; Pattern recognition; Probability density function; Proposals; Spatial databases; Testing
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
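
Illustrative sketch :
The feature-selection idea summarized in the abstract (score each feature by how strongly it correlates with the rest, then keep only the features above a relevance threshold) can be sketched roughly as follows. The abstract does not give the actual relevance measure or how its threshold is computed, so this sketch assumes a stand-in score, the mean absolute Pearson correlation with the other features, and a hand-picked threshold. The names relevance_scores, split_features, and make_demo_data are hypothetical and are not taken from the paper.

```python
import numpy as np


def relevance_scores(data):
    """Assumed stand-in score: mean absolute Pearson correlation of each
    feature (column) with every other feature. Not the paper's measure."""
    corr = np.corrcoef(data, rowvar=False)
    abs_corr = np.abs(corr)
    np.fill_diagonal(abs_corr, 0.0)  # ignore each feature's self-correlation
    return abs_corr.sum(axis=1) / (data.shape[1] - 1)


def split_features(data, threshold):
    """Partition column indices into (relevant, irrelevant) using a
    hand-picked threshold (the paper derives its threshold automatically)."""
    scores = relevance_scores(data)
    relevant = [i for i, s in enumerate(scores) if s >= threshold]
    irrelevant = [i for i, s in enumerate(scores) if s < threshold]
    return relevant, irrelevant


def make_demo_data(n_cases=500, seed=0):
    """Synthetic data: three features driven by a shared latent variable,
    plus two independent noise features."""
    rng = np.random.default_rng(seed)
    latent = rng.normal(size=n_cases)
    informative = np.column_stack(
        [latent + 0.3 * rng.normal(size=n_cases) for _ in range(3)]
    )
    noise = rng.normal(size=(n_cases, 2))
    return np.hstack([informative, noise])


if __name__ == "__main__":
    data = make_demo_data()
    relevant, irrelevant = split_features(data, threshold=0.3)
    print("relevant features:", relevant)
    print("irrelevant features:", irrelevant)
```

In the workflow the abstract describes, model learning would then run on the relevant columns only, and the irrelevant columns would be added back to the learned model afterwards; that learning step is outside the scope of this sketch.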