Title :
Efficient Sparse Modeling With Automatic Feature Grouping
Author :
Zhong, L.W. ; Kwok, J.T.
Author_Institution :
Dept. of Comput. Sci. & Eng., Hong Kong Univ. of Sci. & Technol., Kowloon, China
Abstract :
For high-dimensional data, it is often desirable to group similar features together during the learning process. This can reduce the estimation variance and improve the stability of feature selection, leading to better generalization. Moreover, it can also help in understanding and interpreting data. Octagonal shrinkage and clustering algorithm for regression (OSCAR) is a recent sparse-modeling approach that uses a l1 -regularizer and a pairwise l∞-regularizer on the feature coefficients to encourage such feature grouping. However, computationally, its optimization procedure is very expensive. In this paper, we propose an efficient solver based on the accelerated gradient method. We show that its key proximal step can be solved by a highly efficient simple iterative group merging algorithm. Given d input features, this reduces the empirical time complexity from O(d2 ~ d5) for the existing solvers to just O(d). Experimental results on a number of toy and real-world datasets demonstrate that OSCAR is a competitive sparse-modeling approach, but with the added ability of automatic feature grouping.
Keywords :
computational complexity; computational geometry; data handling; gradient methods; learning (artificial intelligence); pattern clustering; regression analysis; OSCAR; accelerated gradient method; automatic feature grouping; data interpretation; data understanding; empirical time complexity reduction; estimation variance reduction; feature coefficients; feature selection stability improvement; high-dimensional data; iterative group merging algorithm; l1 -regularizer; learning process; octagonal shrinkage-and-clustering algorithm-for-regression; optimization procedure; pairwise l∞-regularizer; sparse modeling; sparse-modeling approach; Acceleration; Algorithm design and analysis; Complexity theory; Computational modeling; Convergence; Gradient methods; Accelerated gradient descent; feature grouping; sparse modeling; structured sparsity;
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
DOI :
10.1109/TNNLS.2012.2200262