Title :
Audio Segment Classification Using Online Learning Based Tensor Representation Feature Discrimination
Author :
Shi, Ziqiang ; Han, Jiqing ; Zheng, Tieran ; Deng, Shiwen
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Abstract :
In order to naturally combine audio information from different dimensions and build robust audio processing system, a novel framework based on low-rank tensor representation features for audio segment classification is proposed in this paper. The audio signal is first transformed into tensor format data, and then these tensor data are mapped to a low-rank space which is insensitive under certain noises, especially white Gaussian noise and gross corruptions. For these low-rank tensor based features, tensor classification via a linear classifier based on minimization a smooth loss function regularized by the trace norm proposed recently is used. Most previous methods find the weight tensor and bias in batch-mode learning, which makes them inefficient for large-scale problems. In this paper, we propose to address this problem with an online learning algorithm based on the accelerated proximal gradient (APG) method, which scales up gracefully to large data sets. Experiments on simulation and real audio data demonstrate the efficiency of the methods.
Keywords :
Gaussian noise; audio signal processing; gradient methods; learning (artificial intelligence); signal classification; tensors; APG method; accelerated proximal gradient method; audio processing system; audio segment classification; batch-mode learning; gross corruptions; large data sets; linear classifier; low-rank tensor based features; online learning algorithm; smooth loss function minimization; tensor format data; tensor representation feature discrimination; weight tensor; white Gaussian noise; Equations; Feature extraction; Noise; Robustness; Speech; Speech processing; Tensile stress; Accelerated proximal gradient (APG); audio segment classification; low-rank approximation; online learning; tensor classification; trace norm regularization;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2012.2215598