Title :
Deep learning vector quantization for acoustic information retrieval
Author :
Zhen Huang ; Chao Weng ; Kehuang Li ; You-Chi Cheng ; Chin-Hui Lee
Author_Institution :
Sch. of ECE, Georgia Inst. of Technol., Atlanta, GA, USA
Abstract :
We propose a novel deep learning vector quantization (DLVQ) algorithm based on deep neural networks (DNNs). Utilizing a strong representation power of this deep learning framework, with any vector quantization (VQ) method as an initializer, the proposed DLVQ technique is capable of learning a code-constrained codebook and thus improves over conventional VQ to be used in classification problems. Tested on an audio information retrieval task, the proposed DLVQ achieves a quite promising performance when it is initialized by the k-means VQ technique. A 10.5% relative gain in mean average precision (MAP) is obtained after fusing the k-means and DLVQ results together.
Keywords :
acoustic signal processing; audio signal processing; information retrieval; neural nets; vector quantisation; DLVQ algorithm; DNN; acoustic information retrieval; audio information retrieval task; code-constrained codebook; deep learning vector quantization; deep neural network; k-means VQ technique; mean average precision; Acoustics; Information retrieval; Neural networks; Speech; Training; Vector quantization; Vectors; Deep neural network; information retrieval; k-means; learning vector quantization;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6853817