DocumentCode
178402
Title
Contrastive auto-encoder for phoneme recognition
Author
Xin Zheng ; Zhiyong Wu ; Meng, Hsiang-Yun ; Lianhong Cai
Author_Institution
Tsinghua-CUHK Joint Res. Center for Media Sci., Tsinghua Univ., Shenzhen, China
fYear
2014
fDate
4-9 May 2014
Firstpage
2529
Lastpage
2533
Abstract
Speech data typically contains task irrelevant information lying within features. Specifically, phonetic information, speaker characteristic information, emotional information and noise are always mixed together and tend to impair one another for certain task. We propose a new type of auto-encoder for feature learning called contrastive auto-encoder. Unlike other variants of auto-encoders, contrastive auto-encoder is able to leverage class labels in constructing its representation layer. We achieve this by modeling two autoencoders together and making their differences contribute to the total loss function. The transformation built with contrastive auto-encoder can be seen as a task-specific and invariant feature learner. Our experiments on TIMIT clearly show the superiority of the feature extracted from contrastive auto-encoder over original acoustic feature, feature extracted from deep auto-encoder, and feature extracted from a model that contrastive auto-encoder originates from.
Keywords
feature extraction; speech processing; speech recognition; acoustic feature; contrastive auto-encoder; emotional information; feature extraction; invariant feature learner; phoneme recognition; phonetic information; representation layer; speaker characteristic information; speech data; task irrelevant information; task-specific learner; Acoustics; DNA; Data models; Feature extraction; Hidden Markov models; Neurons; Training; auto-encoder; contrastive auto-encoder; deep neural network (DNN); phoneme recognition; restricted Boltzmann machine (RBM);
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854056
Filename
6854056
Link To Document