DocumentCode :
3124834
Title :
A feature-transform based approach to unsupervised task adaptation and personalization
Author :
Jian Xu ; Zhi-Jie Yan ; Qiang Huo
Author_Institution :
Dept. of Autom., Univ. of Sci. & Technol. of China, Hefei, China
fYear :
2012
fDate :
5-8 Dec. 2012
Firstpage :
229
Lastpage :
232
Abstract :
This paper presents a feature-transform based approach to unsupervised task adaptation and personalization for speech recognition. Given task-specific speech data collected from a deployed service, an “acoustic sniffing” module is built first by using a so-called i-vector technique with a number of acoustic conditions identified via i-vector clustering. Unsupervised maximum likelihood training is then performed to estimate a task-dependent feature transform for each acoustic condition, while pre-trained HMM parameters of acoustic models are kept unchanged. Given an unknown utterance, an appropriate feature transform is selected via “acoustic sniffing”, which is used to transform the feature vectors of the unknown utterance for decoding. The effectiveness of the proposed method is confirmed in a task adaptation scenario from a conversational telephone speech transcription task to a short message dictation task. The same method is expected to work for personalization as well.
Keywords :
hidden Markov models; maximum likelihood estimation; speech recognition; unsupervised learning; HMM parameters; acoustic sniffing module; feature-transform based approach; i-vector clustering; i-vector technique; personalization; short message dictation task; speech recognition; task-dependent feature transform; unsupervised maximum likelihood training; unsupervised task adaptation; Abstracts; Estimation; Hidden Markov models; Indexes; Switches; Training; Transforms; acoustic sniffing; i-vector; personalization; unsupervised task adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
Type :
conf
DOI :
10.1109/ISCSLP.2012.6423513
Filename :
6423513
Link To Document :
بازگشت