DocumentCode
381265
Title
Acoustic factorisation
Author
Gales, M. J E
Author_Institution
Dept. of Eng., Cambridge Univ., UK
fYear
2001
fDate
2001
Firstpage
77
Lastpage
80
Abstract
This paper describes a new technique for training a speech recognition system on inhomogeneous training data. The proposed technique, acoustic factorisation, attempts to model explicitly all the factors that affect the acoustic signal. By explicitly modelling all the factors, the trained model set may be used in a more flexible fashion than in standard adaptive training schemes. Since an individual model is trained for each factor, it is possible to factor-in only those factors that are appropriate to a particular target domain, for example the distribution over all training speakers. The target domain specific factors are simply estimated from limited target specific data, for example the target acoustic environment. The paper describes the theory of this new approach for the transforms for a particular speaker and environment. Initial experiments on a large vocabulary speech recognition task are presented.
Keywords
acoustic signal processing; learning (artificial intelligence); parameter estimation; speech recognition; acoustic factorisation; adaptive training; inhomogeneous training data; specific factor estimation; speech recognition; Acoustic noise; Acoustic testing; Acoustical engineering; Data engineering; Degradation; Loudspeakers; Maximum likelihood linear regression; Speech recognition; Training data; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN
0-7803-7343-X
Type
conf
DOI
10.1109/ASRU.2001.1034593
Filename
1034593
Link To Document