Acoustic factorisation

Author

Gales, M. J E

Author_Institution

Dept. of Eng., Cambridge Univ., UK

fYear

2001

fDate

2001

Firstpage

77

Lastpage

80

Abstract

This paper describes a new technique for training a speech recognition system on inhomogeneous training data. The proposed technique, acoustic factorisation, attempts to model explicitly all the factors that affect the acoustic signal. By explicitly modelling all the factors, the trained model set may be used in a more flexible fashion than in standard adaptive training schemes. Since an individual model is trained for each factor, it is possible to factor-in only those factors that are appropriate to a particular target domain, for example the distribution over all training speakers. The target domain specific factors are simply estimated from limited target specific data, for example the target acoustic environment. The paper describes the theory of this new approach for the transforms for a particular speaker and environment. Initial experiments on a large vocabulary speech recognition task are presented.

Keywords

acoustic signal processing; learning (artificial intelligence); parameter estimation; speech recognition; acoustic factorisation; adaptive training; inhomogeneous training data; specific factor estimation; speech recognition; Acoustic noise; Acoustic testing; Acoustical engineering; Data engineering; Degradation; Loudspeakers; Maximum likelihood linear regression; Speech recognition; Training data; Working environment noise;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on

Print_ISBN

0-7803-7343-X

Type

conf

DOI

10.1109/ASRU.2001.1034593

Filename

1034593