• DocumentCode
    381265
  • Title

    Acoustic factorisation

  • Author

    Gales, M. J E

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    77
  • Lastpage
    80
  • Abstract
    This paper describes a new technique for training a speech recognition system on inhomogeneous training data. The proposed technique, acoustic factorisation, attempts to model explicitly all the factors that affect the acoustic signal. By explicitly modelling all the factors, the trained model set may be used in a more flexible fashion than in standard adaptive training schemes. Since an individual model is trained for each factor, it is possible to factor-in only those factors that are appropriate to a particular target domain, for example the distribution over all training speakers. The target domain specific factors are simply estimated from limited target specific data, for example the target acoustic environment. The paper describes the theory of this new approach for the transforms for a particular speaker and environment. Initial experiments on a large vocabulary speech recognition task are presented.
  • Keywords
    acoustic signal processing; learning (artificial intelligence); parameter estimation; speech recognition; acoustic factorisation; adaptive training; inhomogeneous training data; specific factor estimation; speech recognition; Acoustic noise; Acoustic testing; Acoustical engineering; Data engineering; Degradation; Loudspeakers; Maximum likelihood linear regression; Speech recognition; Training data; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
  • Print_ISBN
    0-7803-7343-X
  • Type

    conf

  • DOI
    10.1109/ASRU.2001.1034593
  • Filename
    1034593