• DocumentCode
    3348292
  • Title

    Acoustic space dimensionality selection and combination using the maximum entropy principle

  • Author

    Abdel-Haleem, Yasser H. ; Renals, Steve ; Lawrence, Neil D.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Sheffield, UK
  • Volume
    5
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    We propose a discriminative approach to acoustic space dimensionality selection based on maximum entropy modelling. We form a set of constraints by composing the acoustic space with the space of phone classes, and use a continuous feature formulation of maximum entropy modelling to select an optimal feature set. The suggested approach has two steps: (1) the selection of the best acoustic space that efficiently and economically represents the acoustic data and its variability; (2) the combination of selected acoustic features in the maximum entropy framework to estimate the posterior probabilities over the phonetic labels given the acoustic input. Specific contributions of the paper include a parameter estimation algorithm (generalized improved iterative scaling) that enables the use of negative features, the parameterization of constraint functions using Gaussian mixture models, and experimental results using the TIMIT database.
  • Keywords
    Gaussian processes; acoustic signal processing; iterative methods; learning (artificial intelligence); maximum entropy methods; parameter estimation; probability; speech processing; Gaussian mixture models; acoustic space dimensionality selection; constraint functions; continuous feature formulation; generalized improved iterative scaling; maximum entropy principle; parameter estimation algorithm; phone classes; phonetic labels; posterior probability estimation; supervised machine learning approach; Computer science; Continuous-stirred tank reactor; Entropy; Iterative algorithms; Natural language processing; Natural languages; Parameter estimation; Probability distribution; Spatial databases; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1327191
  • Filename
    1327191