• DocumentCode
    337478
  • Title
    Tree-structured models of parameter dependence for rapid adaptation in large vocabulary conversational speech recognition
  • Author
    Kannan, Ashvin; Khudanpur, Sanjeev
  • Author_Institution
    Nuance Commun., Menlo Park, CA, USA
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    769
  • Abstract
    Two models of statistical dependence between the acoustic model parameters of a large vocabulary conversational speech recognition (LVCSR) system are investigated for the purpose of rapid speaker and environment adaptation from a very small amount of speech: (i) a Gaussian multiscale process governed by a stochastic linear dynamical system on a tree, and (ii) a simple hierarchical tree-structured prior. Both methods permit Bayesian (MAP) estimation of acoustic model parameters without parameter tying, even when the limited amount of adaptation data leaves no samples from which to estimate some parameters independently. Modeling methodologies are contrasted, and the comparative performance of the two on the Switchboard task is presented under identical test conditions for supervised and unsupervised adaptation with controlled amounts of adaptation speech. Both methods provide a significant (1% absolute) gain in accuracy over adaptation methods that do not exploit the dependence between acoustic model parameters.
  • Keywords
    Bayes methods; Gaussian processes; speech recognition; statistical analysis; tree data structures; Bayesian estimation; Gaussian multiscale process; MAP estimation; Switchboard task; acoustic model parameters; adaptation speech; hierarchical tree-structured prior; large vocabulary conversational speech recognition; parameter dependence; rapid adaptation; statistical dependence; stochastic linear dynamical system; supervised adaptation; tree-structured models; unsupervised adaptation; Acoustic testing; Bayesian methods; Gaussian noise; Hidden Markov models; Loudspeakers; Speech processing; Speech recognition; Stochastic systems; System testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type
    conf
  • DOI
    10.1109/ICASSP.1999.759782
  • Filename
    759782