• DocumentCode
    336725
  • Title

    Progress in Broadcast News transcription at Dragon Systems

  • Author

    Wegmann, Steven ; Zhan, Puming ; Gillick, Larry

  • Author_Institution
    Dragon Syst. Inc., Newton, MA, USA
  • Volume
    1
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    33
  • Abstract
    We report on progress in acoustic modelling and preprocessing in our Broadcast News transcription system. We have gone back to basics in acoustic modelling, and re-examined some of our standard practices, in particular the use of IMELDA and frequency warping, in the context of the Broadcast News corpus. We also report on some preliminary experiments with a generalization of IMELDA, “semi-tied covariances”. In combination, these improvements lead to a 3.5% absolute improvement over our eval97 models. We also describe our attempts to fix our rather primitive, silence-based preprocessing system, including initial results using a new speaker-change detection algorithm based on Hotelling´s T2-test
  • Keywords
    speech recognition; Broadcast News transcription; Dragon Systems; Hotelling´s T2-test; IMELDA; acoustic modelling; frequency warping; preprocessing; semi-tied covariance; silence-based preprocessing system; speaker-change detection algorithm; Cepstral analysis; Context modeling; Degradation; Detection algorithms; Frequency; Gaussian processes; Radio broadcasting; Speech; TV broadcasting; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.758055
  • Filename
    758055