• DocumentCode
    701488
  • Title

    Recognition of phonemes from estimation errors

  • Author

    Baghai-Ravary, L ; Beet, S W

  • Author_Institution
    Department of Electronic and Electrical Engineering, The University of Sheffield, Mappin Street, Sheffield, S1 3JD, UK
  • fYear
    1996
  • fDate
    10-13 Sept. 1996
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Speech recognition systems generally use delta and delta-delta (velocity and acceleration) coefficients to characterise the dynamics apparent in frame-based representations of speech. These coefficients can be thought of as the errors of simple predictors. This paper describes the use of error coefficients derived from more advanced (and accurate) forms of prediction and interpolation. Both overall recognition accuracy and the detailed confusions observed are compared with those of the ‘traditional’ methods. The task used is speaker-independent phoneme recognition using a subset of the TIMIT database, and four different speech representations. The error coefficient performance on this task appears to be directly related to the robustness of the estimator used, with the best of the new methods out-performing delta-delta coefficients by around 10%.
  • Keywords
    Accuracy; Databases; Hidden Markov models; Interpolation; Speech; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    European Signal Processing Conference, 1996. EUSIPCO 1996. 8th
  • Conference_Location
    Trieste, Italy
  • Print_ISBN
    978-888-6179-83-6
  • Type

    conf

  • Filename
    7083214