• DocumentCode
    310562
  • Title

    Explicit, N-best formant features for vowel classification

  • Author

    Schmid, Philipp ; Barnard, Etienne

  • Author_Institution
    Center for Spoken Language Understanding, Oregon Graduate Inst. of Sci. & Technol., Portland, OR, USA
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    991
  • Abstract
    We demonstrate the use of explicit formant features for vowel and semi-vowel classification. The formant trajectories are approximated by either three line segments or Legendre polynomials. Together with formant amplitude, formant bandwidth, pitch, and segment duration, these formant features form a compact feature representation which performs as well (71.8%) as a cepstral-based feature representation (71.6%). The combination of the formant and cepstral feature improves the accuracy further to 73.4%. Additionally, we outline future experiments using our robust, N-best formant tracker
  • Keywords
    Legendre polynomials; approximation theory; cepstral analysis; feature extraction; signal representation; speech processing; speech recognition; tracking; Legendre polynomials; N-best formant features; N-best formant tracker; cepstral based feature representation; experiments; explicit formant features; formant amplitude; formant bandwidth; formant features; formant trajectories; line segments; pitch; segment duration; semivowel classification; speech recognition; vowel classification; Bandwidth; Cepstral analysis; Delay; Government; History; Mel frequency cepstral coefficient; Natural languages; Robustness; Speech; Trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596106
  • Filename
    596106