• DocumentCode
    310567
  • Title

    Phone classification with segmental features and a binary-pair partitioned neural network classifier

  • Author

    Zahorian, Stephen A. ; Silsbee, Peter ; Wang, Xihong

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Old Dominion Univ., Norfolk, VA, USA
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1011
  • Abstract
    This paper presents methods and experimental results for phonetic classification using 39 phone classes and the NIST recommended training and test sets for NTIMIT and TIMIT. Spectral/temporal features which represent the smoothed trajectory of FFT derived speech spectra over 300 ms intervals are used for the analysis. Classification tests are made with both a binary-pair partitioned (BPP) neural network system (one neural network for each of the 741 pairs of phones) and a single large neural network. The classification accuracy is very similar for the two types of networks, but the BPP method has the advantage of a much shorter training time. The best results obtained (77% for TIMIT and 67.4% for NTIMIT) compare favorably to the best results reported in the literature for this task
  • Keywords
    acoustic signal processing; backpropagation; discrete Fourier transforms; discrete cosine transforms; feature extraction; neural nets; spectral analysis; speech processing; transforms; DCT; FFT derived speech spectra; NIST recommended test sets; NIST recommended training sets; NTIMIT; TIMIT; acoustic phonetic classification; backpropagation; binary pair partitioned neural network classifier; classification accuracy; classification tests; discrete cosine transform; experimental results; phone classes; phone classification; segmental features; smoothed trajectory; spectral/temporal features; training time; Acoustic signal processing; Cepstral analysis; Feature extraction; NIST; Neural networks; Spectral analysis; Speech analysis; Speech processing; System testing; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596111
  • Filename
    596111