• DocumentCode
    1878848
  • Title

    An acoustical pattern classifier based on N-depth projection on privileged eigenstructures

  • Author

    Falcone, Mauro ; Paoloni, Andrea

  • Author_Institution
    Fondazione Ugo Bordoni, Rome, Italy
  • fYear
    1991
  • fDate
    14-17 Apr 1991
  • Firstpage
    3301
  • Abstract
    A geometrical vector classifier is applied to the problem of phonetic classification in several experimental environments. The algorithm is based on the measure of similarity between the original vector and the ones reconstructed using a N-depth projection on the eigenvectors related to the covariance matrix of each category to be classified. For each category (i.e. for each phoneme) there is a privileged subspace of arbitrary dimension and with N axes where the similarity of the training vector set is maximized. These geometrical subspaces are characterized in relation to databases, speaker dependence, speech emission, and signal parametrization. Experiments were performed using three small databases: a four-speaker continuous speech, a single-speaker isolated words, and a single-speaker continuous speech database. Results are reported for closed tests (where training and classification were performed on the same database), and for open tests (where they were performed on different databases). It is concluded that the proposed method may, in some cases, successfully substitute for vector quantizer techniques
  • Keywords
    eigenvalues and eigenfunctions; speech analysis and processing; speech recognition; N-depth projection; acoustical pattern classifier; automatic speech recognition; closed tests; continuous speech database; covariance matrix; databases; eigenvectors; geometrical subspaces; geometrical vector classifier; isolated words database; open tests; phoneme; phonetic classification; signal parametrization; speaker dependence; speech emission; training vector set; Automatic speech recognition; Hidden Markov models; Information processing; Loudspeakers; Neural networks; Performance evaluation; Spatial databases; Speech analysis; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
  • Conference_Location
    Toronto, Ont.
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0003-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1991.150159
  • Filename
    150159