DocumentCode
1878848
Title
An acoustical pattern classifier based on N-depth projection on privileged eigenstructures
Author
Falcone, Mauro ; Paoloni, Andrea
Author_Institution
Fondazione Ugo Bordoni, Rome, Italy
fYear
1991
fDate
14-17 Apr 1991
Firstpage
3301
Abstract
A geometrical vector classifier is applied to the problem of phonetic classification in several experimental environments. The algorithm is based on the measure of similarity between the original vector and the ones reconstructed using a N -depth projection on the eigenvectors related to the covariance matrix of each category to be classified. For each category (i.e. for each phoneme) there is a privileged subspace of arbitrary dimension and with N axes where the similarity of the training vector set is maximized. These geometrical subspaces are characterized in relation to databases, speaker dependence, speech emission, and signal parametrization. Experiments were performed using three small databases: a four-speaker continuous speech, a single-speaker isolated words, and a single-speaker continuous speech database. Results are reported for closed tests (where training and classification were performed on the same database), and for open tests (where they were performed on different databases). It is concluded that the proposed method may, in some cases, successfully substitute for vector quantizer techniques
Keywords
eigenvalues and eigenfunctions; speech analysis and processing; speech recognition; N-depth projection; acoustical pattern classifier; automatic speech recognition; closed tests; continuous speech database; covariance matrix; databases; eigenvectors; geometrical subspaces; geometrical vector classifier; isolated words database; open tests; phoneme; phonetic classification; signal parametrization; speaker dependence; speech emission; training vector set; Automatic speech recognition; Hidden Markov models; Information processing; Loudspeakers; Neural networks; Performance evaluation; Spatial databases; Speech analysis; Speech recognition; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location
Toronto, Ont.
ISSN
1520-6149
Print_ISBN
0-7803-0003-3
Type
conf
DOI
10.1109/ICASSP.1991.150159
Filename
150159
Link To Document