مرکز منطقه ای اطلاع رساني علوم و فناوري - Decision-tree based feature-space quantization for fast Gaussian computation

DocumentCode :

2980346

Title :

Decision-tree based feature-space quantization for fast Gaussian computation

Author :

Padmanabhan, M. ; Jan, E.E. ; Bahl, L.R. ; Picheny, M.

Author_Institution :

IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

fYear :

1997

fDate :

14-17 Dec 1997

Firstpage :

325

Lastpage :

330

Abstract :

Presents a decision-tree based procedure to quantize the feature-space of a speech recognizer, with the motivation of reducing the computation time required for evaluating Gaussians in a speech recognition system. The entire feature space is quantized into non-overlapping regions, where each region is bounded by a number of hyperplanes. Each region is characterized by the occurrence of only a small number of the total alphabet of allophones (sub-phonetic speech units). By identifying the region in which a test feature vector lies, only the Gaussians that model the density of allophones that exist in that region need be evaluated. The quantization of the feature space is done in a hierarchical manner using a binary decision tree. Each node of the decision tree represents a region of the feature space, and is further characterized by a hyperplane (a vector v__n and a scalar threshold value h_n), that subdivides the region corresponding to the current node into two non-overlapping regions corresponding to the two children of the current node. Given a test feature vector, the process of finding the region that it lies in involves traversing this binary decision tree, which is computationally inexpensive. We present results of experiments that show that the Gaussian computation time can be reduced by as much as a factor of 20 with negligible degradation in accuracy. We also examine issues of robustness to different environments

Keywords :

Gaussian distribution; quantisation (signal); speech recognition; trees (mathematics); accuracy; allophone alphabet; allophone density modelling; binary decision tree; computation time; fast Gaussian computation; feature space regions; feature-space quantization; hyperplanes; nonoverlapping regions; robustness; scalar threshold value; speech recognition system; subphonetic speech units; test feature vector; Decision trees; Degradation; Error analysis; Robustness; Speech analysis; Speech recognition; Testing; Vector quantization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on

Conference_Location :

Santa Barbara, CA

Print_ISBN :

0-7803-3698-4

Type :

conf

DOI :

10.1109/ASRU.1997.659107

Filename :

659107

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2980346