مرکز منطقه ای اطلاع رساني علوم و فناوري - A diphone-based digit recognition system using neural networks

DocumentCode :

310487

Title :

A diphone-based digit recognition system using neural networks

Author :

Hosom, John-Paul ; Cole, Ronald A.

Author_Institution :

Center for Spoken Language Understanding, Oregon Graduate Inst. of Sci. & Technol., Beaverton, OR, USA

Volume :

fYear :

1997

fDate :

21-24 Apr 1997

Firstpage :

3369

Abstract :

In exploring new ways of looking at speech data, we have developed an alternative method of segmentation for training a neural-network-based digit-recognition system. Whereas previous methods segment the data into monophones, biphones, or triphones and train on each sub-phone unit in several broad-category contexts, our new method uses modified diphones to train on the regions of greatest spectral change as well as the regions of greatest stability. Although we account for regions of spectral stability, we do not require their presence in our word models. Empirical evidence for the advantage of this new method is seen by the 13% reduction in word-level error that was achieved on a test set of the OGI Numbers corpus. Comparison was made to a baseline system that used context-independent monophones and context-dependent biphones and triphones

Keywords :

acoustic signal processing; learning (artificial intelligence); neural nets; speech processing; speech recognition; OGI Numbers corpus; context-dependent biphones; context-dependent triphones; context-independent monophones; diphone-based digit recognition system; neural networks; segmentation; spectral change; spectral stability; word models; word-level error; Context modeling; Energy states; Humans; NIST; Neural networks; Noise level; Speech enhancement; Speech recognition; Stability; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on

Conference_Location :

Munich

ISSN :

1520-6149

Print_ISBN :

0-8186-7919-0

Type :

conf

DOI :

10.1109/ICASSP.1997.595516

Filename :

595516

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=310487