Title :
Continuous digit recognition using coarse phonetic segmentation
Author_Institution :
Siemens Corporate Research and Technology Laboratories, Princeton, New Jersey
Abstract :
This paper describes a robust speaker dependent continuous digit recognition system which runs in real time on a 16-bit micro-processor. An important design goal was the efficient use of available processing resources. The decision-making steps are ordered according to the degree of difficulty and the amount of processing required. The system uses dynamic time alignment only selectively and locally, relying on lexical constraints imposed in the form of coarse phonetic transcription and a preclassification step which does not require costly time warping in pattern matching. The system achieved 96.5% string accuracy and 99.1% digit accuracy on 540 digit strings (average length of 4 digits) collected from six speakers (4 male, 2 female).
Keywords :
Decision making; Error analysis; Filter bank; Laboratories; Loudspeakers; Pattern matching; Real time systems; Robustness; Signal generators; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
DOI :
10.1109/ICASSP.1987.1169588