Title :
Data driven search organization for continuous speech recognition
Author :
Ney, Hermann ; Mergel, Dieter ; Noll, Andreas ; Paeseler, Annedore
Author_Institution :
Philips GmbH Forschungslab., Aachen, Germany
Date :
2/1/1992
Abstract :
The authors describe an architecture and search organization for continuous speech recognition. The recognition module is part of the SPICOS system (Siemens-Philips-IPO project on continuous speech recognition and understanding), which is designed to understand database queries spoken in natural language. The goal of this project is a man-machine dialogue system that is able to understand fluently spoken German sentences and thus provide voice access to a database. The recognition strategy is based on the Bayes decision rule and attempts to find the best interpretation of the input speech data in terms of knowledge sources such as a language model, a pronunciation lexicon, and an inventory of subword units. The implementation of the search has been tested on a continuous speech database comprising up to 4000 words for each of several speakers. The efficiency and robustness of the search organization have been evaluated along many dimensions, such as different speakers, phoneme models, and language models.
Keywords :
Bayes methods; search problems; speech recognition; Bayes decision rule; SPICOS; Siemens-Philips-IPO project; continuous speech recognition; database queries; fluently spoken German sentences; inventory of subword units; knowledge sources; language model; man-machine dialogue system; pronunciation lexicon; search organization; speech understanding; voice access; Computational efficiency; Decision theory; Delay; Dynamic programming; Heuristic algorithms; Natural languages; Robustness; Speech recognition; State-space methods; Testing
Journal_Title :
Signal Processing, IEEE Transactions on