DocumentCode :
3060336
Title :
Properties of large lexicons: Implications for advanced isolated word recognition systems
Author :
Shipman, David W. ; Zue, Victor W.
Author_Institution :
Massachusetts Institute of Technology, Cambridge, MA, USA
Volume :
7
fYear :
1982
fDate :
30072
Firstpage :
546
Lastpage :
549
Abstract :
As part of our goal to design large-vocabulary, phonetically-based isolated word recognition systems, we investigated the statistical properties and constraints of the phonemic structures of English words. Our database consisted of five lexicons varying in size from 1250 to 20,000 words. The lexicons included, in addition to a phonemic transcription for each word, the word´s frequency of occurrence as determined from the Brown Corpus. We studied the distributions of the phonemes, both individually and by class, within the lexicon and within the corpus. Distributions of consonant clusters were also obtained. Finally, the distribution of words in terms of patterns derived from broad categorization of the phonemes was investigated. This paper summarizes the results of these studies and discusses implications for phonetically-based isolated word recognition strategies.
Keywords :
Databases; Dynamic programming; Error analysis; Frequency; Isolation technology; Pattern matching; Pattern recognition; Speech analysis; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
Type :
conf
DOI :
10.1109/ICASSP.1982.1171902
Filename :
1171902
Link To Document :
بازگشت