DocumentCode :
3048764
Title :
The efficiency of demisyllable segmentation in the recognition of spoken words
Author :
Ruske, Günther ; Schotola, Thomas
Author_Institution :
Technical University of Munich, Federal Republic of Germany
Volume :
6
fYear :
1981
fDate :
29677
Firstpage :
971
Lastpage :
974
Abstract :
The efficiency of syllabic segmentation and recognition is demonstrated in an experiment using three different word recognition systems and a vocabulary of 1000 words. In each system the preprocessing is carried out by a special loudness analyzer which yields 22 specific loudness functions. The first system avoids any segmentation and the total word pattern is time normalized to a constant length. In the second system syllable nuclei are detected and used as segment boundaries; the segments are time normalized and the resulting word pattern classified. The third system classifies each demisyllable using vowels and consonant clusters as decision units. For small vocabularies the first system gives the best performance. For more than 500 words the performance of the second system with syllabic segmentation surpasses that of the first system. For this vocabulary size however, the demisyllable recognition performs best and is significantly advantageous for a vocabulary consisting of 1000 words.
Keywords :
Humans; Pattern recognition; Signal processing; Speech processing; Speech recognition; Uncertainty; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '81.
Type :
conf
DOI :
10.1109/ICASSP.1981.1171361
Filename :
1171361
Link To Document :
بازگشت