DocumentCode :
312280
Title :
Automatic detection of accent nuclei at the head of words for speech recognition
Author :
Minematsu, Nobuaki ; Nakagawa, Seiichi
Author_Institution :
Dept. of Inf. & Comput. Sci., Toyohashi Univ. of Technol., Japan
Volume :
3
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
1620
Abstract :
A new scheme is proposed for incorporating prosodic processing into speech recognition: accent nuclei at the head of words are detected automatically and used to limit the search space in speech recognition, that is, to preselect candidate words. The proposed method for automatic detection of the accent nuclei and its performance are described. The scheme is expected to improve recognition speed. It is derived from a finding of perceptual experiments conducted previously by the first author. The results of those experiments indicated that an accent nucleus at the first mora has an accelerating effect on perception of the word. This effect can be explained by earlier identification of the word accent type as type 1 from its nucleus at the first mora. In other words, the accent nucleus at the head of a word can effectively limit the search space in the mental lexicon. This mechanism was implemented using HMMs and examined for isolated words on a machine, where vowel detection by broad segmental features and rejection of words with a devoiced vowel at the first or second mora were introduced at the same time. Evaluation experiments showed recall and precision factors of 94.7% and 90.0%, respectively, for accent nucleus detection.
Keywords :
hidden Markov models; speech recognition; automatic accent nuclei detection; broad segmental features; candidate word preselection; devoiced vowel; first mora; isolated words; limited searching space; mental lexicon; perceptual experiments; precision factor; prosodic processing; recall factor; recognition speed; second mora; vowel detection; word head; word rejection; Acceleration; Data mining; Humans; Natural languages; Speech processing
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607934
Filename :
607934