DocumentCode :
838269
Title :
Hamlet: a prototype of a voice-activated typewriter
Author :
Mariani, J.J.
Author_Institution :
LIMSI, CNRS, Orsay, France
Volume :
136
Issue :
2
fYear :
1989
fDate :
4/1/1989 12:00:00 AM
Firstpage :
162
Lastpage :
166
Abstract :
This project integrates different parts of a speaker-dependent isolated word voice-activated typewriter on a personal computer (IBM PC-AT). To build up the language model (for French), several routines have been written: automatic grapheme to phoneme conversion, semiautomatic training texts (20 pages) processing (building up the graphemic (2500 words) and phonemic (2000 words) lexicons), syntactic labelling through inductive inference, computation of the probabilistic language model (bigrams and trigrams on grammatical classes), and the definition of the phonological rules. The speech signal is analysed by 20 digital bandpass filters. Several types of speech compression techniques have been tried on medium and large difficulty vocabularies. Vector quantisation and nonlinear time compression have been chosen. Recognition is conducted in three steps: fast match based on word length and gross comparison; detailed match based on conventional DTW algorithms; and use of the language model to take into account the linguistic constraints, and to achieve the phoneme to grapheme conversion. Overall recognition rates of 95% have been obtained with a mean recognition time of 2 s, the 2000 templates being stored on 60 KBytes of RAM memory. Recognition results with or without the language model have been compared.
Keywords :
speech recognition; 60 KByte; Hamlet; IBM PC-AT; automatic grapheme; bigrams; digital bandpass filters; inductive inference; nonlinear time compression; personal computer; phoneme conversion; phonological rules; probabilistic language model; semiautomatic training texts; speaker-dependent isolated word voice-activated typewriter; speech compression; syntactic labelling; trigrams; vector quantisation; voice-activated typewriter;
fLanguage :
English
Journal_Title :
Communications, Speech and Vision, IEE Proceedings I
Publisher :
iet
ISSN :
0956-3776
Type :
jour
Filename :
19002
Link To Document :
بازگشت