Title :
A human machine speaker dependent speech interactive system
Author :
Sreenu, G. ; Girija, P.N. ; Prasad, M. Narendra ; Nagamani, M.
Author_Institution :
Dept. of Comput. & Inf. Sci., Hyderabad Univ., India
Abstract :
Speech has several characteristic features such as naturalness and efficient, spontaneous mode makes it an attractive interface element. It is possible to express emotions and attitudes. Its high bandwidth allows to develop multimodal interfaces. Telugu language has been used to communicate with the system. This process involves conversion of speech query into text, search for its suitable answer and then conversion of text into speech signal. So to perform the above tasks speech recognition system, search engine and speech synthesis system are needed respectively. Each of these modules takes input and passes output to another module in sequential order. First the features of the raw signal that are stored in the speech database are extracted and given to the speech recognizer which produces its equivalent text form and sends it to the search engine, whose duty is to find suitable answer from the database. It passes answer to the speech synthesizer that converts text into speech (i.e., reverse operation of speech recognizer) and sends back to the user. Speaker dependent speech recognition systems give more recognition performance than speaker dependent systems. In order to build a speaker dependent speech recognition system the data was recorded by a single male speaker and also for speech synthesis 1 hour speech is used. The data was recorded in a computer laboratory and background noise was normal. This system works using only microphone and our next step is to build a telephonic communication system.
Keywords :
audio databases; feature extraction; interactive systems; man-machine systems; natural languages; query processing; speech recognition; speech synthesis; speech-based user interfaces; Telugu language; human machine system; microphone; multimodal interface; search engine; speaker dependent speech interactive system; speech database extraction; speech query; speech recognition system; speech synthesis; telephonic communication system; Bandwidth; Humans; Interactive systems; Search engines; Signal processing; Spatial databases; Speech processing; Speech recognition; Speech synthesis; Text recognition;
Conference_Titel :
India Annual Conference, 2004. Proceedings of the IEEE INDICON 2004. First
Print_ISBN :
0-7803-8909-3
DOI :
10.1109/INDICO.2004.1497769