Title :
Robust speech recognition over IP networks
Author :
Milner, Ben ; Semnani, Sharam
Author_Institution :
BT Adastral Park, Adv. Communs. Eng., Martlesham Heath, UK
Abstract :
This work looks at the issues involved in performing robust speech recognition over a packet-based network such as the IP network. This involves the combination of robust speech recognition together with a reliable method of sending speech data over the IP network. The format in which the speech is sent over the network is considered and results show that much better robustness is achieved when the front-end features are transmitted directly rather than encoding the speech with a codec. The problem of packet loss is addressed and a novel detection and estimation scheme for missing frames is introduced to overcome this problem. This is shown to recover performance with 50% packet loss from 33% to 90% which is only 3% below the no loss case
Keywords :
packet switching; speech recognition; IP networks; detection; estimation scheme; front-end features; missing frames; packet loss; packet-based network; performance; reliable method; robust speech recognition; voice recognition; Automatic speech recognition; IP networks; Internet telephony; Mel frequency cepstral coefficient; Robustness; Speech codecs; Speech processing; Speech recognition; Telecommunication network reliability; Web and internet services;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.862101