Title :
Recognition of packet loss speech using the most reliable reduced-frame-rate data
Author :
Lee-Min Lee ; Fu-Rong Jean ; Tan-Hsu Tan ; Jen-Hsiang Chou
Author_Institution :
Dept. of Electr. Eng., Dayeh Univ., Changhua, Taiwan
Abstract :
In a client-server distributed speech recognition (DSR) application, speech features are extracted and quantized at the client-end, and are sent to a remote back-end server for recognition. Although the bandwidth constrains are mostly eliminated, data packets may be lost over error prone channels. In order to reduce the performance degradation because of frame missing, a frequently used error concealment approach is to restore a full frame rate (FFR) observation sequence for recognition at the back-end. In this paper, an alternative approach is proposed to deal with observations with lost frames. This approach at first extracts the most reliable reconstructed reduced-frame-rate (RFR) observation sequence from the received data at the back-end, and then decodes it with an adapted hidden Markov model (HMM) that compensates the mismatch between the FFR trained model and the RFR test data. Experimental results show that a DSR system using the proposed method can achieve the same level of accuracy as an FFR data reconstruction method and significantly lessens the computation time. From the viewpoint of user capacity of a DSR system, we find that the proposed method is capable of serving much more client users without any extra cost of installing new equipment.
Keywords :
client-server systems; feature extraction; hidden Markov models; quantisation (signal); speech recognition; FFR data reconstruction method; FFR observation sequence restore; FFR trained model; HMM; RFR test data; bandwidth constrains; client server; client users; client-server DSR application; client-server distributed speech recognition application; computation time; data compensation; data decoding; data packets; error prone channels; frequently-used error concealment approach; full-frame rate observation sequence restoration; hidden Markov model; mismatch compensates; packet loss speech recognition; performance degradation reduction; reliable-reconstructed reduced-frame-rate observation sequence; remote back-end server; speech feature extraction; speech feature quantization; user capacity; Accuracy; Channel models; Decoding; Hidden Markov models; Reliability; Speech; Speech recognition; automatic speech recognition (ASR); distributed speech recognition (DSR); full frame rate (FFR) speech; hidden Markov model (HMM); reduced frame rate (RFR) speech;
Conference_Titel :
Systems, Man and Cybernetics (SMC), 2014 IEEE International Conference on
Conference_Location :
San Diego, CA
DOI :
10.1109/SMC.2014.6974182