مرکز منطقه ای اطلاع رساني علوم و فناوري - A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition

DocumentCode :

754307

Title :

A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition

Author :

Ion, Valentin ; Haeb-Umbach, Reinhold

Author_Institution :

Dept. of Commun. Eng., Univ. of Paderborn, Paderborn

Volume :

Issue :

fYear :

2008

fDate :

7/1/2008 12:00:00 AM

Firstpage :

1047

Lastpage :

1060

Abstract :

In this paper, we derive an uncertainty decoding rule for automatic speech recognition (ASR), which accounts for both corrupted observations and inter-frame correlation. The conditional independence assumption, prevalent in hidden Markov model-based ASR, is relaxed to obtain a clean speech posterior that is conditioned on the complete observed feature vector sequence. This is a more informative posterior than one conditioned only on the current observation. The novel decoding is used to obtain a transmission-error robust remote ASR system, where the speech capturing unit is connected to the decoder via an error-prone communication network. We show how the clean speech posterior can be computed for communication links being characterized by either bit errors or packet loss. Recognition results are presented for both distributed and network speech recognition, where in the latter case common voice-over-IP codecs are employed.

Keywords :

Internet telephony; codecs; decoding; hidden Markov models; speech recognition; automatic speech recognition; bit errors; communication links; corrupted observations; distributed speech recognition; error-prone communication network; feature vector sequence; hidden Markov model-based ASR; inter-frame correlation; network speech recognition; packet loss; speech posterior; transmission error robust speech recognition; uncertainty decoding; voice-over-IP codecs; Acoustic distortion; Acoustic noise; Automatic speech recognition; Communication networks; Decoding; Hidden Markov models; Robustness; Speech processing; Speech recognition; Uncertainty; Conditional independence; distributed speech recognition; error concealment; network speech recognition; uncertainty decoding;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2008.925879

Filename :

4544822

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=754307