DocumentCode
3021607
Title
Robust voice activity detection for DTX operation of speech coders
Author
Basbug, Filiz ; Nandkumar, S. ; Swaminathan, Karthik
Author_Institution
Hughes Network Syst. Inc., Germantown, MD, USA
fYear
1999
fDate
1999
Firstpage
58
Lastpage
60
Abstract
Robust detection of voice activity for short-term speech frames is essential for discontinuous transmission (DTX) mode of operation of vocoders such as IS-641. A reference VAD for the IS-641 coder has been chosen for such a purpose and is based on the GSM-EFR (enhance full rate) VAD. We show by developing a comprehensive evaluation procedure that the reference VAD is sensitive to speech level variations. For example, a significant increase is seen in frames falsely classified as active at speech levels of 10 dB above or below nominal level. We propose a solution based on automatic gain control to reduce level sensitivity. Objective performance measures confirm the robustness of our proposed VAD
Keywords
acoustic signal detection; automatic gain control; speech coding; vocoders; DTX operation; GSM-EFR VAD; IS-641 coder; automatic gain control; discontinuous transmission mode; enhance full rate; objective performance measures; robust detection; sensitivity reduction; short-term speech frames; speech coders; speech level variations; vocoders; voice activity detection; Base stations; Battery charge measurement; GSM; Gain control; Robust control; Robustness; Speech analysis; Speech enhancement; Statistics; Vocoders;
fLanguage
English
Publisher
ieee
Conference_Titel
Speech Coding Proceedings, 1999 IEEE Workshop on
Conference_Location
Porvoo
Print_ISBN
0-7803-5651-9
Type
conf
DOI
10.1109/SCFT.1999.781483
Filename
781483
Link To Document