DocumentCode
417126
Title
The ETSI extended distributed speech recognition (DSR) standards: client side processing and tonal language recognition evaluation
Author
Sorin, Alexander ; Ramabadran, Tenkasi ; Chazan, Dan ; Hoory, Ron ; McLaughlin, Michael ; Pearce, David ; Wang, Fan CR ; Zhang, Yaxin
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
We present work that has been carried out in developing the ETSI extended DSR standards ES 202 211 and ES 202 212 (2003). These standards extend the previous ETSI DSR standards: basic front-end ES 201 108 and advanced (noise robust) front-end ES 202 050 respectively. The extensions enable enhanced tonal language recognition as well as server-side speech reconstruction capability. The paper discusses the client-side estimation of pitch and voicing class parameters whereas a companion paper discusses the server-side speech reconstruction. Experimental results show enhancement of tonal language recognition rates of proprietary recognition engines, when the standard extensions are used.
Keywords
natural languages; speech processing; speech recognition; standards; ES 201 108; ES 202 050; ES 202 211; ES 202 212; ETSI distributed speech recognition standards; advanced front-end; basic front-end; client-side pitch estimation; client-side voicing class parameter estimation; enhanced tonal language recognition; extended distributed speech recognition standards; noise robust front-end; server-side speech reconstruction; Delay; Feature extraction; Frequency estimation; Mel frequency cepstral coefficient; Natural languages; Noise robustness; Speech processing; Speech recognition; Standards development; Telecommunication standards;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325939
Filename
1325939
Link To Document