• DocumentCode
    2315416
  • Title

    Multiple Description Quantization for Recognizing Voice Over Packet Networks

  • Author

    Lin, I-Te ; Wu, Chun-Feng ; Chen, Sin-Horng ; Chang, Wen-Whei

  • Author_Institution
    Dept. of Commun. Eng., Chiao-Tung Univ., Hsinchu
  • fYear
    2006
  • fDate
    25-27 Oct. 2006
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The practical design of multiple description vector quantizers for robust distributed speech recognition over packet networks is investigated. In the proposed system, speech parameters are quantized and mapped to multiple descriptions for transmission over independent channels. A new approach to the index assignment optimization is presented on the basis of a linear programming framework. Also, a fast local search algorithm is proposed to find the optimal index assignment without compromising the speech recognition accuracy. Experiments with random packet loss in a range of loss conditions are conducted on the Mandarin digit string recognition task. Simulation results indicate that the proposed multiple description scheme provides more robust performance than the ETSI standardized split vector quantization scheme with a single description.
  • Keywords
    linear programming; search problems; speech coding; speech recognition; voice communication; Mandarin digit string recognition task; fast local search algorithm; independent channels; linear programming framework; multiple description quantization; optimal index assignment; random packet loss; robust distributed speech recognition; split vector quantization scheme; voice over packet networks; Cepstrum; Decoding; Feature extraction; Forward error correction; Hidden Markov models; Linear programming; Quantization; Robustness; Speech recognition; Telecommunication standards;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Networking in China, 2006. ChinaCom '06. First International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    1-4244-0463-0
  • Electronic_ISBN
    1-4244-0463-0
  • Type

    conf

  • DOI
    10.1109/CHINACOM.2006.344720
  • Filename
    4149956