• DocumentCode
    1786627
  • Title

    Variable frame rate and length analysis for data compression in distributed speech recognition

  • Author

    Ivan, Kraljevski ; Tan Zhenghua

  • Author_Institution
    voice INTER connect GmbH, Dresden, Germany
  • fYear
    2014
  • fDate
    19-21 Sept. 2014
  • Firstpage
    453
  • Lastpage
    457
  • Abstract
    This paper addresses the issue of data compression in distributed speech recognition on the basis of a variable frame rate and length analysis method. The method first conducts frame selection by using a posteriori signal-to-noise ratio weighted energy distance to find the right time resolution at the signal level, and then increases the length of the selected frame according to the number of non-selected preceding frames to find the right time-frequency resolution at the frame level. It produces high frame rate and small frame length in rapidly changing regions and low frame rate and large frame length for steady regions. The method is applied to scalable source coding in distributed speech recognition where the target bitrate is met by adjusting the frame rate. Speech recognition results show that the proposed approach outperforms other compression methods in terms of recognition accuracy for noisy speech while achieving higher compression rates.
  • Keywords
    data compression; speech recognition; time-frequency analysis; data compression; distributed speech recognition; frame level; frame selection; length analysis method; noisy speech; nonselected preceding frames; posteriori signal-to-noise ratio weighted energy distance; scalable source coding; time-frequency resolution; variable frame rate; Bit rate; Data compression; Noise measurement; Signal to noise ratio; Speech; Speech recognition; distributed speech recognition; variable frame length; variable frame rate;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Network Infrastructure and Digital Content (IC-NIDC), 2014 4th IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-4736-2
  • Type

    conf

  • DOI
    10.1109/ICNIDC.2014.7000344
  • Filename
    7000344