Title :
Frequency offset correction in speech without detecting pitch
Author :
Clark, P. ; Mallidi, S.H. ; Jansen, Anton ; Hermansky, Hynek
Author_Institution :
Human Language Technol. Center of Excellence, Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
Radio-transmitted speech sometimes contains a residual frequency shift or offset, resulting from incorrect demodulation in single-sideband channels. Frequency-shifted speech can mask speaker identity and reduce intelligibility. Therefore, frequency offset will degrade the performance of downstream speech technologies. Existing offset correction methods require a pitch estimate of the speech signal, which is difficult in noisy radio channels. We present a new, automatic algorithm for detecting and correcting frequency offset, based on third-order modulation spectral analysis. Our method is remarkably simple and does not require pitch estimation. We provide derivations, examples, and a pilot study demonstrating how offset correction improves speaker verification for radio-transmitted speech.
Keywords :
estimation theory; frequency allocation; signal detection; speaker recognition; spectral analysis; speech intelligibility; wireless channels; automatic algorithm; detecting pitch; downstream speech technology; frequency offset correction; frequency offset detection; frequency-shifted speech; incorrect demodulation; noisy radio channels; offset correction methods; pitch estimate; pitch estimation; radio-transmitted speech; residual frequency shift; single-sideband channels; speaker identity; speaker verification; speech intelligibility; speech signal; third-order modulation spectral analysis; Amplitude modulation; Demodulation; Frequency modulation; Harmonic analysis; Speech; Speech processing; Modulation spectrum; frequency offset; single-sideband; speaker recognition; speech enhancement;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639023