Abstract :
Low-power, miniature EM radar-like sensors have made it possible to measure properties of the human speech production system in real-time, without acoustic interference, at low cost. Compression and other applications use an EM sensor measured glottal signal, combined with one or more acoustic signals, to robustly estimate voiced excitation functions, transfer functions, unvoiced speech segments, articulator gestures, and background noise. Applications are speech coding, de-noising, verification, recognition, voice (and music) synthesis, and medical uses. In speech compression, an almost 10-fold bandwidth reduction has been demonstrated, compared to a standard 2.4 kbps LPC10 protocol.
Keywords :
acoustic signal processing; data compression; interference (signal); real-time systems; signal denoising; speech coding; transfer functions; 2.4 kbit/s; EM sensor; acoustic signals; articulator gestures; background noise; bandwidth reduction; human speech production system; real-time systems; signal denoising; speech coding; speech compression; transfer functions; unvoiced speech segments; voiced excitation functions; Acoustic measurements; Acoustic sensors; Costs; Humans; Interference; Noise measurement; Production systems; Real time systems; Sensor systems; Speech coding;