Title :
A user voice reduction algorithm based on binaural signal separation for portable digital imaging devices
Author :
Park, Ji Hun ; Kim, Hong Kook ; Kim, Myeong Bo ; Kim, Sang Ryong
Author_Institution :
Sch. of Inf. & Commun., Gwangju Inst. of Sci. & Technol. (GIST), Gwangju, South Korea
fDate :
5/1/2012 12:00:00 AM
Abstract :
In this paper, a user voice reduction algorithm for portable digital imaging devices is proposed based on a binaural signal separation approach in order to improve the naturalness of user-generated video contents. The proposed algorithm first estimates the interaural time differences (ITDs) from binaural signals recorded by the microphones equipped on a device. Then, the estimated ITDs are used to obtain the time-frequency domain masking patterns of a user voice against an actual subject sound of video content. Finally, the user voice recorded in video content can be reduced by applying the mask patterns to the binaural signals. In order to demonstrate the effectiveness of the proposed algorithm, the proposed algorithm is implemented on a portable digital imaging device having a clock speed of 600 MHz. It is shown from the performance evaluation by measuring a sound pressure level that the proposed algorithm reduces user voice by around 10 dB.
Keywords :
source separation; time-frequency analysis; voice equipment; ITD; binaural signal separation approach; interaural time differences; microphones; performance evaluation; portable digital imaging devices; sound pressure level; time-frequency domain masking patterns; user voice reduction algorithm; user-generated video contents; Algorithm design and analysis; Digital images; Graphical user interfaces; Microphones; Optimization; Signal processing algorithms; Source separation; Voice reduction; binaural signal separation; computational auditory scene analysis; digital imaging device;
Journal_Title :
Consumer Electronics, IEEE Transactions on
DOI :
10.1109/TCE.2012.6227476