Title :
A smart background music mixing algorithm for portable digital imaging devices
Author :
Jin Ah Kang ; Chan Jun Chun ; Hong Kook Kim ; Myeong Bo Kim ; Sang Ryong Kim
Author_Institution :
Sch. of Inf. & Commun., Gwangju Inst. of Sci. & Technol. (GIST), Gwangju, South Korea
fDate :
8/1/2011 12:00:00 AM
Abstract :
In this paper, we propose a smart background music (BGM) mixing algorithm for portable digital imaging devices to enable users to enjoy video content with BGM. The proposed algorithm automatically adjusts the BGM output energy based on the activity and energy of foreground audio (FGA) contained in a video file. To this end, the proposed algorithm classifies each segment of FGA as speech, non-speech, or a mixed signal. After that, it estimates a scale factor for mixing FGA and BGM according to the signal classification result and the energy of FGA. In addition, a fade-in and fade-out process is incorporated in the proposed algorithm in order to improve the perceptual quality of output audio at the boundaries where signal classification is changed. In order to demonstrate the effectiveness of the proposed algorithm, we implement it on a portable digital imaging device in real time and compare the user´s preference of the proposed algorithm with those of conventional algorithms that mixes FGA with BGM based on voice activity detection or a predefined fixed scale factor. It is shown from the experiments that the proposed algorithm is pretty much preferred by around 79%, compared to the conventional algorithms.
Keywords :
audio signal processing; image classification; image segmentation; music; speech processing; video signal processing; BGM; FGA segment; fixed scale factor; foreground audio; portable digital imaging device; smart background music mixing algorithm; speech signal classification; video content; video file; voice activity detection; Algorithm design and analysis; Classification algorithms; Clocks; Digital images; Performance evaluation; Signal processing algorithms; Speech; Portable digital imaging device; audio content classification; audio mixing; backgroundmusic; fade-in andfade-out;
Journal_Title :
Consumer Electronics, IEEE Transactions on
DOI :
10.1109/TCE.2011.6018882