DocumentCode :
730068
Title :
Latent time-frequency component analysis: A novel pitch-based approach for singing voice separation
Author :
Xiu Zhang ; Wei Li ; Bilei Zhu
Author_Institution :
Sch. of Comput. Sci., Fudan Univ., Shanghai, China
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
131
Lastpage :
135
Abstract :
Monaural singing voice separation has aroused considerable attention. Many pitch-based methods have been proposed to address this task, but generally have limited performance. The most crucial difficulties lie in the inaccurate judgment on voiced pitches and the failed recognition on unvoiced singing sounds. In this paper, we propose a novel algorithm based on the latent component analysis of time-frequency representation to overcome these difficulties. Specifically, the time-frequency (T-F) representations of the song are firstly decomposed into components, and each component approximately originates from a single sound source. We then construct non-overlapping T-F segments with these components, to complete the omitted useful singing voice information. Extensive experiments on the MIR-1K public dataset shows the effectiveness of the proposed algorithm.
Keywords :
approximation theory; audio signal processing; information retrieval; music; signal representation; source separation; speech processing; time-frequency analysis; MIR-1K public dataset; latent time-frequency component analysis; monaural singing voice separation; music information retrieval; nonoverlapping T-F segments; pitch-based approach; time-frequency representation; unvoiced singing sounds; Accuracy; Algorithm design and analysis; Inference algorithms; Matrix decomposition; Speech; Speech processing; Time-frequency analysis; Latent Time-Frequency Component Analysis; Pitch-Based Inference; Singing voice separation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7177946
Filename :
7177946
Link To Document :
بازگشت