Title :
A two-stage query by singing/humming system on GPU
Author :
Wei-Tsa Kao ; Chung-Che Wang ; Kaichun K. Chang ; Jyh-Shing R. Jang ; Wenshan Liou
Author_Institution :
ISA, Nat. Tsing Hua Univ., Hsinchu, Taiwan
Date :
Oct. 29 2013-Nov. 1 2013
Abstract :
This paper proposes using a GPU (graphics processing unit) to implement a two-stage comparison method for a QBSH (query by singing/humming) system. The system takes a user's singing or humming as input and retrieves the top-10 most likely candidates from a database of 8431 songs. To speed up the comparison, the first stage applies linear scaling to select candidate songs from the database. In the second stage, these candidates are re-ranked by dynamic time warping to achieve better recognition accuracy. With the optimum setting, we achieve a speedup factor of 7 (compared to dynamic time warping on GPU) and an accuracy of 77.65%.
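The two-stage idea in the abstract can be illustrated with a small CPU-only sketch. This is not the authors' GPU implementation: the function names, the scaling factors, the mean-shift used for key transposition, the absolute-difference cost, and the candidate count are all assumptions made for illustration. Pitch sequences are assumed to be lists of semitone values.

```python
import math


def resample(seq, n):
    """Linearly interpolate seq to n points (assumes n >= 2)."""
    out = []
    for i in range(n):
        pos = i * (len(seq) - 1) / (n - 1)
        lo = int(math.floor(pos))
        hi = min(lo + 1, len(seq) - 1)
        frac = pos - lo
        out.append(seq[lo] * (1 - frac) + seq[hi] * frac)
    return out


def zero_mean(seq):
    """Subtract the mean so that key transposition is ignored (assumption)."""
    m = sum(seq) / len(seq)
    return [x - m for x in seq]


def linear_scaling_distance(query, ref, factors=(0.5, 0.75, 1.0, 1.25, 1.5)):
    """Stage 1: stretch/compress the query and compare it to the start of ref."""
    best = float("inf")
    for f in factors:  # hypothetical set of scaling factors
        n = int(round(len(query) * f))
        if n < 2 or n > len(ref):
            continue
        q = zero_mean(resample(query, n))
        r = zero_mean(ref[:n])
        dist = sum(abs(a - b) for a, b in zip(q, r)) / n
        best = min(best, dist)
    return best


def dtw_distance(a, b):
    """Stage 2: classic dynamic time warping with anchored end points."""
    inf = float("inf")
    prev = [inf] * (len(b) + 1)
    prev[0] = 0.0
    for x in a:
        cur = [inf] * (len(b) + 1)
        for j, y in enumerate(b, start=1):
            cur[j] = abs(x - y) + min(prev[j], prev[j - 1], cur[j - 1])
        prev = cur
    return prev[len(b)]


def two_stage_search(query, database, n_candidates=100, top_k=10):
    """Select candidates by linear scaling, then re-rank them by DTW."""
    candidates = sorted(
        database, key=lambda name: linear_scaling_distance(query, database[name])
    )[:n_candidates]
    q = zero_mean(query)
    reranked = sorted(
        candidates, key=lambda name: dtw_distance(q, zero_mean(database[name]))
    )
    return reranked[:top_k]
```

In this sketch, the inexpensive first stage limits how many songs reach the more expensive dynamic time warping stage, which is where the speedup in such a design would come from; `n_candidates` is the knob that trades speed against accuracy.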
Keywords :
audio signal processing; graphics processing units; microphones; music; query processing; GPU; QBSH; audio signal; candidate song selection; dynamic time warping; linear scaling; microphone; music retrieval; two-stage query-by-singing-humming system; Accuracy; Computer architecture; Databases; Educational institutions; Graphics processing units; Instruction sets; Vectors;
Conference_Title :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
Conference_Location :
Kaohsiung
DOI :
10.1109/APSIPA.2013.6694309