DocumentCode
417769
Title
Recent improvements of an auditory model based front-end for the transcription of vocal queries
Author
De Mulder, T. ; Martens, J.P. ; Lesaffre, M. ; Leman, M. ; De Baets, B. ; De Meyer, H.
Author_Institution
Dept. of Electron. & Inf. Syst., Ghent Univ., Gent, Belgium
Volume
4
fYear
2004
fDate
17-21 May 2004
Abstract
This paper presents recent improvements to an existing acoustic front-end for the transcription of vocal (hummed, sung) musical queries. Thanks to the addition of a new, second pitch extractor and the introduction of a novel multi-stage segmentation algorithm, the application domain of the front-end has been extended to whistled queries, and the performance on the other two query types has improved as well. Experiments show that the new system transcribes vocal queries with an accuracy ranging from 76% (whistling) to 85% (humming), and that it clearly outperforms other state-of-the-art systems on all three query types.
Keywords
audio databases; audio signal processing; music; query processing; auditory model based front-end; hummed musical queries; multi-stage segmentation algorithm; performance; second pitch extractor; sung musical queries; vocal musical queries; vocal query transcription; whistled queries; Electronic music; Frequency; Information systems; Low pass filters; Mathematics; Music information retrieval; Pattern analysis; Pattern matching; Performance analysis; Psychoacoustic models
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326812
Filename
1326812