Recent improvements of an auditory model based front-end for the transcription of vocal queries

Author

De Mulder, T. ; Martens, J.P. ; Lesaffre, M. ; Leman, M. ; De Baets, B. ; De Meyer, H.

Author_Institution

Dept. of Electron. & Inf. Syst., Ghent Univ., Gent, Belgium

Volume

4

fYear

2004

fDate

17-21 May 2004

Abstract

In this paper recent improvements of an existing acoustic frontend for the transcription of vocal (hummed, sung) musical queries is presented. Thanks to the addition of a new second pitch extractor and the introduction of a novel multi-stage segmentation algorithm, the application domain of the front-end could be extended to whistled queries, and on top of that, the performance on the other two query types could be improved. Experiments have shown that the new system can transcribe vocal queries with an accuracy ranging from 76 % (whistling) to 85 % (humming), and that it clearly outperforms other state-of-the art systems on all three query types.

Keywords

audio databases; audio signal processing; music; query processing; auditory model based front-end; hummed musical queries; multi-stage segmentation algorithm; performance; second pitch extractor; sung musical queries; vocal musical queries; vocal query transcription; whistled queries; Electronic music; Frequency; Information systems; Low pass filters; Mathematics; Music information retrieval; Pattern analysis; Pattern matching; Performance analysis; Psychoacoustic models;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-8484-9

Type

conf

DOI

10.1109/ICASSP.2004.1326812

Filename

1326812