DocumentCode
431647
Title
Automatic transcription of drum sequences using audiovisual features
Author
Gillet, Olivier ; Richard, Gaël
Author_Institution
Signal & Image Process. Dept., Telecom Paris, France
Volume
3
fYear
2005
fDate
18-23 March 2005
Abstract
The transcription of a musical performance from the audio signal is often problematic, either because it requires the separation of complex sources, or simply because some important high-level music information cannot be directly extracted from the audio signal. We propose a novel multimodal approach for the transcription of drum sequences using audiovisual features. The transcription is performed by support vector machine (SVM) classifiers, and three different information fusion strategies are evaluated. A correct recognition rate of 85.8% can be achieved for a detailed taxonomy and a fully automated transcription.
Keywords
audio signal processing; audio-visual systems; music; pattern classification; sensor fusion; sequences; signal classification; support vector machines; video signal processing; SVM classifiers; audio signal; audiovisual features; automatic music transcription; complex source separation; drum sequences; high-level music information; information fusion strategies; multimodal approach; support vector machine classifiers; video signal; Audio recording; Data mining; Independent component analysis; Instruments; Layout; Machine assisted indexing; Multiple signal classification; Signal processing; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8874-7
Type
conf
DOI
10.1109/ICASSP.2005.1415682
Filename
1415682
Link To Document