DocumentCode :
3604984
Title :
Real-Time Piano Music Transcription Based on Computer Vision
Author :
Akbari, Mohammad ; Cheng, Howard
Author_Institution :
Dept. of Math. & Comput. Sci., Univ. of Lethbridge, Lethbridge, AB, Canada
Volume :
17
Issue :
12
fYear :
2015
Firstpage :
2113
Lastpage :
2121
Abstract :
One important problem in musical information retrieval is automatic music transcription, which is an automated conversion process from played music to a symbolic notation such as MIDI file. Since the accuracy of previous audio-based transcription systems is not satisfactory, we propose an innovative computer vision-based automatic music transcription system named claVision to perform piano music transcription. Instead of processing the music audio, the system performs the transcription only from the video performance captured by a camera mounted over the piano keyboard. In this paper, we describe the architecture and the algorithms used in claVision. The claVision system has a high accuracy ( F1 score over 0.95) and a very low latency (about 7.0 ms) in real-time music transcription, even under different illumination conditions. This technology can also be used for other musical keyboard instruments.
Keywords :
audio signal processing; computer vision; information retrieval; music; real-time systems; video signal processing; MIDI file; audio-based transcription systems; automatic music transcription; computer vision; musical information retrieval; real-time piano music transcription; video performance; Accuracy; Algorithm design and analysis; Cameras; Computer vision; Keyboards; Music; Music information retrieval; Real-time systems; Automatic music transcription; claVision; computer vision; multipitch estimation; piano;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2015.2473702
Filename :
7225173
Link To Document :
بازگشت