Title :
Transcription of polyphonic piano music with neural networks
Author_Institution :
Fac. of Comput. & Inf. Sci., Ljubljana Univ., Slovenia
Abstract :
This paper presents our experiences in building a system for transcription of polyphonic piano music. By transcription we mean the conversion of an audio recording of a polyphonic piano performance to a series of notes and their starting times. Our final goal is to build a transcription system that would transcribe polyphonic piano music over the entire piano range and with large polyphony. The system consists of three main stages. We first use a cochlear model based on the gammatone filterbank to transform an audio signal of a piano performance into time-frequency space. In the second stage we use a network of coupled adaptive oscillators to extract partial tracks from the output of the cochlear model and in the third stage we employ artificial neural networks acting as pattern recognisers to extract notes from the output of the oscillator network. The system uses several networks each trained to recognize the occurrence of a specific note in the input signal.
Keywords :
audio signal processing; channel bank filters; feedforward neural nets; music; pattern recognition; time-frequency analysis; transforms; artificial neural networks; audio recording; audio signal; cochlear model; coupled adaptive oscillators; gammatone filterbank; neural networks; notes; partial tracks; pattern recognisers; polyphonic piano music; starting times; time-frequency space; transcription; Adaptive systems; Artificial neural networks; Band pass filters; Filter bank; Frequency; Multiple signal classification; Music; Neural networks; Oscillators; Speech recognition;
Conference_Titel :
Electrotechnical Conference, 2000. MELECON 2000. 10th Mediterranean
Print_ISBN :
0-7803-6290-X
DOI :
10.1109/MELCON.2000.879982