• DocumentCode
    3143172
  • Title

    Polyphonic piano note transcription with recurrent neural networks

  • Author

    Böck, Sebastian ; Schedl, Markus

  • Author_Institution
    Dept. of Comput. Perception, Johannes Kepler Univ., Linz, Austria
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    121
  • Lastpage
    124
  • Abstract
    In this paper a new approach for polyphonic piano note onset transcription is presented. It is based on a recurrent neural network to simultaneously detect the onsets and the pitches of the notes from spectral features. Long Short-Term Memory units are used in a bidirectional neural network to model the context of the notes. The use of a single regression output layer instead of the often used one-versus-all classification approach enables the system to significantly lower the number of erroneous note detections. Evaluation is based on common test sets and shows exceptional temporal precision combined with a significant boost in note transcription performance compared to current state-of-the-art approaches. The system is trained jointly with various synthesized piano instruments and real piano recordings and thus generalizes much better than existing systems.
  • Keywords
    information retrieval; music; musical instruments; recurrent neural nets; regression analysis; signal classification; spectral analysis; bidirectional neural network; common test sets; erroneous note detections; exceptional temporal precision; long short-term memory units; music information retrieval; note transcription performance; one-versus-all classification approach; piano recordings; polyphonic piano note onset transcription; polyphonic piano note transcription; recurrent neural networks; single regression output layer; spectral features; synthesized piano instruments; Accuracy; Hidden Markov models; Instruments; Recurrent neural networks; Spectrogram; Training; music information retrieval; neural networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6287832
  • Filename
    6287832