• DocumentCode
    180326
  • Title

    Sparse cepstral codes and power scale for instrument identification

  • Author

    Li-Fan Yu ; Li Su ; Yi-Hsuan Yang

  • Author_Institution
    Res. Center for Inf. Technol. Innovation, Acad. Sinica, Taipei, Taiwan
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    7460
  • Lastpage
    7464
  • Abstract
    This paper presents a novel feature representation called sparse cepstral codes for instrument identification. We first motivate the approach by discussing why cepstrum is suitable for instrument identification. Then we propose the use of sparse coding and power normalization to derive compact codes that better represent the information of the cepstrum. Our evaluation on both uni-source and multi-source instrument identification tasks show that the proposed feature leads to significantly better accuracy than existing methods. We further show that cepstrum obtained from power-scaled spectrum can do better than typical cepstrum especially in multi-source signal. The proposed system achieves 0.955 F-score in uni-source dataset and 0.688 F-score in multi-source dataset.
  • Keywords
    cepstral analysis; encoding; F-score; cepstrum information; compact codes; feature representation; multisource dataset; multisource instrument identification; multisource signal; power normalization; power-scaled spectrum; sparse cepstral codes; uni-source dataset; uni-source instrument identification; Accuracy; Cepstrum; Dictionaries; Instruments; Mel frequency cepstral coefficient; Speech; cepstrum; dictionary learning; instrument identification; power scale; sparse coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6855050
  • Filename
    6855050