• DocumentCode
    714183
  • Title

    Classification of vocal and non-vocal regions from audio songs using spectral features and pitch variations

  • Author

    Srinivasa Murthy, Y.V. ; Koolagudi, Shashidhar G.

  • Author_Institution
    Dept. of CSE, Nat. Inst. of Technol. Karnataka, Surathkal, India
  • fYear
    2015
  • fDate
    3-6 May 2015
  • Firstpage
    1271
  • Lastpage
    1276
  • Abstract
    In this work, an effort has been made to identify vocal and non-vocal regions from a given song using signal processing techniques and machine learning algorithm. Initially spectral features like mel-frequency cepstral coefficients (MFCCs) are used to develop the baseline system. Statistical values of pitch, jitter and shimmer are considered to improve performance of the system. Artificial neural networks (ANNs) are used to capture the characteristics of vocal and non-vocal segments of the songs. The experiment is conducted on 60 vocal and 60 non-vocal clips extracted from Telugu albums. 11-point moving window is used to ensure the continuity of vocal and non-vocal segments, thus improving the accuracy of system. With this approach system achieves 85.59% accuracy for vocal and 88.52% for non-vocal segment classification.
  • Keywords
    audio signal processing; neural nets; signal classification; MFCC; Telugu albums; artificial neural networks; audio songs; jitter; machine learning algorithm; mel-frequency cepstral coefficients; nonvocal regions classification; nonvocal segment classification; pitch variations; signal processing techniques; spectral features; statistical values; vocal regions classification; Accuracy; Feature extraction; Jitter; Mel frequency cepstral coefficient; Music; Neurons; Artificial neural networks; Jitter; Mel-frequency cepstral coefficients; Non-vocal regions; Pitch; Shimmer; Vocal regions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering (CCECE), 2015 IEEE 28th Canadian Conference on
  • Conference_Location
    Halifax, NS
  • ISSN
    0840-7789
  • Print_ISBN
    978-1-4799-5827-6
  • Type

    conf

  • DOI
    10.1109/CCECE.2015.7129461
  • Filename
    7129461