• DocumentCode
    3348834
  • Title

    Automatic singer identification based on auditory features

  • Author

    Wei Cai ; Qiang Li ; Xin Guan

  • Author_Institution
    Sch. of Electron. & Inf. Eng., Tianjin Univ., Tianjin, China
  • Volume
    3
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    1624
  • Lastpage
    1628
  • Abstract
    The paper describes a method of identifying singers´ voice from the monophonic music including sounds of various musical instruments based on auditory features. In this system, there are four problems to solve, vocal segment detection, feature extraction, modeling of the singing voice and identification. For a song to be identified, the vocal/nonvocal segment is detected via a new classifier - Sparse Representation-based Classification (SRC). The feature extraction is of the most importance. Human ear can distinguish among different types of sounds, so auditory features to describe the singer´s voice are important. To describe the auditory features, we calculate features of each frame including Mel-frequency Cepstral Coefficient (MFCC), Liner Prediction Mel-frequency Cepstral Coefficient (LPMCC) and Gammatone Cepstral Coefficient (GTCC). Finally, we introduce the Gaussian Mixture Model (GMM) to model the singers´ voice. This system is demonstrated to improve the performance of an automatic singer identification system in Music Information Retrieval (MIR).
  • Keywords
    Gaussian processes; cepstral analysis; feature extraction; information retrieval; musical instruments; signal classification; speaker recognition; GMM; Gaussian mixture model; LPMCC; MFCC; auditory feature extraction; automatic singer identification; human ear; liner prediction Mel-frequency cepstral coefficient; monophonic music; music information retrieval; musical instruments; nonvocal segment detection; singer voice identification; singing voice modeling; sparse representation-based classification; vocal segment detection; Feature extraction; Filter banks; Humans; Low pass filters; Mel frequency cepstral coefficient; GMM; Gammatone Cepstrum Coefficient; SRC; auditory feature; singer identification; singing voice detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation (ICNC), 2011 Seventh International Conference on
  • Conference_Location
    Shanghai
  • ISSN
    2157-9555
  • Print_ISBN
    978-1-4244-9950-2
  • Type

    conf

  • DOI
    10.1109/ICNC.2011.6022500
  • Filename
    6022500