• DocumentCode
    2258632
  • Title

    Multimodal structure segmentation and analysis of music using audio and textual information

  • Author

    Cheng, Heng-Tze ; Yang, Yi-Hsuan ; Lin, Yu-Ching ; Chen, Homer H.

  • Author_Institution
    Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2009
  • fDate
    24-27 May 2009
  • Firstpage
    1677
  • Lastpage
    1680
  • Abstract
    In this paper, we present a multimodal approach to structure segmentation of music with applications to audio content analysis and music information retrieval. In particular, since lyrics contain rich information about the semantic structure of a song, our approach incorporates lyrics to overcome the existing difficulties associated with large acoustic variation in music. We further design a constrained clustering algorithm for music segmentation and evaluate its performance on commercial recordings. Experimental results show that our method can effectively detect the boundaries and the types of semantic structure of music segments.
  • Keywords
    audio signal processing; information retrieval; music; pattern clustering; text analysis; acoustic variation; audio content analysis; audio information analysis; constrained clustering algorithm; lyrics processing; multimodal structure segmentation; music information retrieval; semantic structure labeling; textual information analysis; Acoustic signal detection; Algorithm design and analysis; Bridges; Clustering algorithms; Content based retrieval; Contracts; Information analysis; Labeling; Multiple signal classification; Music information retrieval; Music; lyrics; music information; retrieval; segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 2009. ISCAS 2009. IEEE International Symposium on
  • Conference_Location
    Taipei
  • Print_ISBN
    978-1-4244-3827-3
  • Electronic_ISBN
    978-1-4244-3828-0
  • Type

    conf

  • DOI
    10.1109/ISCAS.2009.5118096
  • Filename
    5118096