• DocumentCode
    431599
  • Title

    Unsupervised auditory scene categorization via key audio effects and information-theoretic co-clustering

  • Author

    Cai, Rui ; Lu, Lie ; Cai, Lian-Hong

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • Volume
    2
  • fYear
    2005
  • fDate
    18-23 March 2005
  • Abstract
    Automatic categorization of auditory scenes is very useful in various content-based multimedia applications, such as video indexing and context-aware computing. An unsupervised approach is proposed to group auditory scenes with similar semantics. In our approach, auditory scenes are described by the key audio effects they contain. In order to exploit the relationships between different audio effects and provide a more accurate similarity measure for auditory scene categorization, co-clustering is used to group auditory scenes and key audio effects simultaneously. In addition, a Bayesian information criterion (BIC) is used to select cluster numbers automatically for both the key effects and the auditory scenes. Evaluation on 272 auditory scenes extracted from 12-hour audio data shows very encouraging results.
  • Keywords
    Bayes methods; audio signal processing; information theory; multimedia systems; pattern classification; pattern clustering; signal classification; Bayesian information criterion; audio effects; auditory scene categorization; content-based multimedia applications; context-aware computing; information-theoretic co-clustering; key audio effects; similarity measure; unsupervised auditory scene categorization; video indexing; Application software; Asia; Bayesian methods; Computer science; Context-aware services; Data mining; Explosions; Indexing; Layout; Multimedia computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8874-7
  • Type

    conf

  • DOI
    10.1109/ICASSP.2005.1415594
  • Filename
    1415594