  • DocumentCode
    2289286
  • Title
    New kernels for analyzing multimodal data in multimedia using kernel machines
  • Author
    Aradhye, Hrishikesh ; Dorai, Chitra
  • Author_Institution
    SRI Int., Menlo Park, CA, USA
  • Volume
    2
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    37
  • Abstract
    Research in automated analysis of digital media content has led to a large collection of low-level feature extractors, such as face detectors, videotext extractors, speech and speaker identifiers, people/vehicle trackers, and event locators. These media metadata are often symbolic rather than continuous-valued, and pose significant difficulty to subsequent tasks such as classification and dimensionality reduction which traditionally deal with continuous-valued data. This paper proposes a novel mechanism that extends tasks traditionally limited to continuous-valued feature spaces, such as (a) dimensionality reduction, (b) de-noising, and (c) clustering, to domains with symbolic features. To this end, we introduce new kernels based on well-known distance metrics, and prove Mercer validity of these kernels for analyzing symbolic feature spaces. We demonstrate their usefulness within the context of kernel-space methods such as Kernel PCA and SVM, in classifying machine learning datasets from the UCI repository and in temporal clustering and tracking of videotext in multimedia. We show that the generalized kernels help capture information from symbolic feature spaces, visualize symbolic data, and aid tasks such as classification and clustering, and therefore are useful in multimodal analysis of multimedia.
  • Keywords
    data analysis; data visualisation; feature extraction; learning (artificial intelligence); learning automata; multimedia databases; pattern clustering; principal component analysis; Kernel PCA; Mercer validity; SVM; UCI repository; automated analysis; classification; de-noising; dimensionality reduction; distance metrics; kernel machines; machine learning datasets; multimedia; multimodal analysis; multimodal data analysis; symbolic data visualization; symbolic feature spaces; temporal clustering; videotext tracking; Data analysis; Data mining; Detectors; Event detection; Face detection; Feature extraction; Kernel; Speech analysis; Vehicle detection; Vehicles;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7803-7304-9
  • Type
    conf
  • DOI
    10.1109/ICME.2002.1035368
  • Filename
    1035368