• DocumentCode
    3038680
  • Title

    Document warehousing: a document-intensive application of a multimedia database

  • Author

    Ishikawa, Hiroshi ; Ohta, Manabu ; Kato, Koki

  • Author_Institution
    Tokyo Metropolitan Univ., Japan
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    25
  • Lastpage
    31
  • Abstract
    Nowadays, structured data such as sales are stored in data warehouses for decision-making. Less-structured data such as HTML texts, XML data, images, and videos are increasingly accumulated in PC storage due to the spread of the Internet technology such as WWW. Such less-structured data, collectively called multimedia documents, are also precious as corporate assets. So we need to provide a document warehouse to analyze and manage multimedia documents for corporate-wide information mining and reuse like a data warehouse. As a document-intensive application of a multimedia database, we describe a prototype document warehouse system, which supports management of documents, keyword-based and content-based retrieval, rule-based classification, SOM-based clustering and XML active query facility based on ECA rules
  • Keywords
    Internet; content-based retrieval; data mining; data warehouses; document handling; hypermedia markup languages; multimedia databases; visual databases; ECA rules; HTML; Internet; SOM-based clustering; World Wide Web; XML; active query facility; content-based retrieval; corporate-wide information mining; data warehouses; document management; document warehouse; image database; keyword-based retrieval; multimedia database; rule-based classification; structured data; video database; Data warehouses; Decision making; HTML; Image storage; Internet; Marketing and sales; Multimedia databases; Videos; Warehousing; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Research Issues in Data Engineering, 2001. Proceedings. Eleventh International Workshop on
  • Conference_Location
    Heidelberg
  • Print_ISBN
    0-7695-0957-6
  • Type

    conf

  • DOI
    10.1109/RIDE.2001.916488
  • Filename
    916488