• DocumentCode
    2690597
  • Title

    Could Automatic Metadata Generation be a digital solution for speedier and easier document publishing?

  • Author

    Edvardsen, Lars Fredrik Høimyr ; Sølvberg, Ingeborg Torvik

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Norwegian Univ. of Sci. & Technol., Trondheim, Norway
  • fYear
    2010
  • fDate
    13-16 April 2010
  • Firstpage
    216
  • Lastpage
    221
  • Abstract
    Enabling efficient retrieval and re-usage of digital documents is a major challenge as many documents on the Internet and on Intranets are poorly described with metadata. Manual generation of quality metadata requires skilled human resources, is costly and time-consuming. As a result, metadata related to the documents are too often insufficient or even incorrect. Automatic Metadata Generation (AMG) algorithms could perform similar metadata generation efforts in seconds without the need for human efforts. Submission of conference proceedings commonly includes specifying an extensive range of metadata. Conference proceedings are based on a specific document template with strict usage regulations making them a prime candidate for AMG efforts. This paper evaluates usage of AMG to generate metadata from papers based the MS Word-based IEEE & ACM conference proceedings templates. This enables this research to evaluate if the templates enable efficient AMG efforts, and if the desired paper content is actually retrieved. As authors might not see value in complying with the templates, actual document content can differ from the template specifications.
  • Keywords
    data handling; information retrieval; meta data; Internet; Intranets; automatic metadata generation; document publishing; document re-usage; document retrieval; Algorithm design and analysis; Conferences; Ecosystems; Filling; Humans; Manuals; Visualization; algorithms; component; experimentation; reliability; verification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Ecosystems and Technologies (DEST), 2010 4th IEEE International Conference on
  • Conference_Location
    Dubai
  • ISSN
    2150-4938
  • Print_ISBN
    978-1-4244-5551-5
  • Type

    conf

  • DOI
    10.1109/DEST.2010.5610645
  • Filename
    5610645