• DocumentCode
    3340150
  • Title

    Attention-Based Document Classifier Learning

  • Author

    Buscher, Georg ; Dengel, Andreas

  • Author_Institution
    Dept. for Knowledge-Based Syst., Univ. of Kaiserslautern, Kaiserslautern
  • fYear
    2008
  • fDate
    16-19 Sept. 2008
  • Firstpage
    87
  • Lastpage
    94
  • Abstract
    We describe an approach for creating precise personalized document classifiers based on the user´s attention. The general idea is to observe which parts of a document the user was interested in just before he or she comes to a classification decision. Having information about this manual classification decision and the document parts the decision was based on, we can learn precise classifiers. For observing the user´s focus point of attention we use an unobtrusive eye tracking device and apply an algorithm for reading behavior detection. On this basis, we can extract terms characterizing the text parts interesting to the user and employ them for describing the class the document was assigned to by the user. Having learned classifiers in that way, new documents can be classified automatically using techniques of passage-based retrieval. We prove the very strong improvement of incorporating the user´s visual attention by a case study that evaluates an attention-based term extraction method.
  • Keywords
    classification; human factors; information retrieval; learning (artificial intelligence); text analysis; attention-based document classifier learning; behavior detection; classification decision; passage-based retrieval; personalized document classifier; unobtrusive eye tracking device; user visual attention; Bayesian methods; Data analysis; Data mining; Knowledge based systems; Knowledge management; Machine learning; Support vector machine classification; Support vector machines; Text analysis; Text recognition; attention; document classification; learning; reading detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems, 2008. DAS '08. The Eighth IAPR International Workshop on
  • Conference_Location
    Nara
  • Print_ISBN
    978-0-7695-3337-7
  • Type

    conf

  • DOI
    10.1109/DAS.2008.36
  • Filename
    4669949