• DocumentCode
    3211389
  • Title

    Topic extraction from news archive using TF*PDF algorithm

  • Author

    Bun, Khoo Khyou ; Ishizuka, Mitsuru

  • Author_Institution
    Dept. of Inf. & Commun. Eng., Univ. of Tokyo, Japan
  • fYear
    2002
  • fDate
    12-14 Dec. 2002
  • Firstpage
    73
  • Lastpage
    82
  • Abstract
    Since the Web became widespread, the amount of electronically available information online, especially news archives, has proliferated and threatens to become overwhelming. We propose an information system that will extract main topics in a news archive on a weekly basis. By obtaining a weekly report, a user can know what the main news events were in the past week.
  • Keywords
    information resources; information retrieval systems; TF*PDF algorithm; Web; information system; news archive; topic extraction; weekly report; Algorithm design and analysis; Broadcasting; Content based retrieval; Data mining; Event detection; Frequency; Humans; Information retrieval; Information systems; Intelligent systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Information Systems Engineering, 2002. WISE 2002. Proceedings of the Third International Conference on
  • Print_ISBN
    0-7695-1766-8
  • Type

    conf

  • DOI
    10.1109/WISE.2002.1181645
  • Filename
    1181645