• DocumentCode
    528521
  • Title

    An exploratory analysis of the novelty of a news Web site

  • Author

    Calzarossa, Maria Carla ; Tessera, Daniele

  • Author_Institution
    Dipt. di Inf. e Sist., Univ. di Pavia, Pavia, Italy
  • fYear
    2010
  • fDate
    11-14 July 2010
  • Firstpage
    399
  • Lastpage
    404
  • Abstract
    The growing amount of information published on the Web, combined with its dynamic nature, opens many challenging issues dealing with management and retrieval of the information and provisioning of the underlying infrastructures. Search engines have to meet two conflicting requirements: minimize the number of downloads and provide up-to-date information. In this paper, we present the results of an exploratory analysis aimed at investigating the novelty of the content of a news Web site. We analyzed the Web site from an horizontal perspective by focusing on the content of the individual articles and from a vertical perspective by focusing on the entire collection of articles published on the site. These two perspectives allowed us to study how fast and to what extent articles were modified and to model the evolution of the Web site.
  • Keywords
    Web sites; electronic publishing; information management; information retrieval; search engines; Web site; article publishing; exploratory analysis; information retrieval; search engines; HTML; Markov processes; Monitoring; Multimedia communication; Streaming media; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Performance Evaluation of Computer and Telecommunication Systems (SPECTS), 2010 International Symposium on
  • Conference_Location
    Ottawa, ON
  • Print_ISBN
    978-1-56555-340-8
  • Type

    conf

  • Filename
    5589007