• DocumentCode
    3356910
  • Title

    Real-time aggregation of Wikipedia data for visual analytics

  • Author

    Boukhelifa, Nadia ; Chevalier, Fanny ; Fekete, Jean-Daniel

  • fYear
    2010
  • fDate
    25-26 Oct. 2010
  • Firstpage
    147
  • Lastpage
    154
  • Abstract
    Wikipedia has been built to gather encyclopedic knowledge using a collaborative social process that has proved its effectiveness. However, the workload required for raising the quality and increasing the coverage of Wikipedia is exhausting the community. Based on several participatory design sessions with active Wikipedia contributors (a.k.a. Wikipedians), we have collected a set of measures related to Wikipedia activity that, if available and visualized effectively, could spare a lot of monitoring time to these Wikipedians, allowing them to focus on quality and coverage of Wikipedia instead of spending their time navigating heavily to track vandals and copyright infringements. However, most of these measures cannot be computed on the fly using the available Wikipedia API. Therefore, we have designed an open architecture called WikiReactive to compute incrementally and maintain several aggregated measures on the French Wikipedia. This aggregated data is available as a Web Service and can be used to overlay information on Wikipedia articles through Wikipedia Skins or for new services for Wikipedians or people studying Wikipedia. This article describes the architecture, its performance and some of its uses.
  • Keywords
    Web services; application program interfaces; data visualisation; query processing; French Wikipedia; Web Service; WikiReactive; Wikipedia API; Wikipedia articles; Wikipedia data; Wikipedia skins; collaborative social process; copyright infringements; encyclopedic knowledge; participatory design sessions; real-time aggregation; visual analytics; Data visualization; Databases; Electronic publishing; Encyclopedias; Internet; Measurement; Database Management [H.2.1]: Logical Design-Schema and subschema; Database Management [H.2.4]: System-Query processing; Information Storage and Retrieval [H.3.5]: Online Information Services-Web-based services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Visual Analytics Science and Technology (VAST), 2010 IEEE Symposium on
  • Conference_Location
    Salt Lake City, UT
  • Print_ISBN
    978-1-4244-9488-0
  • Electronic_ISBN
    978-1-4244-9487-3
  • Type

    conf

  • DOI
    10.1109/VAST.2010.5652896
  • Filename
    5652896