• DocumentCode
    2185261
  • Title

    Multi-faceted information retrieval system for large scale email archives

  • Author

    Perkiö, Jukka ; Tuulos, Ville ; Buntine, Wray ; Tirri, Henry

  • Author_Institution
    Helsinki Inst. for Inf. Technol., Finland
  • fYear
    2005
  • fDate
    19-22 Sept. 2005
  • Firstpage
    557
  • Lastpage
    564
  • Abstract
    We profile a system for search and analysis of large-scale email archives. The system builds around four facets: content-based search engine, statistical topic model, automatically inferred social networks, and time-series analysis. The facets correspond to the types of information available in email data. The presented system allows chaining or combining the facets flexibly. Results of one facet may be used as input to another yielding remarkable combinatorial power. In information retrieval point of view, the system provides support for exploration, approximate textual searches and data visualization. We present some experimental results based on a large real-world email corpus.
  • Keywords
    content-based retrieval; electronic mail; information retrieval; search engines; statistical analysis; time series; content-based search engine; data visualization; large scale email archive; multifaceted information retrieval system; social network; statistical topic model; textual search; time-series analysis; Data mining; Data visualization; Information retrieval; Information technology; Large-scale systems; Power system modeling; Search engines; Social network services; Time series analysis; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence, 2005. Proceedings. The 2005 IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2415-X
  • Type

    conf

  • DOI
    10.1109/WI.2005.103
  • Filename
    1517908