• DocumentCode
    2432762
  • Title

    Summarization of meetings using word clouds

  • Author

    De Hollander, Gilles ; Marx, Maarten

  • Author_Institution
    Inf. Inst., Univ. of Amsterdam, Amsterdam, Netherlands
  • fYear
    2011
  • fDate
    15-16 June 2011
  • Firstpage
    54
  • Lastpage
    61
  • Abstract
    In this study parsimonious language models were used to construct word clouds of the proceedings of the European Parliament. Multiple design choices had to be made and are discussed. Important features are stemming during tokenization, including bigrams into the word cloud and multilingualism. Also, the original parsimonious language models were extended with an additional term dampening unigrams that already occurred in the word cloud. This algorithm was tested in a small user study, using proceedings of the University of Amsterdam Science faculty´s student council. Members of this council had to give their preference for multiple word clouds constructed using either parsimonious language models or simple Term Frequencies (TF) with stop words. 68% over 29% (p <;60; 0.05, two-tailed paired t-test) preferred the word clouds constructed using parsimonious language models. Beside the system design, further technical findings, the social significance of applying word clouds to political data and possibilities for future work are discussed.
  • Keywords
    government data processing; linguistics; text analysis; word processing; European Parliament; University of Amsterdam Science faculty student council; meeting summarization; parsimonious language models; term dampening unigrams; term frequencies; text summarization; two-tailed paired t-test; word cloud construction; Cloud computing; Computational modeling; Europe; Navigation; Semantics; Tag clouds; Open Government Data; Text summarization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Software Engineering (CSSE), 2011 CSI International Symposium on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-61284-206-6
  • Type

    conf

  • DOI
    10.1109/CSICSSE.2011.5963995
  • Filename
    5963995