• DocumentCode
    3423569
  • Title

    Informed software installation through License Agreement Categorization

  • Author

    Borg, A. ; Boldt, Martin ; Lavesson, Nils

  • Author_Institution
    Sch. of Comput., Blekinge Inst. of Technol., Karlskrona, Sweden
  • fYear
    2011
  • fDate
    15-17 Aug. 2011
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Spyware detection can be achieved by using machine learning techniques that identify patterns in the End User License Agreements (EULAs) presented by application installers. However, solutions have required manual input from the user with varying degrees of accuracy. We have implemented an automatic prototype for extraction and classification and used it to generate a large data set of EULAs. This data set is used to compare four different machine learning algorithms when classifying EULAs. Furthermore, the effect of feature selection is investigated and for the top two algorithms, we investigate optimizing the performance using parameter tuning. Our conclusion is that feature selection and performance tuning are of limited use in this context, providing limited performance gains. However, both the Bagging and the Random Forest algorithms show promising results, with Bagging reaching an AUC measure of 0.997 and a False Negative Rate of 0.062. This shows the applicability of License Agreement Categorization for realizing informed software installation.
  • Keywords
    bagging; category theory; invasive software; learning (artificial intelligence); software engineering; AUC measure; EULA; automatic prototype; bagging algorithm; data set; end user license agreement; feature selection; informed software installation; license agreement categorization; machine learning technique; parameter tuning; performance tuning; random forest algorithm; spyware detection; Licenses; Machine learning; Machine learning algorithms; Software; Spyware; Tuning; EULA analysis; Parameter tuning; Spyware; automated detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Security South Africa (ISSA), 2011
  • Conference_Location
    Johannesburg
  • Print_ISBN
    978-1-4577-1481-8
  • Type

    conf

  • DOI
    10.1109/ISSA.2011.6027539
  • Filename
    6027539