• DocumentCode
    2028066
  • Title

    User-representative feature selection for keystroke dynamics

  • Author

    Al Solami, E. ; Boyd, Colin ; Clark, Andrew ; Ahmed, Irfan

  • Author_Institution
    Inf. Security Inst., Queensland Univ. of Technol., Brisbane, QLD, Australia
  • fYear
    2011
  • fDate
    6-8 Sept. 2011
  • Firstpage
    229
  • Lastpage
    233
  • Abstract
    Continuous user authentication with keystroke dynamics uses characters sequences as features. Since users can type characters in any order, it is imperative to find character sequences (n-graphs) that are representative of user typing behavior. The contemporary feature selection approaches do not guarantee selecting frequently-typed features which may cause less accurate statistical user-representation. Furthermore, the selected features do not inherently reflect user typing behavior. We propose four statistical-based feature selection techniques that mitigate limitations of existing approaches. The first technique selects the most frequently occurring features. The other three consider different user typing behaviors by selecting: n-graphs that are typed quickly; n-graphs that are typed with consistent time; and n-graphs that have large time variance among users. We use Gunetti´s keystroke dataset and k-means clustering algorithm for our experiments. The results show that among the proposed techniques, the most-frequent feature selection technique can effectively find user-representative features. We further substantiate our results by comparing the most-frequent feature selection technique with three existing approaches (popular Italian words, common n-graphs, and least frequent n-graphs). We find that it performs better than the existing approaches after selecting a certain number of most-frequent n-graphs.
  • Keywords
    authorisation; feature extraction; graph theory; message authentication; pattern clustering; statistical analysis; Gunetti keystroke dataset; Italian words; character sequences; common n-graphs; continuous user authentication; k-means clustering algorithm; keystroke dynamics; least frequent n-graphs; statistical user-representation; statistical-based feature selection techniques; user-representative feature selection; Accuracy; Authentication; Cities and towns; Clustering algorithms; Data mining; Feature extraction; 2-graphs; feature selection; keystroke dynamics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Network and System Security (NSS), 2011 5th International Conference on
  • Conference_Location
    Milan
  • Print_ISBN
    978-1-4577-0458-1
  • Type

    conf

  • DOI
    10.1109/ICNSS.2011.6060005
  • Filename
    6060005