DocumentCode
2028066
Title
User-representative feature selection for keystroke dynamics
Author
Al Solami, E. ; Boyd, Colin ; Clark, Andrew ; Ahmed, Irfan
Author_Institution
Inf. Security Inst., Queensland Univ. of Technol., Brisbane, QLD, Australia
fYear
2011
fDate
6-8 Sept. 2011
Firstpage
229
Lastpage
233
Abstract
Continuous user authentication with keystroke dynamics uses characters sequences as features. Since users can type characters in any order, it is imperative to find character sequences (n-graphs) that are representative of user typing behavior. The contemporary feature selection approaches do not guarantee selecting frequently-typed features which may cause less accurate statistical user-representation. Furthermore, the selected features do not inherently reflect user typing behavior. We propose four statistical-based feature selection techniques that mitigate limitations of existing approaches. The first technique selects the most frequently occurring features. The other three consider different user typing behaviors by selecting: n-graphs that are typed quickly; n-graphs that are typed with consistent time; and n-graphs that have large time variance among users. We use Gunetti´s keystroke dataset and k-means clustering algorithm for our experiments. The results show that among the proposed techniques, the most-frequent feature selection technique can effectively find user-representative features. We further substantiate our results by comparing the most-frequent feature selection technique with three existing approaches (popular Italian words, common n-graphs, and least frequent n-graphs). We find that it performs better than the existing approaches after selecting a certain number of most-frequent n-graphs.
Keywords
authorisation; feature extraction; graph theory; message authentication; pattern clustering; statistical analysis; Gunetti keystroke dataset; Italian words; character sequences; common n-graphs; continuous user authentication; k-means clustering algorithm; keystroke dynamics; least frequent n-graphs; statistical user-representation; statistical-based feature selection techniques; user-representative feature selection; Accuracy; Authentication; Cities and towns; Clustering algorithms; Data mining; Feature extraction; 2-graphs; feature selection; keystroke dynamics;
fLanguage
English
Publisher
ieee
Conference_Titel
Network and System Security (NSS), 2011 5th International Conference on
Conference_Location
Milan
Print_ISBN
978-1-4577-0458-1
Type
conf
DOI
10.1109/ICNSS.2011.6060005
Filename
6060005
Link To Document