DocumentCode :
2028066
Title :
User-representative feature selection for keystroke dynamics
Author :
Al Solami, E. ; Boyd, Colin ; Clark, Andrew ; Ahmed, Irfan
Author_Institution :
Inf. Security Inst., Queensland Univ. of Technol., Brisbane, QLD, Australia
fYear :
2011
fDate :
6-8 Sept. 2011
Firstpage :
229
Lastpage :
233
Abstract :
Continuous user authentication with keystroke dynamics uses characters sequences as features. Since users can type characters in any order, it is imperative to find character sequences (n-graphs) that are representative of user typing behavior. The contemporary feature selection approaches do not guarantee selecting frequently-typed features which may cause less accurate statistical user-representation. Furthermore, the selected features do not inherently reflect user typing behavior. We propose four statistical-based feature selection techniques that mitigate limitations of existing approaches. The first technique selects the most frequently occurring features. The other three consider different user typing behaviors by selecting: n-graphs that are typed quickly; n-graphs that are typed with consistent time; and n-graphs that have large time variance among users. We use Gunetti´s keystroke dataset and k-means clustering algorithm for our experiments. The results show that among the proposed techniques, the most-frequent feature selection technique can effectively find user-representative features. We further substantiate our results by comparing the most-frequent feature selection technique with three existing approaches (popular Italian words, common n-graphs, and least frequent n-graphs). We find that it performs better than the existing approaches after selecting a certain number of most-frequent n-graphs.
Keywords :
authorisation; feature extraction; graph theory; message authentication; pattern clustering; statistical analysis; Gunetti keystroke dataset; Italian words; character sequences; common n-graphs; continuous user authentication; k-means clustering algorithm; keystroke dynamics; least frequent n-graphs; statistical user-representation; statistical-based feature selection techniques; user-representative feature selection; Accuracy; Authentication; Cities and towns; Clustering algorithms; Data mining; Feature extraction; 2-graphs; feature selection; keystroke dynamics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Network and System Security (NSS), 2011 5th International Conference on
Conference_Location :
Milan
Print_ISBN :
978-1-4577-0458-1
Type :
conf
DOI :
10.1109/ICNSS.2011.6060005
Filename :
6060005
Link To Document :
بازگشت