مرکز منطقه ای اطلاع رساني علوم و فناوري - User-representative feature selection for keystroke dynamics

DocumentCode :

2028066

Title :

User-representative feature selection for keystroke dynamics

Author :

Al Solami, E. ; Boyd, Colin ; Clark, Andrew ; Ahmed, Irfan

Author_Institution :

Inf. Security Inst., Queensland Univ. of Technol., Brisbane, QLD, Australia

fYear :

2011

fDate :

6-8 Sept. 2011

Firstpage :

229

Lastpage :

233

Abstract :

Continuous user authentication with keystroke dynamics uses characters sequences as features. Since users can type characters in any order, it is imperative to find character sequences (n-graphs) that are representative of user typing behavior. The contemporary feature selection approaches do not guarantee selecting frequently-typed features which may cause less accurate statistical user-representation. Furthermore, the selected features do not inherently reflect user typing behavior. We propose four statistical-based feature selection techniques that mitigate limitations of existing approaches. The first technique selects the most frequently occurring features. The other three consider different user typing behaviors by selecting: n-graphs that are typed quickly; n-graphs that are typed with consistent time; and n-graphs that have large time variance among users. We use Gunetti´s keystroke dataset and k-means clustering algorithm for our experiments. The results show that among the proposed techniques, the most-frequent feature selection technique can effectively find user-representative features. We further substantiate our results by comparing the most-frequent feature selection technique with three existing approaches (popular Italian words, common n-graphs, and least frequent n-graphs). We find that it performs better than the existing approaches after selecting a certain number of most-frequent n-graphs.

Keywords :

authorisation; feature extraction; graph theory; message authentication; pattern clustering; statistical analysis; Gunetti keystroke dataset; Italian words; character sequences; common n-graphs; continuous user authentication; k-means clustering algorithm; keystroke dynamics; least frequent n-graphs; statistical user-representation; statistical-based feature selection techniques; user-representative feature selection; Accuracy; Authentication; Cities and towns; Clustering algorithms; Data mining; Feature extraction; 2-graphs; feature selection; keystroke dynamics;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Network and System Security (NSS), 2011 5th International Conference on

Conference_Location :

Milan

Print_ISBN :

978-1-4577-0458-1

Type :

conf

DOI :

10.1109/ICNSS.2011.6060005

Filename :

6060005

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2028066