DocumentCode
3394206
Title
Vector quantization, density estimation and outlier detection on cricket dataset
Author
Parameswaran, K.
Author_Institution
Dept. of Electron. & Telecommun. Eng., Univ. of Moratuwa, Moratuwa, Sri Lanka
fYear
2013
fDate
4-6 Jan. 2013
Firstpage
1
Lastpage
5
Abstract
This study aims to apply unsupervised machine learning algorithms on Cricket players´ career statistics dataset. K-means clustering algorithm is used to find the natural grouping that exists within the cricket players using player´s batting average, strike rate, bowling average, economy etc. as input features - in this case players are grouped into 3 groups. Further separate probability density models are fitted for batsmen, bowlers and all-rounding players using appropriate player´s performance metrics as input features and using these models, outstanding players are identified. Similar method is used to identify match winning players, where the differences between player´s performance metrics and team´s average performance metrics are used as input features. The results obtained from this study seem to correlate with expert generated results where they used point based system to rank the players. This kind of statistical analysis of sports data plays a vital role in team planning and exploiting opponents´ weakness.
Keywords
estimation theory; pattern clustering; probability; sport; statistical analysis; unsupervised learning; vector quantisation; Cricket player career statistics dataset; all-rounding players; batsmen; bowlers; density estimation; input features; k-means clustering algorithm; match winning player identification; opponent weakness exploitation; outlier detection; player natural grouping; player performance metrics; player ranking; point-based system; probability density models; sports data statistical analysis; team average performance metrics; team planning; unsupervised machine learning algorithms; vector quantization; Biological system modeling; Clustering algorithms; Computers; Engineering profession; Estimation; Informatics; Measurement; Cricket; Density estimation; K-means clustering; Outlier detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Communication and Informatics (ICCCI), 2013 International Conference on
Conference_Location
Coimbatore
Print_ISBN
978-1-4673-2906-4
Type
conf
DOI
10.1109/ICCCI.2013.6466249
Filename
6466249
Link To Document