• DocumentCode
    3394206
  • Title

    Vector quantization, density estimation and outlier detection on cricket dataset

  • Author

    Parameswaran, K.

  • Author_Institution
    Dept. of Electron. & Telecommun. Eng., Univ. of Moratuwa, Moratuwa, Sri Lanka
  • fYear
    2013
  • fDate
    4-6 Jan. 2013
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This study aims to apply unsupervised machine learning algorithms on Cricket players´ career statistics dataset. K-means clustering algorithm is used to find the natural grouping that exists within the cricket players using player´s batting average, strike rate, bowling average, economy etc. as input features - in this case players are grouped into 3 groups. Further separate probability density models are fitted for batsmen, bowlers and all-rounding players using appropriate player´s performance metrics as input features and using these models, outstanding players are identified. Similar method is used to identify match winning players, where the differences between player´s performance metrics and team´s average performance metrics are used as input features. The results obtained from this study seem to correlate with expert generated results where they used point based system to rank the players. This kind of statistical analysis of sports data plays a vital role in team planning and exploiting opponents´ weakness.
  • Keywords
    estimation theory; pattern clustering; probability; sport; statistical analysis; unsupervised learning; vector quantisation; Cricket player career statistics dataset; all-rounding players; batsmen; bowlers; density estimation; input features; k-means clustering algorithm; match winning player identification; opponent weakness exploitation; outlier detection; player natural grouping; player performance metrics; player ranking; point-based system; probability density models; sports data statistical analysis; team average performance metrics; team planning; unsupervised machine learning algorithms; vector quantization; Biological system modeling; Clustering algorithms; Computers; Engineering profession; Estimation; Informatics; Measurement; Cricket; Density estimation; K-means clustering; Outlier detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communication and Informatics (ICCCI), 2013 International Conference on
  • Conference_Location
    Coimbatore
  • Print_ISBN
    978-1-4673-2906-4
  • Type

    conf

  • DOI
    10.1109/ICCCI.2013.6466249
  • Filename
    6466249