Abstract :
Recent emergence of diverse services have led to explosive traffic growth in cellular data networks. Understanding the service dynamics in large cellular networks is important for network design, trouble shooting, quality of service (QoE) support, and resource allocation. In this paper, we present our study to reveal the distributions and temporal patterns of different services in cellular data network from two different perspectives, namely service request times and service duration. Our study is based on big traffic data, which is parsed to readable records by our Hadoop-based packet parsing platform, captured over a week-long period from a tier-1 mobile operator´s network in China. We propose a Zipf´s ranked model to characterize the distributions of traffic volume, packet, request times and duration of cellular services. Two-stage method (Self-Organizing Map combined with kmeans) is first used to cluster time series of service into four request patterns and three duration patterns. These seven patterns are combined together to better understand the fine-grained temporal patterns of service in cellular network. Results of our distribution models and temporal patterns present cellular network operators with a better understanding of the request and duration characteristics of service, which of great importance in network design, service generation and resource allocation.
Keywords :
cellular radio; data handling; parallel processing; resource allocation; self-organising feature maps; telecommunication traffic; time series; China; Hadoop-based packet parsing platform; Zipf´s ranked model; cellular data network; cellular services; cluster time series; fine-grained temporal patterns; readable records; resource allocation; self-organizing map; service duration; service request times; temporal pattern mining; temporal pattern modeling; tier-1 mobile operator network; traffic data; traffic volume; Big data; Computer architecture; Data mining; Measurement; Mobile communication; Mobile computing; Solid modeling; SOM cluster; big data; cellular network; data mining; hadoop; service;