• DocumentCode
    3524365
  • Title

    Spatial extension of the Reality Mining Dataset

  • Author

    Ficek, Michal ; Kencl, Lukas

  • Author_Institution
    R & D Centre for Mobile Applic., Czech Tech. Univ. in Prague, Prague, Czech Republic
  • fYear
    2010
  • fDate
    8-12 Nov. 2010
  • Firstpage
    666
  • Lastpage
    673
  • Abstract
    Data captured from a live cellular network with the real users during their common daily routine help to understand how the users move within the network. Unlike the simulations with limited potential or expensive experimental studies, the research in user-mobility or spatio-temporal user behavior can be conducted on publicly available datasets such as the Reality Mining Dataset. These data have been for many years a source of valuable information about social interconnection between users and user-network associations. However, an important, spatial dimension is missing in this dataset. In this paper, we present a methodology for retrieving geographical locations matching the GSM cell identifiers in the Reality Mining Dataset, an approach base on querying the Google Location API. A statistical analysis of the measure of success of locations retrieval is provided. Further, we present the LAC-clustering method for detecting and removing outliers, a heuristic extension of general agglomerative hierarchical clustering. This methodology enables further, previously impossible analysis of the Reality Mining Dataset, such as studying user mobility patterns, describing spatial trajectories and mining the spatio-temporal data.
  • Keywords
    application program interfaces; cellular radio; data mining; mobility management (mobile radio); pattern clustering; query processing; search engines; spatiotemporal phenomena; statistical analysis; GSM cell identifier; Google location API; LAC-clustering method; agglomerative hierarchical clustering; cellular network; geographical location matching; geographical location retrieval; outlier detection; query processing; reality mining dataset; spatiotemporal user behavior; statistical analysis; user-mobility; Computer architecture; Data mining; Databases; Google; Mobile communication; Mobile computing; Poles and towers; Cell-ID; GSM; Reality Mining; agglomerative clustering; mobility; tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mobile Adhoc and Sensor Systems (MASS), 2010 IEEE 7th International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    2155-6806
  • Print_ISBN
    978-1-4244-7488-2
  • Type

    conf

  • DOI
    10.1109/MASS.2010.5663788
  • Filename
    5663788