• DocumentCode
    3438477
  • Title

    Scalable Audience Reach Estimation in Real-Time Online Advertising

  • Author

    Jalali, A. ; Kolay, Santanu ; Foldes, P. ; Dasdan, Ali

  • Author_Institution
    Turn Inc., Redwood City, CA, USA
  • fYear
    2013
  • fDate
    7-10 Dec. 2013
  • Firstpage
    629
  • Lastpage
    637
  • Abstract
    Online advertising has been introduced as one of the most efficient methods of advertising throughout the recent years. Yet, advertisers are concerned about the efficiency of their online advertising campaigns and consequently, would like to restrict their ad impressions to certain websites and/or certain groups of audience. These restrictions, known as targeting criteria, limit the reach ability for better performance. This trade-off between reach ability and performance illustrates a need for a forecasting system that can quickly predict/estimate (with good accuracy) this trade-off. Designing such a system is challenging due to (a) the huge amount of data to process, and, (b) the need for fast and accurate estimates. In this paper, we propose a distributed fault tolerant system that can generate such estimates fast with good accuracy. The main idea is to keep a small representative sample in memory across multiple machines and formulate the forecasting problem as queries against the sample. The key challenge is to find the best strata across the past data, perform multivariate stratified sampling while ensuring fuzzy fall-back to cover the small minorities. Our results show a significant improvement over the uniform and simple stratified sampling strategies which are currently widely used in the industry.
  • Keywords
    advertising data processing; distributed processing; fault tolerant computing; fuzzy set theory; sampling methods; Web sites; advertising method; audience groups; data processing; distributed fault tolerant system; forecasting system; fuzzy fall-back; multivariate stratified sampling; online advertising campaigns; reachability-performance trade-off; realtime online advertising; scalable audience reach estimation; targeting criteria; Accuracy; Advertising; Forecasting; Information management; Nickel; Radiation detectors; Vectors; Algorithmic Advertising; Distributed Systems; Fuzzy Fall-back Scheme; High-Dimensional Forecasting; Stratied Sampling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
  • Conference_Location
    Dallas, TX
  • Print_ISBN
    978-1-4799-3143-9
  • Type

    conf

  • DOI
    10.1109/ICDMW.2013.164
  • Filename
    6753979