• DocumentCode
    2459051
  • Title

    Earlybird: Real-Time Search at Twitter

  • Author

    Busch, Michael ; Gade, Krishna ; Larson, Brian ; Lok, Patrick ; Luckenbill, Samuel ; Lin, James

  • fYear
    2012
  • fDate
    1-5 April 2012
  • Firstpage
    1360
  • Lastpage
    1369
  • Abstract
    The web today is increasingly characterized by social and real-time signals, which we believe represent two frontiers in information retrieval. In this paper, we present Early bird, the core retrieval engine that powers Twitter´s real-time search service. Although Early bird builds and maintains inverted indexes like nearly all modern retrieval engines, its index structures differ from those built to support traditional web search. We describe these differences and present the rationale behind our design. A key requirement of real-time search is the ability to ingest content rapidly and make it searchable immediately, while concurrently supporting low-latency, high-throughput query evaluation. These demands are met with a single-writer, multiple-reader concurrency model and the targeted use of memory barriers. Early bird represents a point in the design space of real-time search engines that has worked well for Twitter´s needs. By sharing our experiences, we hope to spur additional interest and innovation in this exciting space.
  • Keywords
    Internet; query processing; real-time systems; search engines; social networking (online); Earlybird; Twitter; Web search; World Wide Web; information retrieval; query evaluation; real-time search; retrieval engines; Arrays; Indexing; Query processing; Real time systems; Twitter; Web search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2012 IEEE 28th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4673-0042-1
  • Type

    conf

  • DOI
    10.1109/ICDE.2012.149
  • Filename
    6228205