DocumentCode
2459051
Title
Earlybird: Real-Time Search at Twitter
Author
Busch, Michael ; Gade, Krishna ; Larson, Brian ; Lok, Patrick ; Luckenbill, Samuel ; Lin, James
fYear
2012
fDate
1-5 April 2012
Firstpage
1360
Lastpage
1369
Abstract
The web today is increasingly characterized by social and real-time signals, which we believe represent two frontiers in information retrieval. In this paper, we present Early bird, the core retrieval engine that powers Twitter´s real-time search service. Although Early bird builds and maintains inverted indexes like nearly all modern retrieval engines, its index structures differ from those built to support traditional web search. We describe these differences and present the rationale behind our design. A key requirement of real-time search is the ability to ingest content rapidly and make it searchable immediately, while concurrently supporting low-latency, high-throughput query evaluation. These demands are met with a single-writer, multiple-reader concurrency model and the targeted use of memory barriers. Early bird represents a point in the design space of real-time search engines that has worked well for Twitter´s needs. By sharing our experiences, we hope to spur additional interest and innovation in this exciting space.
Keywords
Internet; query processing; real-time systems; search engines; social networking (online); Earlybird; Twitter; Web search; World Wide Web; information retrieval; query evaluation; real-time search; retrieval engines; Arrays; Indexing; Query processing; Real time systems; Twitter; Web search;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering (ICDE), 2012 IEEE 28th International Conference on
Conference_Location
Washington, DC
ISSN
1063-6382
Print_ISBN
978-1-4673-0042-1
Type
conf
DOI
10.1109/ICDE.2012.149
Filename
6228205
Link To Document