Title :
A parallel accelerator for semantic search
Author :
Majumdar, Abhinandan ; Cadambi, Srihari ; Chakradhar, Srimat T. ; Graf, Hans Peter
Author_Institution :
NEC Labs. America, Inc., Princeton, NJ, USA
Abstract :
Semantic text analysis is a technique used in advertisement placement, cognitive databases and search engines. With increasing amounts of data and stringent response-time requirements, improving the underlying implementation of semantic analysis becomes critical. To this end, we look at Supervised Semantic Indexing (SSI), a recently proposed algorithm for semantic analysis. SSI ranks a large number of documents based on their semantic similarity to a text query. For each query, it computes millions of dot products on unstructured data, generates a large intermediate result, and then performs ranking. SSI underperforms on both state-of-the-art multi-cores as well as GPUs. Its performance scalability on multi-cores is hampered by their limited support for fine-grained data parallelism. GPUs, though beat multi-cores by running thousands of threads, cannot handle large intermediate data because of their small on-chip memory. Motivated by this, we present an FPGA-based hardware accelerator for semantic analysis. As a key feature, the accelerator combines hundreds of simple processing elements together with in-memory processing to simultaneously generate and process (consume) the large intermediate data. It also supports “dynamic parallelism” - a feature that configures the PEs differently for full utilization of the available processin logic after the FPGA is programmed. Our FPGA prototype is 10-13x faster than a 2.5 GHz quad-core Xeon, and 1.5-5x faster than a 240 core 1.3 GHz Tesla GPU, despite operating at a modest frequency of 125 MHz.
Keywords :
computer graphic equipment; coprocessors; field programmable gate arrays; indexing; multi-threading; multiprocessing systems; pattern matching; query processing; storage management; text analysis; FPGA based hardware accelerator; GPU; advertisement placement; cognitive database; fine grained data parallelism; in-memory processing; multicores processor; on-chip memory; parallel accelerator; search engine; semantic search; semantic similarity; semantic text analysis; supervised semantic indexing; text query; unstructured; unstructured data; Arrays; Digital signal processing; Field programmable gate arrays; Parallel processing; Random access memory; Semantics; System-on-a-chip;
Conference_Titel :
Application Specific Processors (SASP), 2011 IEEE 9th Symposium on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4577-1212-8
DOI :
10.1109/SASP.2011.5941090