DocumentCode
1641690
Title
Improving Watchlist Screening By Combining Evidence From Multiple Search Algorithms
Author
Miller, Keith J. ; Arehart, Mark D.
Author_Institution
MITRE Corp., McLean, VA
fYear
2008
Firstpage
106
Lastpage
110
Abstract
In this paper, we describe a metasearch tool resulting from experiments in aggregating the results of different name matching algorithms on a knowledge- intensive multicultural name matching task. Three retrieval engines that match Romanized names were tested on a noisy and predominantly Arabic dataset. One is based on a generic string matching algorithm; another is designed specifically for Arabic names; and the third makes use of culturally-specific matching strategies for multiple cultures. We show that even a relatively naive method for aggregating results significantly increased effectiveness over each of the individual algorithms, resulting in nearly tripling the F-score of the worst-performing algorithm included in the aggregate, and in a 6 point improvement in F-score over the single best-performing algorithm included.
Keywords
national security; search engines; string matching; Arabic names; Romanized names; generic string matching algorithm; metasearch tool; multiple search algorithms; name matching; watchlist screening improvement; Aggregates; Algorithm design and analysis; Cultural differences; Databases; Engines; Government; Information retrieval; Metasearch; Testing; Writing;
fLanguage
English
Publisher
ieee
Conference_Titel
Technologies for Homeland Security, 2008 IEEE Conference on
Conference_Location
Waltham, MA
Print_ISBN
978-1-4244-1977-7
Electronic_ISBN
978-1-4244-1978-4
Type
conf
DOI
10.1109/THS.2008.4534432
Filename
4534432
Link To Document