• DocumentCode
    1641690
  • Title

    Improving Watchlist Screening By Combining Evidence From Multiple Search Algorithms

  • Author

    Miller, Keith J. ; Arehart, Mark D.

  • Author_Institution
    MITRE Corp., McLean, VA
  • fYear
    2008
  • Firstpage
    106
  • Lastpage
    110
  • Abstract
    In this paper, we describe a metasearch tool resulting from experiments in aggregating the results of different name matching algorithms on a knowledge- intensive multicultural name matching task. Three retrieval engines that match Romanized names were tested on a noisy and predominantly Arabic dataset. One is based on a generic string matching algorithm; another is designed specifically for Arabic names; and the third makes use of culturally-specific matching strategies for multiple cultures. We show that even a relatively naive method for aggregating results significantly increased effectiveness over each of the individual algorithms, resulting in nearly tripling the F-score of the worst-performing algorithm included in the aggregate, and in a 6 point improvement in F-score over the single best-performing algorithm included.
  • Keywords
    national security; search engines; string matching; Arabic names; Romanized names; generic string matching algorithm; metasearch tool; multiple search algorithms; name matching; watchlist screening improvement; Aggregates; Algorithm design and analysis; Cultural differences; Databases; Engines; Government; Information retrieval; Metasearch; Testing; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Technologies for Homeland Security, 2008 IEEE Conference on
  • Conference_Location
    Waltham, MA
  • Print_ISBN
    978-1-4244-1977-7
  • Electronic_ISBN
    978-1-4244-1978-4
  • Type

    conf

  • DOI
    10.1109/THS.2008.4534432
  • Filename
    4534432