DocumentCode :
1608735
Title :
Identifying and tracking online financial services through web mining and latent semantic indexing
Author :
Bernard, Kristen ; Cassidy, Andrew ; Clark, Monica ; Liu, Kevin ; Lobaton, Katrina ; McNeill, Drew ; Brown, Donald
Author_Institution :
Dept. of Syst. & Inf. Eng., Univ. of Virginia, Charlottesville, VA, USA
fYear :
2011
Firstpage :
158
Lastpage :
163
Abstract :
As Internet usage has increased heavily in recent years, money launderers have started to take advantage of Online Financial Transaction (OFT) services to facilitate their money laundering activities. However, law enforcement has struggled to understand and detect the OFT services that criminals use for money laundering. To assist law enforcement in its efforts to identify and monitor OFT services, we have designed the Online Financial Transaction Services Identification Tool (OFTSIT), which crawls the Internet and determines the probability that each website it finds is an OFT service. OFTSIT analyzes a website's content and extracts textual features using latent semantic indexing (LSI). LSI is a text mining approach that can extract a small number (< 10) of features from more than 40,000 possible words on a website. OFTSIT feeds the LSI-discovered features into a generalized linear model to produce the probability that a website is an OFT service. Testing showed that OFTSIT outperforms the current method of manual searching. This paper describes the system architecture, the algorithms employed to distinguish OFT services from other websites, and the performance testing that demonstrates OFTSIT's operational relevance.
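The pipeline the abstract describes (word counts, then a few LSI features, then a generalized linear model that outputs a probability) can be sketched roughly as below. This is only an illustration under stated assumptions, not the authors' implementation: the toy corpus, every function name, and the choice of k = 2 features are hypothetical; logistic regression stands in as one common generalized linear model; and power iteration on the document Gram matrix replaces a library SVD so the sketch needs only the standard library.

```python
import math
import re

# Hypothetical toy corpus: label 1 = OFT service, 0 = any other website.
DOCS = [
    ("send money online transfer funds account payment", 1),
    ("transfer funds instantly payment account exchange currency", 1),
    ("exchange digital currency send payment online wallet", 1),
    ("recipe chicken soup kitchen cooking dinner", 0),
    ("football match score team league season", 0),
    ("cooking dinner recipe pasta kitchen team", 0),
]

def tokenize(text):
    return re.findall(r"[a-z]+", text.lower())

def vectorize(text, vocab):
    """Raw term counts over a fixed vocabulary; unknown words are ignored."""
    vec = [0.0] * len(vocab)
    for word in tokenize(text):
        if word in vocab:
            vec[vocab[word]] += 1.0
    return vec

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def top_eigenpairs(gram, k, iters=500):
    """Top-k eigenpairs of a symmetric PSD matrix (power iteration + deflation)."""
    n = len(gram)
    g = [row[:] for row in gram]
    pairs = []
    for _ in range(k):
        v = [1.0 + 0.1 * i for i in range(n)]  # deterministic start vector
        for _ in range(iters):
            w = [dot(row, v) for row in g]
            norm = math.sqrt(dot(w, w))
            v = [x / norm for x in w]
        lam = dot(v, [dot(row, v) for row in g])
        pairs.append((lam, v))
        for i in range(n):  # remove the component just found
            for j in range(n):
                g[i][j] -= lam * v[i] * v[j]
    return pairs

def lsi_features(text, model):
    """Fold a document into the k-dimensional LSI space.

    With A = U S V^T (rows are documents), a count vector d maps to
    d V_k = (d A^T) U_k S_k^{-1}, where S_k holds sqrt(eigenvalue of A A^T).
    """
    d = vectorize(text, model["vocab"])
    sims = [dot(d, row) for row in model["rows"]]
    return [dot(sims, u) / math.sqrt(lam) for lam, u in model["pairs"]]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(X, y, lr=0.1, epochs=2000):
    """Logistic regression (a GLM with logit link) by batch gradient descent."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(dot(w, xi) + b) - yi
            gw = [g + err * xj for g, xj in zip(gw, xi)]
            gb += err
        w = [wj - lr * g / len(X) for wj, g in zip(w, gw)]
        b -= lr * gb / len(X)
    return w, b

def train(docs, k=2):
    texts = [t for t, _ in docs]
    labels = [y for _, y in docs]
    words = sorted({w for t in texts for w in tokenize(t)})
    vocab = {w: i for i, w in enumerate(words)}
    rows = [vectorize(t, vocab) for t in texts]
    gram = [[dot(a, b) for b in rows] for a in rows]  # A A^T
    model = {"vocab": vocab, "rows": rows, "pairs": top_eigenpairs(gram, k)}
    X = [lsi_features(t, model) for t in texts]
    model["w"], model["b"] = fit_logistic(X, labels)
    return model

def predict_proba(text, model):
    """Estimated probability that the page text belongs to an OFT service."""
    return sigmoid(dot(model["w"], lsi_features(text, model)) + model["b"])

model = train(DOCS, k=2)
p_oft = predict_proba("pay online and transfer funds to any account", model)
p_other = predict_proba("chicken pasta recipe for dinner", model)
print(round(p_oft, 3), round(p_other, 3))
```

On a real crawl the vocabulary would run to tens of thousands of words, so a sparse truncated SVD from a numerical library would be the practical choice; the fold-in step and the final probability model would keep the same shape.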
Keywords :
Internet; Web sites; data mining; feature extraction; financial data processing; fraud; indexing; law; transaction processing; Internet usage; OFT service; OFTSIT analysis; Web mining; Web site content; latent semantic indexing; law enforcement; money laundering; online financial transaction services identification tool; system architecture; textual feature extraction; Indexes; Large scale integration; Law enforcement; Logistics; Text mining; Training
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Systems and Information Engineering Design Symposium (SIEDS), 2011 IEEE
Conference_Location :
Charlottesville, VA
Print_ISBN :
978-1-4577-0446-8
Type :
conf
DOI :
10.1109/SIEDS.2011.5876870
Filename :
5876870