DocumentCode
2935779
Title
Filtering adult image content with topic models
Author
Lienhart, Rainer ; Hauke, Rudolf
Author_Institution
Lehrstuhl fur Multimedia Comput., Univ. Augsburg, Augsburg, Germany
fYear
2009
fDate
June 28 2009-July 3 2009
Firstpage
1472
Lastpage
1475
Abstract
Protecting children from exposure to adult content has become a serious problem in the real world. Current statistics show that, for instance, the average age of first Internet exposure to pornography is 11 years, that the largest consumer group of Internet pornography is the age group of 12-to-17- year-olds and that 90% of the 8-to-16-year-olds have viewed porn online. To protect our children, effective algorithms for detecting adult images are needed. In this research we evaluate the use of probabilistic Latent Semantic Analysis (pLSA) for this task. We will show that topic models based on pLSA can detect adult content with a correct positive rate of 92.7%, while only showing off a false positive rate of 1.9%. Even when using grayscale images only, a correct positive rate of 90.8% at a false positive rate of 2% can be achieved.
Keywords
Internet; filtering theory; image classification; probability; statistical analysis; Internet pornography; adult image content filtering; adult image content recognition; adult image detection; grayscale image classification; pLSA; probabilistic latent semantic analysis; statistical analysis; Feature extraction; Gray-scale; Image classification; Information filtering; Information filters; Internet; Pixel; Protection; Skin; Vocabulary; adult image content recognition; image classification; porn image detection; topic models;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location
New York, NY
ISSN
1945-7871
Print_ISBN
978-1-4244-4290-4
Electronic_ISBN
1945-7871
Type
conf
DOI
10.1109/ICME.2009.5202781
Filename
5202781
Link To Document