Title :
An efficient search method for the content-based identification of telephone-SPAM
Author :
Strobl, Julian ; Mainka, Bernhard ; Grutzek, Gary ; Knospe, Heiko
Author_Institution :
Cologne Univ. of Appl. Sci., Cologne, Germany
Abstract :
With the help of VoIP technology, large numbers of unsolicited calls can be conveniently placed and SPAM over Internet Telephony may become a major nuisance and threat. Various mitigation methods have been proposed which are mostly based on a pattern analysis of the signaling traffic. This contribution shows that an analysis of the audio content is also feasible and can provide protection against replayed calls. In order to identify similar or equal audio data, spectral features are extracted and a short and robust audio fingerprint is computed. The definition of the fingerprint is optimized for a fast index-based search. Then, the matching of telephone speech data is based on the intersection of inverted files of audio fingerprints. Furthermore, the system design of a working prototype is explained and experimental results on the recognition rate and the performance of the system are presented. It can be shown that the search method is suitable for an efficient identification of SPAM calls.
Keywords :
Internet telephony; audio signal processing; feature extraction; search problems; telecommunication traffic; unsolicited e-mail; Internet telephony; VoIP technology; audio content analysis; content-based identification; fast index-based search method; mitigation methods; pattern analysis; robust audio fingerprint; signaling traffic; spectral feature extraction; telephone speech data matching; telephone-SPAM calls; Feature extraction; Indexes; Prototypes; Robustness; Speech; Unsolicited electronic mail; Vectors; Audio Fingerprinting; Audio Search; SPAM; SPIT; Speech Identification; VoIP;
Conference_Titel :
Communications (ICC), 2012 IEEE International Conference on
Conference_Location :
Ottawa, ON
Print_ISBN :
978-1-4577-2052-9
Electronic_ISBN :
1550-3607
DOI :
10.1109/ICC.2012.6363654