• DocumentCode
    3516242
  • Title

    Spam detection in voice-over-IP calls through semi-supervised clustering

  • Author

    Wu, Yu-Sung ; Bagchi, Saurabh ; Singh, Navjot ; Wita, Ratsameetip

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN, USA
  • fYear
    2009
  • fDate
    June 29 2009-July 2 2009
  • Firstpage
    307
  • Lastpage
    316
  • Abstract
    In this paper, we present an approach for detection of spam calls over IP telephony called SPIT in VoIP systems. SPIT detection is different from spam detection in email in that the process has to be soft real-time, fewer features are available for examination due to the difficulty of mining voice traffic at runtime, and similarity in signaling traffic between legitimate and malicious callers. Our approach differs from existing work in its adaptability to new environments without the need for laborious and error-prone manual parameter configuration. We use clustering based on the call parameters, using optional user feedback for some calls, which they mark as SPIT or non-SPIT. We improve on a popular algorithm for semi-supervised learning, called MPCK-Means, to make it scalable to a large number of calls and operate at runtime. Our evaluation on captured call traces shows a fifteen fold reduction in computation time, with improvement in detection accuracy.
  • Keywords
    Internet telephony; data mining; learning (artificial intelligence); pattern classification; pattern clustering; telecommunication traffic; unsolicited e-mail; VoIP system; data classification; semi supervised learning; semi-supervised clustering; spam detection; voice traffic mining; voice-over-IP telephony call; Clustering algorithms; Feedback; Filtering; Internet telephony; Machine learning; Protocols; Runtime; Semisupervised learning; Signal processing; Unsolicited electronic mail; Voice-over-IP systems; clustering; semisupervised learning; spam detection; spit detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable Systems & Networks, 2009. DSN '09. IEEE/IFIP International Conference on
  • Conference_Location
    Lisbon
  • Print_ISBN
    978-1-4244-4422-9
  • Electronic_ISBN
    978-1-4244-4421-2
  • Type

    conf

  • DOI
    10.1109/DSN.2009.5270323
  • Filename
    5270323