DocumentCode :
2565651
Title :
A Bayesian Approach for Text Filter on 3G Network
Author :
Huang Jie ; Huang Bei ; Pu Wenjing
Author_Institution :
Sch. of Inf. Sci. & Eng., Southeast Univ., Nanjing, China
fYear :
2010
fDate :
23-25 Sept. 2010
Firstpage :
1
Lastpage :
5
Abstract :
With the high-spread of 3rd Generation Mobile Communication Technology, on 3G network, the number of junk information has increased rapidly. Much pornographic and junk information have flooded 3G network and become a serious social problem. It becomes necessary to filter all exchanged information quickly and efficiently. An improved Bayesian filtering algorithm is proposed to classify text messages in this article, which is called the double threshold Bayesian algorithm based on minimum risk (DTBA). The method utilizes Document Frequency (DF) to select feature words. Due to the high precision rate and the low error rate of classifying text messages, it is suitable for 3G network. Two text classification approaches, such as the DTBA and the classical minimum risk-based Bayesian algorithm (MRBA), are tested in the TD-SCDMA system. As a result, the DTBA has better controllability, and the recall rate of the junk text messages can reach 95.2%. So the real-time and high efficient anti-junk messages filter can be achieved by the DTBA.
Keywords :
3G mobile communication; Bayes methods; electronic messaging; 3G network; 3rd generation mobile communication technology; Bayesian filtering algorithm; TD-SCDMA system; document frequency; double threshold Bayesian algorithm; junk information; minimum risk-based Bayesian algorithm; pornographic; text filter; Bayesian methods; Classification algorithms; Filtering algorithms; Information filters; Support vector machine classification; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Wireless Communications Networking and Mobile Computing (WiCOM), 2010 6th International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-3708-5
Electronic_ISBN :
978-1-4244-3709-2
Type :
conf
DOI :
10.1109/WICOM.2010.5601282
Filename :
5601282
Link To Document :
بازگشت