Title :
Duplicate Reviews Detection
Author :
Li Zhen ; Lin Chen ; Li Bi-Cheng
Author_Institution :
Inf. Process. Dept., Inf. Technol. Inst., Zhengzhou, China
Abstract :
With the rapid development of Internet, BBS has become an important place for the people to acquire information and write reviews. However the existing of a vast number of duplicate reviews has been a new problem, so the effective detection and removing of duplicate reviews is crucial to the BBS information acquisition and supervision system. Considering its characteristics, we proposed a method of duplicate reviews detection based on SHA-1 algorithm in the paper. The experimental results show that the proposed method is very effective.
Keywords :
Internet; information analysis; public information systems; reviews; BBS information acquisition; Internet; SHA-1 algorithm; duplicate reviews detection; supervision system; Data mining; Error analysis; Feature extraction; File systems; Fingerprint recognition; Frequency; Information processing; Information technology; Internet; Large-scale systems;
Conference_Titel :
Biomedical Engineering and Computer Science (ICBECS), 2010 International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5315-3
DOI :
10.1109/ICBECS.2010.5462334