DocumentCode :
3419920
Title :
Probabilistic model for a distributed feature selection method
Author :
Berényi, Zsolt ; Vajk, István
Author_Institution :
Dept. of Autom. & Appl. Inf., Budapest Univ. of Technol. & Econ., Budapest, Hungary
fYear :
2009
fDate :
July 29 2009-Aug. 1 2009
Firstpage :
27
Lastpage :
32
Abstract :
When building topic based document classifiers, feature selection is a key step: features not holding any information about the topic of a document introduce only unnecessary noise during the classification. In a distributed environment, when the nodes are interacting, the locally retrieved features and the their attributes must be shared to have at every node a more accurate estimation of the global classifier. When expanding the knowledge of the local classifiers, to reduce costs, the network traffic should be kept to a minimum. We propose a probabilistic model for a keyword selection method which makes a more thorough analysis possible and can be used as a baseline when sharing information in a distributed environment. It can be used for incrementally building up the distributed classifiers ensuring minimal network traffic. This model can be refined later on by sending more content-related information to achieve higher performance. This probabilistic model together with experimental results are presented in this paper.
Keywords :
document handling; feature extraction; minimisation; mobile computing; pattern classification; probability; distributed feature selection method; distributed mobile environment; document classifier; information sharing; keyword selection method; network traffic minimisation; probabilistic model; Automation; Environmental economics; Informatics; Information analysis; Information retrieval; Mobile communication; Noise reduction; Peer to peer computing; Telecommunication traffic; Traffic control;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Soft Computing Applications, 2009. SOFA '09. 3rd International Workshop on
Conference_Location :
Arad
Print_ISBN :
978-1-4244-5054-1
Electronic_ISBN :
978-1-4244-5056-5
Type :
conf
DOI :
10.1109/SOFA.2009.5254884
Filename :
5254884
Link To Document :
بازگشت