Title :
Naïve bayes classification of DRDO tender documents
Author :
Goswami, Suparna ; Bhardwaj, Prakriti ; Kapoor, Shubham
Author_Institution :
Directorate of P&C, DRDO, New Delhi, India
Abstract :
We propose a technique of automatic classification of DRDO tender documents into predefined technological categories. The dataset comprised of 698 tender files obtained from DRDO website. The dataset was processed and fed into Weka toolkit. Experiments were conducted using Naïve Bayes classifier with 10 folds cross validation. The documents were classified with 75.21 percent accuracy by technology area and 68.62 percent by lab´s name when the dataset included lab names somewhere in the document. An accuracy result of 74.36 percent and 68.34 percent was attained for the technology area and name of the lab respectively when all occurrences of lab names were removed from the dataset. This experiment is a step to train auto-classification of the tender documents available on Internet for Selective Dissemination of Information to the Concerned Vendors. Also, this is an attempt to gauge the area of work, undergoing tasks and the exact work being carried by an organisation based on the tenders published by it.
Keywords :
Bayes methods; data mining; document delivery; information dissemination; learning (artificial intelligence); DRDO Website; DRDO tender document classification; Defense Research and Development Organisation; Internet; SDI; lab names; machine learning; naïve Bayes classifier; selective dissemination of information; tender files; text mining; Accuracy; Blogs; Computers; Internet; Organizations; Procurement; Text categorization; Machine Learning; Naïve Bayes; Selective Dissemination of Information (SDI); Tender Document Classification; Text Mining;
Conference_Titel :
Computing for Sustainable Global Development (INDIACom), 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-93-80544-10-6
DOI :
10.1109/IndiaCom.2014.6828030