DocumentCode
3154843
Title
Machine learning for automated tender classification
Author
Goswami, Sumit ; Kapoor, Sunaina ; Bhardwaj, Prakriti
Author_Institution
Dte of Mgmt Inf. Syst. & Technol. (MIST), Defence R&D Organ. (DRDO), New Delhi, India
fYear
2011
fDate
16-18 Dec. 2011
Firstpage
1
Lastpage
4
Abstract
This paper presents classification of DRDO tender documents into predefined categories. Since there is a consistent growth in the volume of digital documents, both on the internet and within organizations, the need to classify them into categories is obvious. In this paper we used `bag-of-words´ technique to represent the tender documents. The dataset was prepared and fed into Weka toolkit. Classification was implemented by Naïve Bayes classifier using 10-folds cross validation technique. The machine resulted in classifying the tender documents with an accuracy of 77.36% by technology category and 67.2% by lab´s name.
Keywords
Bayes methods; document handling; learning (artificial intelligence); pattern classification; procurement; tendering; 10-folds cross validation technique; DRDO tender documents; Internet; Weka toolkit; automated tender classification; bag-of-words technique; digital document; machine learning; naïve Bayes classifier; Accuracy; Blogs; Machine learning; Probabilistic logic; Support vector machine classification; Text mining; DRDO; machine learning; naïve bayes classifier; tenderdtocument classification; text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
India Conference (INDICON), 2011 Annual IEEE
Conference_Location
Hyderabad
Print_ISBN
978-1-4577-1110-7
Type
conf
DOI
10.1109/INDCON.2011.6139406
Filename
6139406
Link To Document