Title :
Classifying Spend Descriptions with Off-the-Shelf Learning Components
Author :
Mukherjee, Saikat ; Fradkin, Dmitriy ; Roth, Michael
Abstract :
Analyzing spend transactions is essential to organizations for understanding their global procurement. Central to this analysis is the automated classification of these transactions to hierarchical commodity coding systems. Spend classification is challenging due not only to the complexities of the commodity coding systems but also because of the sparseness and quality of each individual transaction text description and the volume of such transactions in an organization. In this paper, we demonstrate the application of off-the-shelf machine learning tools to address the challenges in spend classification. We have built a system using off-the-shelf SVM, logistic regression, and language processing toolkits and describe the effectiveness of these different learning techniques for spend classification.
Keywords :
classification; procurement; production engineering computing; regression analysis; support vector machines; automated classification; global procurement; hierarchical commodity coding systems; language processing toolkits; logistic regression; off-the-shelf learning components; off-the-shelf machine learning tools; spend transactions; transaction text description; Artificial intelligence; Data systems; Error correction; Humans; Logistics; Machine learning; Procurement; Software tools; Support vector machine classification; Support vector machines; BMR; noisy channel; spend;
Conference_Titel :
Tools with Artificial Intelligence, 2008. ICTAI '08. 20th IEEE International Conference on
Conference_Location :
Dayton, OH
Print_ISBN :
978-0-7695-3440-4
DOI :
10.1109/ICTAI.2008.95