Title :
Identification and Disambiguation of Product Mentions with Information Retrieval and Problem Specific Methods
Abstract :
Identifying personal, institutional, or product names in text is an important task for numerous applications. This paper describes a solution for the Consumer Products Contest organized at the International Conference on Data Mining 2012. The goal of the competition is to determine the state-of-the-art methods for automatically recognizing product mentions in a collection of web documents and for correctly identifying the product(s) in a large product catalog that each mention refers to. Our approach combines information retrieval methods with problem-specific heuristics.
Keywords :
Web sites; consumer products; data mining; document handling; information retrieval; Consumer Product Contest; International Conference on Data Mining 2012; Web document collection methods; institutional name identification; personal name identification; problem specific methods; product catalog; product disambiguation; product identification; product name identification; state-of-the-art methods; Automotive engineering; Catalogs; Consumer electronics; Dictionaries; ICDM contest; Named entity recognition
Conference_Title :
Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-5164-5
DOI :
10.1109/ICDMW.2012.60