Title :
Learning Classification with Auxiliary Probabilistic Information
Author :
Nguyen, Quang ; Valizadegan, Hamed ; Hauskrecht, Milos
Author_Institution :
Dept. of Comput. Sci., Univ. of Pittsburgh Pittsburgh, Pittsburgh, PA, USA
Abstract :
Finding ways of incorporating auxiliary information or auxiliary data into the learning process has been the topic of active data mining and machine learning research in recent years. In this work we study and develop a new framework for classification learning problem in which, in addition to class labels, the learner is provided with an auxiliary (probabilistic) information that reflects how strong the expert feels about the class label. This approach can be extremely useful for many practical classification tasks that rely on subjective label assessment and where the cost of acquiring additional auxiliary information is negligible when compared to the cost of the example analysis and labelling. We develop classification algorithms capable of using the auxiliary information to make the learning process more efficient in terms of the sample complexity. We demonstrate the benefit of the approach on a number of synthetic and real world data sets by comparing it to the learning with class labels only.
Keywords :
data mining; learning (artificial intelligence); pattern classification; probability; active data mining; auxiliary data; auxiliary probabilistic information; classification algorithm; classification learning problem; learning process; machine learning research; Concrete; Data models; Humans; Logistics; Machine learning; Noise; Probabilistic logic; classification learning; learning with auxiliary label information; sample complexity;
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
Print_ISBN :
978-1-4577-2075-8
DOI :
10.1109/ICDM.2011.84