Title :
Integration of domain knowledge in the form of ancillary map data into supervised classification of remotely sensed data
Author :
Friedl, Mark A. ; McIver, Douglas K. ; Brodley, C.E.
Author_Institution :
Dept. of Geogr., Boston Univ., MA, USA
Abstract :
In recent years, machine learning and data mining methods have become increasingly common in remote sensing applications. One area in which such techniques are particularly useful is classification of remotely sensed data for land cover and vegetation mapping applications. In this paper, we describe new methods to include available information (domain knowledge) in supervised classification of land cover using high dimensional remote sensing observations. specifically, land cover and vegetation classification schemes are generally designed for ecological or land use applications. As a result, the classes of interest are often poorly separable in the multi-spectral or multi-temporal feature space provided by remote sensing. In many cases, ancillary data sources can provide useful information to help distinguish between problematic classes. However, available methods for including ancillary data sources, such as the use of prior probabilities in maximum likelihood classification, are often problematic in practice. This paper presents a method for incorporating prior probabilities in remote sensing-based land cover classification using a supervised decision tree classification algorithm. The method exploits recent theory from the domain of statistics and machine learning that allows robust estimates of class membership to be estimated using a technique known as boosting. This approach allows poorly separable classes to be distinguished based on ancillary information, but does not penalize rare classes.
Keywords :
geophysical signal processing; geophysical techniques; image classification; multidimensional signal processing; terrain mapping; ancillary data; ancillary map data; available information; domain knowledge integration; geophysical measurement technique; high dimensional observations; image classification; land cover; land surface; land use; multidimensional signal processing; multispectral remote sensing; remote sensing; supervised classification; terrain mapping; vegetation; Classification algorithms; Classification tree analysis; Data mining; Decision trees; Machine learning; Machine learning algorithms; Maximum likelihood estimation; Probability; Remote sensing; Vegetation mapping;
Conference_Titel :
Geoscience and Remote Sensing Symposium, 2002. IGARSS '02. 2002 IEEE International
Print_ISBN :
0-7803-7536-X
DOI :
10.1109/IGARSS.2002.1025768