• DocumentCode
    484568
  • Title

    Generating High-Quality Training Data for Automated Land-Cover Mapping

  • Author

    Rebbapragada, U. ; Lomasky, R. ; Brodley, C.E. ; Friedl, M.A.

  • Author_Institution
    Dept. of Comput. Sci., Tufts Univ., Medford, MA
  • Volume
    4
  • fYear
    2008
  • fDate
    7-11 July 2008
  • Abstract
    This paper presents two machine learning techniques that greatly reduce the number of person-hours required to generate high-quality training data for land cover classification. The first technique uses active learning to guide the generation of training data by selecting only the most informative examples for labeling. The second technique identifies and mitigates the impact of mislabeled instances. Both techniques are tested on data from NASA´s Moderate Resolution Imaging Spectroradiometer (MODIS), which has required thousands of person hours to label. Our results shows that the active learning method requires fewer labeled examples than random sampling to produce a high quality classifier. Our results on class noise mitigation show that if mislabelings occur, we can further improve classifier accuracy, and that weighting instances by their label confidence outperforms an analogous method that discards suspected mislabelings. If combined, these methods have the potential to make training data generation a more efficient and reliable process.
  • Keywords
    geophysics computing; image classification; learning (artificial intelligence); terrain mapping; MODIS; Moderate Resolution Imaging Spectroradiometer; NASA; active learning method; analogous method; automated land-cover mapping; data generation; generate high-quality training data; labeling; land cover classification; machine learning techniques; mislabeled instances; Humans; Labeling; Learning systems; MODIS; Machine learning; Sampling methods; Support vector machine classification; Support vector machines; Testing; Training data; active learning; landcover classification; supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Geoscience and Remote Sensing Symposium, 2008. IGARSS 2008. IEEE International
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4244-2807-6
  • Electronic_ISBN
    978-1-4244-2808-3
  • Type

    conf

  • DOI
    10.1109/IGARSS.2008.4779779
  • Filename
    4779779