Zhen Zhang, Jue Huang, Tao Jiang, Baikai Sui, Xinliang Pan
Journal of Applied Remote Sensing, Vol. 14, Issue 01, 016502, (January 2020) https://doi.org/10.1117/1.JRS.14.016502
TOPICS: Image segmentation, Remote sensing, Convolution, Buildings, RGB color model, Vegetation, Image processing algorithms and systems, Image processing, Image classification, Infrared imaging
Large intraclass variance and low interclass variance are among the most challenging problems in very high-resolution (VHR) image classification. Semantic segmentation constructed in a deep convolution neural network is used as a classification algorithm conducted via end-to-end training, which combines spectral–spatial features and context information. However, large-scale remote sensing images cannot be directly processed because they are limited by GPU memory and segmentation algorithm. At the same time, classification using single band combinations is also unsatisfactory due to the extraordinary complex features of VHR images. Therefore, a method is proposed based on multiple band combinations and patchwise scene analysis. A complex remote sensing image can be considered as into a combination of simple scenes from multiple patchwise images. And optimal band combinations of each patchwise image are selected according to their scene. The segmentation results of each patchwise image are merged to get the desired results according to geographical coordinates. Our method is validated on the ISPRS 2-D Semantic Labeling dataset of Potsdam, on which results competitive with the state-of-the-art are obtained. The proposed scheme has strong universality and can be used for large-scale high-resolution remote sensing image classification.