Superpixel generation for synthetic aperture radar imagery using edge-dominated local clustering

Hao Hu; Bin Liu; Zenghui Zhang; Weiwei Guo; Wenxian Yu

doi:10.1117/1.JRS.12.045006

16 October 2018 Superpixel generation for synthetic aperture radar imagery using edge-dominated local clustering

Hao Hu, Bin Liu, Zenghui Zhang, Weiwei Guo, Wenxian Yu

Author Affiliations +

Journal of Applied Remote Sensing, Vol. 12, Issue 4, 045006 (October 2018). https://doi.org/10.1117/1.JRS.12.045006

Abstract

Recently, superpixel-based methods have shown promising performance for synthetic aperture radar (SAR) image interpretation. In these methods, the statistical model-based local iterative clustering represents the mainstream of superpixel generation for SAR images. However, errors in the model parameter estimation degrade the accuracy of the model-based distance measure between a pixel and a cluster, which directly affects the performance of superpixel segmentation results. Further, the relative weight between statistical similarity and spatial proximity should be carefully selected to control the balance between boundary adherence and regularity of superpixels. An edge-dominated local clustering method is proposed to overcome these limitations. Edge information is introduced not only to define the dissimilarity of a pixel and a cluster but also to provide an adaptive grid with multiple layers for the initialization of cluster centers. Experiments on simulated and real datasets show that, compared with the previous algorithms using the statistical model-based dissimilarity, the proposed method produces superpixels, which have better edge adherence and stable performance.

1. Introduction

Due to the active coherent microwave imaging mechanism, synthetic aperture radar (SAR) provides high-resolution images independent from daylight, cloud coverage, and weather conditions.¹ Nowadays, SAR images have become a regular and powerful information sources for many applications, including environmental monitoring, terrain classification, etc. However, the interpretation of SAR images is still a challenging task because of their special imaging mechanism. In recent years, superpixel-based methods have attracted increasing attention for SAR image understanding. The basic concept of superpixel was first presented by Ren and Malik² as the local coherent regions using an oversegmentation algorithm. As superpixels group the pixels with similar characteristics into meaningful atomic regions, they can effectively capture image features and well adhere to object boundaries. Therefore, superpixels can achieve a better perceptual representation of images than pixels, as well as reduce the complexity of subsequent image processing tasks, such as segmentation, classification, object detection, and so on.

Until now, most of the superpixel generation methods for SAR images with promising performance are specially tailored from the ones proposed in the computer vision community, such as normalized cut,³ turbopixels,⁴ simple linear iterative clustering (SLIC),⁵ etc. Normalized cut is the most classical algorithm;it treats image segmentation as a graph partitioning problem and globally minimizes the segmentation cost. However, the high computational complexity has limited the wide applicability of this algorithm. Turbopixels is an effective method for generating superpixels, and it has been applied for SAR image analysis in much research.⁶^,⁷ It gradually dilates regularly distributed seeds using geometric flows and poses strong constraints on the uniformness and compactness of superpixels. Meanwhile, due to the stability and efficiency issues of the underlying level-set method, the generated superpixels present relatively lower adherence to boundaries,⁸ and computational results show that it runs relatively slower on real-world datasets than the other $O (N)$ superpixel algorithms.⁵^,⁹

On the contrary, SLIC⁵ has been widely used in SAR images because of its simple concept, easy implementation, and high efficiency in practice. SLIC assigns each pixel to a cluster of the nearest seed and iteratively updates the cluster center by computing a pixel-to-cluster distance measure. However, in the original SLIC, this measure is obtained using five-dimensional (5-D) Euclidean distance in $l a b x y$ space,⁵ which cannot be applied directly on SAR images due to the multiplicative speckle noise. Thus, some alternative distance measures have been proposed in the last few years. For instance, Xiang et al.¹⁰ used a distance based on pixel intensity and location similarity for SAR images that is derived from the Nakagami–Rayleigh distribution and pixel intensity ratio. Zou et al.¹¹ combined the generalized gamma distribution-based likelihood value with spatial distance to represent the pixel-to-cluster similarity. Yu et al.¹² proposed a distance of two patches based on the likelihood ratio test statistic following the exponential distribution and used it to measure the intensity dissimilarity of a pixel and a cluster center. For polarimetric SAR (PolSAR) images, Feng et al.¹³ directly used a complex Wishart distribution-based distance as a substitute for the feature-based distance in SLIC to generate superpixels. Song et al.¹⁴ defined a dissimilarity using the Bartlett distance, which is derived from hypothesis tests on Wishart distribution. Qin et al.¹⁵ improved the cluster center initialization and used the revised Wishart distance for local clustering. Xiang et al.¹⁶ defined a similarity measure that contains multiple cues, including polarimetric, texture, and spatial information.

In summary, to relieve the speckle noise effect and make the SLIC method applicable for SAR/PolSAR images, most of the existing research follows two ideas: (1) replacing the color-based distance with statistical model based ones and making improvements and (2) combining statistical models with other features to construct a compound distance, such as $D_{T} = D_{1} + D_{2} + \dots + D_{n}$ . However, there is a problem with these two ideas. First, to calculate the aforementioned pixel-to-cluster distance measures, the parameters of the statistical models should be estimated accurately in each cluster. However, the initial clusters are sampled on a regular grid and will continually change during the local iterative clustering, which means the assumption of the independent and identically distribution (i.i.d.) in the clusters is usually violated, especially in heterogeneous areas. In this situation, the estimated parameters are biased, so the accuracy of distance measures will be degraded and the performance of superpixel generation will be affected. Second, combining statistical models with other features can partly improve the accuracy of the pixel-to-cluster distance measure, but the direct adding of different distances still lacks theoretical support. If there were remarkable differences in the range and distribution of values of each distance, the addition of multiple distances derived from different features would be unreliable in some cases.

In this paper, we explore the issue of distance measure from another point of view. Motivated by Leung and Malik,¹⁷ edge information can be directly used to define the dissimilarity between pairwise pixels in the natural images. Additionally, in SAR images, edges are not the simple sharp changes in image brightness, but significantly reflect changes in the statistical properties of each area in the images. In other words, edge information can be considered the abstraction of the underlying statistical characteristics and a bridge to connect statistics and superpixels. Thus, the edges are more perceptual and stable to represent dissimilarity between two pixels if there is an edge located in the middle of them. Liu et al.¹⁸ computed the dissimilarity by the edge information, which is extracted by a classical region-based detector for SAR images, but the detector suffers from the scale dilemma and the orientation problem.¹⁹ Thus, the locations of edge points are unreliable and the performance of superpixels is not satisfactory. To overcome the limitation, we adopt an up-to-date detector to extract the edge information more precisely and define an edge-dominant distance to replace the statistical model-based distance. Experimental results confirm that a reliable result of superpixels can be provided using only edge information and the superpixels can well adhere to the real edges.

Another problem with the model-based SLIC methods is that it is often difficult to make an appropriate selection of the relative weight between statistical similarity and spatial proximity. The weight is important for offering a balance between boundary adherence, compactness, and regularity of superpixels.⁵ However, it is usually set manually to a constant value by trial and error, which might still not be suitable for each iteration and is often too large to lead to undersegmentations in some areas. To solve this problem, we built an initialization step for the cluster centers with an edge-adaptive grid (EG). This grid has multiple layers that are generated based on edge information and quadtree decomposition. Experiments show that it is able to reduce the negative effect caused by a large value of the relative weight and make the performance of superpixels less sensitive to the changes of weight.

The remainder of this paper is organized as follows. The proposed method is described in Sec. 2. The experiments and the performance evaluations are presented in Sec. 3. The conclusions are given in Sec. 4.

2. Superpixel Generation

2.1.

Edge Extraction

In this paper, the edge information is extracted using the degenerate filter with the weight maximum likelihood estimation (DG-WMLE) proposed in Ref. 19. The DG-WMLE method can address the scale dilemma in edge extraction and provide a better performance on the estimation of the edge strength and the location of edge points, which is extremely important and necessary for generating superpixels with a good boundary adherence. The key design of the DG-WMLE method is a degenerate filter, as illustrated in Fig. 1. The edge strength of the center pixel is estimated by the dissimilarity between the two pixels adjacent to the center pixel. And the calculation of this dissimilarity needs the noise-free intensity of the two pixels. According to Refs. 20 and 21, the noise-free value can be evaluated using the WMLE, which is

Eq. (1)

\hat{μ} (x) = \frac{\sum_{x^{'} \in R_{SW} (x)} ω (x, x^{'}) I (x^{'})}{\sum_{x^{'} \in R_{SW} (x)} ω (x, x^{'})},

where

I (x^{'})

means the intensity value of the SAR image with noise. The WMLE estimation on

x

uses all the values

I (x^{'})

in the search window

R_{SW} (x)

, and the design of the window is inherited from the classic region-based filter. The weight

ω

is derived from the probabilistic patch-based dissimilarity using an exponential kernel²⁰^,²¹ and is calculated as follows:

Eq. (2)

ω (x, x^{'}) = \exp {- \frac{D_{PPB} (P_{x}, P_{x^{'}})}{h}},

where

D_{PPB} (P_{x}, P_{x^{'}})

denotes the patch-based dissimilarity measure of two patches

P_{x}

and

P_{x^{'}}

, with

x

and

x^{'}

as the centers, respectively, and

h > 0

is the kernel parameter.¹⁹

Fig. 1

The degenerate filter design for edge extraction: $l_{df}$ and $w_{df}$ are the length and width of the search window, respectively, $d_{df}$ is the spacing between the two pixels for calculating the edge strength at the center pixel, and $θ_{df}$ is the filter orientation. This figure is adopted from Ref. 19.

Considering the design of the DG filter and the WMLE-based estimation method, if $x$ and $y$ are the two adjacent pixels to the center pixel $z$ , the corresponding indicator of the edge information (i.e., the edge strength at the pixel $z$ ) at the current orientation of the filter $θ_{df}$ is calculated with the use of the Bhattacharyya distance,²²^–²⁴ and the edge strength at the pixel $z$ is the maximum value among all the orientations, as shown in Eqs. (3) and (4):

Eq. (3)

E (z, θ_{df}) = D_{B} (x, y) = 2 \ln \frac{\hat{μ} (x) + \hat{μ} (y)}{2} - \ln \hat{μ} (x) - \ln \hat{μ} (y),

Eq. (4)

E^{*} (z) = \max_{θ_{df}} E (z, θ_{df}) .

The relative parameters are set as suggested in Ref. 19. The orientations of the filter are ${0, \frac{π}{4}, \frac{π}{2}, \frac{3 π}{4}}$ . The detailed information about the DG-WMLE edge extractor can be found in Ref. 19.

2.2.

Edge-Dominated Local Clustering

The SLIC⁵ is an effective and efficient method for superpixel generation. The basic idea of the SLIC is a local k-means clustering method, including three steps: (1) initialization of cluster centers by a regular grid (RG); (2) iterative local clustering based on a distance measure between a pixel and a cluster center; and (3) postprocessing to remove isolated pixels and enforce the connectivity of superpixels.

In general, the performance of the SLIC is greatly affected by the capability of the distance measure. In the original SLIC, this measure is defined as the 5-D Euclidean distance combining the color similarity and the spatial proximity.⁵ Since this distance cannot be directly applied for SAR images with multiplicative speckle noise, several studies in recent years have deduced suitable measures and introduced them into the SLIC, as discussed in Sec. 1. Motivated by the work of Leung and Malik¹⁷ and Liu et al.,¹⁸ in this paper, we directly use the aforementioned DG-WMLE edge information to measure the pairwise dissimilarity of two arbitrary pixels. As shown in Fig. 2, the edge-based pairwise dissimilarity is perceptually meaningful, easy to understand, and can ensure a good boundary adherence of superpixels.

Fig. 2

Illustration of the pairwise dissimilarity using edge information. (a) SAR image and (b) the extraction result of DG-WMLE. Because of the high value of edge strength existing along the line $l_{2}$ , the pixels $p_{1}$ and $p_{3}$ are suggested to be divided into different clusters. On the contrary, the pixels $p_{1}$ and $p_{2}$ probably belong to the same cluster. This figure is adopted from Ref. 18.

The dissimilarity of two pixels $x$ and $y$ is defined as follows:

Eq. (5)

d_{Edge} (x, y) = \arg \max_{z \in l} E^{*} (z),

where

E^{*} (z)

denotes the edge strength at the pixel

z

and

l

is the line connecting

x

and

y

.

Similar to Ref. 5, the distance measure for edge-dominated local clustering (EDLC) is defined as follows:

Eq. (6)

d_{ED} = \sqrt{d_{Edge}^{2} + m {(\frac{d_{Sp}}{S})}^{2}},

where the subscript ED stands for edge-dominated,

d_{Sp}

is the spatial distance of the pairwise pixels, and

S

is the grid interval.

m

is a relative weight introduced to control the relative importance of the edge information against the spatial distance.

As mentioned in Sec. 1, the value of $m$ should be carefully determined to offer a balance between boundary adherence, compactness, and regularity of superpixels. A smaller $m$ will emphasize $d_{Edge}$ more and makes the generated superpixels adhere better to the real boundaries. However, a larger $m$ will emphasize $d_{Sp}$ and makes the superpixels more compact and regular. As shown in Figs. 3(b) and 3(c), an inappropriate choice of $m$ leads to an unsatisfactory segmentation result. More specifically, a large value of $m$ around the edges will have a fatal impact on the performance of segmentation.

Fig. 3

Illustration of two different initializations of cluster centers and the corresponding superpixel segmentation results. (a) initialized by a RG and (d) by an EG. (b) and (e) The results produced using the distance measure in Eq. (6) with $m = 0.5$ , respectively. (c) and (f) The results with $m = 1.5$ .

Motivated by the idea of quadtree mesh generation,²⁵ we provide an initialization strategy with an EG instead of the RG to overcome this limitation. First, a RG is generated on the image according to the expected number of superpixels. Next, an automatic thresholding²⁶ is applied on the extraction result of the DG-WMLE to get an edge map. Then, if the number of edge points in any block of the RG exceeds a preset threshold, the block is recursively subdivided into four smaller equal-sized parts. In this way, a multilayer grid adaptive to the edge information is generated, as displayed in Fig. 3(d). In Fig. 3, under the same value of $m$ and a similar amount of initial clusters, it is shown that EG-based initialization has a larger grid interval $S$ than RG. In addition, more initial centers are generated close to the real edges, which makes the spatial distance $d_{Sp}$ between pixels and cluster centers around the edges decrease a lot. In both cases, according to Eq. (6), the importance of spatial proximity will be weakened, i.e., the importance of edge information will be emphasized. Thus, the boundary adherence of superpixels around the real edges can be improved significantly, as shown in Figs. 3(e) and 3(f).

In summary, the procedure of EDLC for superpixel generation is presented as follows:

(1) Parameter setting—Set the number of blocks $N_{b}$ in the top layer of EG, the number of layers $N_{L}$ for EG, the relative weight $m$ , and the maximum number of iterations $N_{itr}$ .
(2) Initialization of cluster centers—Generate an EG based on the edge map obtained from the DG-WMLE, and set the center of each block as an initial cluster center. To avoid getting centers on pixels with strong edge strengths, move every center to the position with the lowest edge strength in its $3 \times 3$ neighborhood.
(3) Local iterative clustering—For a cluster center $C$ , compute the edge-dominated distance $d_{ED}$ between $C$ and each pixel $p$ in the region of $2 S \times 2 S$ around $C$ , according to Eq. (6). Here, $S$ is the grid interval of the top layer in EG. Then, assign $p$ to the cluster with the minimum $d_{ED}$ , and save the cluster label for $p$ . After all the cluster centers are processed, update the locations of centers and calculate the residual error $E_{r}$ ( $L_{1}$ distance between previous centers and recomputed centers). Repeat the assignment and updating until the error $E_{r}$ converges or the number of iterations reaches $N_{itr}$ . In our experiments, 20 iterations are found to be enough, and this number is used as the stopping criterion in all the tests.
(4) Postprocessing—Due to lack of connectivity enforcement, there may be some broken superpixels produced in the final clustering results. To correct for this, find the regions with the size smaller than 10 pixels, and reassign each pixel of these regions into a large neighboring superpixel with the minimum likelihood-based distance. After this processing, small isolated regions are carefully removed and the boundaries of most of the other superpixels remain the same.

An intuitive flowchart is shown in Fig. 4.

Fig. 4

The flowchart of EDLC.

3. Experiments and Analyses

3.1.

Datasets

In this section, we generate a simulated four-look SAR image based on the Monte Carlo procedure²⁷ to objectively evaluate the performance of the proposed method. The size of the image is $300 \times 300$ . The image contains five different regions, and the intensity of each region follows the gamma distribution, as shown in Fig. 5(a). The actual intensity values without the interference of noise in the five regions are set to 100, 400, 1600, 3600, and 8100, respectively. The corresponding ground truth of edges is given in Fig. 5(b).

Fig. 5

Simulated dataset. (a) A simulated SAR image with five regions following the gamma distribution. (b) The corresponding ground truth of edges.

In addition, two TerraSAR-X StripMap images are used in our experiments, as shown in Figs. 6(a) and 6(c). The first one is extracted from Dessau, Germany, covering several crop areas. The second is from South Mississippi, USA, covering both water and vegetation areas. The size of each is $300 \times 300$ . The pixel spacings are 3 m in both directions, and the number of looks is $\sim 6$ . The ground truth of edges from manual delineation is shown in Figs. 6(b) and 6(d).

Fig. 6

Two TerraSAR images for experiments: (a) from Dessau and (c) from South Mississippi. The ground truth of edges from manual delineation is shown in panels (b) and (d).

3.2.

Performance Evaluation

To evaluate the performance of the proposed method quantitatively, two commonly used metrics²⁸ are applied in this section: boundary recall (BR) and under-segmentation error (USE). BR is defined as the fraction of the ground truth edges correctly recovered with the superpixel boundaries. In practice, BR measures the percentage of ground truth edges that fall within superpixel boundaries with a tolerance distance $ϵ = 1$ . USE compares superpixel segment areas to measure to what extent superpixels cover the ground truth segment border. If $G_{i}$ is a ground truth segment, $S_{k}$ is a superpixel, and $| \cdot |$ indicates the size of the segment in pixels, USE is computed by

Eq. (7)

USE = \frac{\sum_{i} \sum_{k} \min (| S_{k} \cap G_{i} |, | S_{k} - G_{i} |)}{\sum_{i} | G_{i} |} .

Next, we compare the superpixel generation results of the EDLC with that of the other three methods, i.e., three different measures to represent the dissimilarity between a pixel and a cluster:

(1) the original SLIC using the grayscale-based dissimilarity:⁵
Eq. (8)
$d_{ori} = \sqrt{{(A_{j} - A_{i})}^{2}},$
where $A$ denotes the amplitude for SAR images.
(2) the likelihood-based SLIC (LB-SLIC)¹¹ with a likelihood value based dissimilarity:
Eq. (9)
$d_{LB} = \exp {- p (z_{j} | i)},$
where $z_{j}$ denotes the intensity of a given pixel, $p (z | i)$ is the conditional PDF of the $i^{'}$ th cluster $C_{i}$ , which can be defined by the gamma distribution:
Eq. (10)
$p (z | μ, L) = {(\frac{L}{μ})}^{L} z^{L - 1} \frac{1}{Γ (L)} \exp (- \frac{L z}{μ}),$
where $L$ is the number of looks and $μ$ is the noise-free intensity value. For the $i^{'}$ th cluster $C_{i}$ , the MLE of $μ$ is as follows:
Eq. (11)
$\hat{μ} = \frac{1}{N} \sum_{k \in C_{i}} z_{k} .$
(3) The modified SLIC using a patch-based dissimilarity (PB-SLIC):¹²
Eq. (12)
$d_{PB} (P_{i}, P_{j}) = 2 M \cdot \log \frac{{\bar{I}}_{P_{i j}}}{\sqrt{{\bar{I}}_{P_{i}} {\bar{I}}_{P_{j}}}},$
where $P_{i}$ and $P_{j}$ are two image patches with the center pixel $i$ and $j$ , ${\overline{I}}_{P_{i}}$ denotes the average intensity in the patch $P_{i}$ , and $M$ is the number of pixels in $P_{i}$ or $P_{j}$ . According to Ref. 12, a $5 \times 5$ patch is found to be appropriate and is used in the following tests.

To make a fair comparison, we first replace the $d_{Edge}$ in Eq. (6) with the above three dissimilarities. Then, we perform the same local clustering and postprocessing procedures to get the final results. To obtain superpixels with a good balance between boundary adherence and regularity, the values of the weight $m$ are all set carefully for all the three methods according to Refs. 5, 11, and 12. The number of layers $N_{L}$ for EDLC is set as 3. The number of blocks $N_{b}$ in the top layer of EG in EDLC is also set suitably to get a number of the generated superpixels similar to that of the other three methods. The maximum number of iterations $N_{itr}$ is set as 20.

For the simulated data, the segmentation results of EDLC, LB-SLIC, PB-SLIC, and SLIC are shown in Fig. 7 from left to right. The expected number of superpixels $N_{s p}$ in LB-SLIC, PB-SLIC, and SLIC is set as {100, 200, 300, 400, 500}, increasing from top to bottom. Additionally, in the same lines of the figures, the number of generated superpixels in EDLC is close to the other three methods. To provide superpixels with a better boundary adherence, $m$ is set as {0.5, 0.6, 1.0, 0.5} for the four methods, respectively. The numerical evaluation for the superpixels provided by these methods is shown in Fig. 8, using the aforementioned metrics BR and USE.

Fig. 7

Superpixels generated by the four methods on the simulated SAR image: (a) EDLC, (b) LB-SLIC, (c) PB-SLIC, and (d) SLIC. The number of superpixels is increasing from top to bottom (100 to 500). The weight $m$ is set as 0.5, 0.6, 1.0, and 0.5, respectively.

Fig. 8

Performance evaluation of EDLC, LB-SLIC, PB-SLIC, and SLIC on the simulated SAR image using (a) BR and (b) USE. In the legends, the numbers in brackets after the names of the methods are the values of $m$ .

From Figs. 7 and 8, we notice that

(1) The original SLIC has the worst performance among these four methods. SLIC has a good boundary adherence only at the borders between two regions with a low degree of similarity, such as regions 1 and 4 and regions 1 and 5. Some irregular superpixels are produced, and their boundaries poorly adhere to the real edges. The results show that the grayscale distance is not quite applicable for the superpixel generation on the SAR images with speckle noise.
(2) Although the BR values of LB-SLIC are close to EDLC, the regularity of superpixels in LB-SLIC is much worse, especially in regions 3 and 4 of the image. The reason for this is that the local clustering in LB-SLIC produces too much broken regions and orphaned pixels, so after merging in postprocessing, the nearby superpixels will probably turn into irregular regions. Further, as shown in Fig. 7(b), some real boundaries (between the regions 4 and 5 and the regions 3 and 4) are still not covered by the borders of superpixels in spite of the increase of $N_{s p}$ . This indicates a limitation in the performance of LB-SLIC, which is discussed in Sec. 1.
(3) The performance of PB-SLIC is worse than EDLC and LB-SLIC. According to Eq. (12), the average intensity of patches is used to calculate the dissimilarity of two central pixels. Thus, the pixels near the edges of two regions, which have a low degree of similarity, will indicate a high degree of similarity. As shown in Fig. 7(c), near the border between the regions 1 and 4 and the regions 1 and 5, some superpixels overlap with different regions at the same time. This overlapping clearly makes a poor adherence to the real boundaries and degrades the performance of PB-SLIC.
(4) The proposed EDLC method yields a noticeable improvement on the performance. The superpixels provided by EDLC obtain a higher value of BR and a lower USE than the other three methods. Although the value of $m$ is increased from 0.5 to 1.0 and the number of generated superpixels rises from 100 to 500, both of them have a smaller impact on the performance of EDLC. As shown in Fig. 7(a), the compactness and regularity of superpixels is also ensured.

For the two real images, the segmentation results of the four methods are shown in Figs. 9 and 10 from left to right. The expected number of superpixels $N_{s p}$ in LB-SLIC, PB-SLIC, and SLIC is set as {200, 300, 400, 500, 600}, increasing from top to bottom. And the number of generated superpixels in EDLC is close to the other three methods in the same lines. $m$ is set as {0.5, 0.6, 1.0, 0.3} for the four methods, respectively. The numerical evaluation for the superpixels provided by these methods is shown in Figs. 11 and 12. From these figures, the proposed EDLC still provides better results than the other three methods, considering both BR and USE. Although with a low value of $m$ , LB-SLIC or PB-SLIC can obtain a good boundary adherence, which is close to or even a little bit better than EDLC, their performance of USE is worse. There are also many irregular superpixels generated both near the real boundaries and inside the homogenous areas. Furthermore, a lot of broken regions are produced during the local clustering, so the number of superpixels in the final results is much more than the preset value of $N_{s p}$ . In general, the visual presentation of LB-SLIC and PB-SLIC is poorer because of these negative attributes.

Fig. 9

Superpixels generated by the four methods on the real image 1: (a) EDLC, (b) LB-SLIC, (c) PB-SLIC, and (d) SLIC. The number of superpixels isincreasing from top to bottom (200 to 600). The weight $m$ is set as 0.5, 0.6, 1.0, and 0.3, respectively.

Fig. 10

Superpixels generated by the four methods on the real image 2: (a) EDLC, (b) LB-SLIC, (c) PB-SLIC, and (d) SLIC. The number of superpixels is increasing from top to bottom (200 to 600). The weight $m$ is set as 0.5, 0.6, 1.0, and 0.3, respectively.

Fig. 11

Performance evaluation of EDLC, LB-SLIC, PB-SLIC, and SLIC on the real image 1 using (a) BR and (b) USE. In the legends, the numbers in brackets are the values of $m$ .

Fig. 12

Performance evaluation of EDLC, LB-SLIC, PB-SLIC, and SLIC on the real image 2 using (a) BR and (b) USE. In the legends, the numbers in brackets are the values of $m$ .

3.3.

Parameter Analysis

According to Sec. 2.2, two parameters need to be determined before EDLC: the weight $m$ and the number of layers $N_{L}$ . As shown in Fig. 3, both $m$ and $N_{L}$ have a great influence on the superpixel segmentation results. To evaluate the impact of the two parameters, we set $N_{L} = {1,2, 3}$ and $m = {0.5, 0.8, 1.0}$ ; then, we applied the EDLC to the simulated image. The performance on the condition of different parameters is shown in Fig. 13. From the figures, it is noticed that, with the increase of layers, the boundary adherence of EDLC is improved remarkably. In addition, the BR and USE curves under different values of $m$ become much closer to each other. This represents that, by the initialization of EG, the performance of EDLC is less sensitive to the change of $m$ than using RG. Thus, we used $N_{L} = 3$ in all the experiments, and set $m$ in the range $[0.5, 1.0]$ for EDLC. If the proposed method is applied for a larger dataset, more layers are recommended. However, the size of blocks in the bottom of EG is not suggested to be smaller than $5 \times 5$ .

Fig. 13

Performance evaluation of EDLC using (a) BR and (b) USE under different parameters. Here, “EG3” denotes $N_{L} = 3$ , “EG2” denotes $N_{L} = 2$ . When $N_{L} = 1$ , EG equals to RG. In the legends, the numbers in brackets are the values of $m$ .

4. Conclusions

In this paper, we propose an edge-dominated local clustering method to generate superpixels for SAR images. Edge information is introduced not only to define the dissimilarity of a pixel and a cluster but also to produce an adaptive grid for the initializations of cluster centers. Experiments on the simulated and real SAR images show that the proposed method provides an improved performance of boundary adherence and visual presentation, compared with the other methods using statistical model-based dissimilarities. In the future, we will extend the edge-dominated dissimilarity into multitemporal data and provide a segmentation result suitable for all the temporals. In this case, superpixels will become a basic element for multitemporal analysis.

Acknowledgments

This work was supported by the State Key Program of the National Natural Science Foundation of China under Grant No. 61331015. The authors declare no conflict of interest.

References

1.

C. Oliver and S. Quegan, Understanding Synthetic Aperture Radar Images, SciTech Publishing, Raleigh, North Carolina (2004). Google Scholar

2.

X. Ren and J. Malik, “Learning a classification model for segmentation,” in Proc. Ninth IEEE Int. Conf. Computer Vision, 10 –17 (2003). https://doi.org/10.1109/ICCV.2003.1238308 Google Scholar

3.

J. Shi and J. Malik, “Normalized cuts and image segmentation,” IEEE Trans. Pattern Anal. Mach. Intell., 22 888 –905 (2000). https://doi.org/10.1109/34.868688 ITPIDJ 0162-8828 Google Scholar

4.

A. Levinshtein et al., “Turbopixels: fast superpixels using geometric flows,” IEEE Trans. Pattern Anal. Mach. Intell., 31 (12), 2290 –2297 (2009). https://doi.org/10.1109/TPAMI.2009.96 ITPIDJ 0162-8828 Google Scholar

5.

R. Achanta et al., “SLIC superpixels compared to state-of-the-art superpixel methods,” IEEE Trans. Pattern Anal. Mach. Intell., 34 2274 –2282 (2012). https://doi.org/10.1109/TPAMI.2012.120 ITPIDJ 0162-8828 Google Scholar

6.

L. Gan et al., “Triplet Markov fields with edge location for fast unsupervised multi-class segmentation of synthetic aperture radar images,” IET Image Process., 6 (7), 831 –838 (2012). https://doi.org/10.1049/iet-ipr.2011.0198 Google Scholar

7.

H. Yu et al., “Context-based hierarchical unequal merging for sar image segmentation,” IEEE Trans. Geosci. Remote Sens., 51 995 –1009 (2013). https://doi.org/10.1109/TGRS.2012.2203604 IGRSD2 0196-2892 Google Scholar

8.

Y.-J. Gong and Y. Zhou, “Differential evolutionary superpixel segmentation,” IEEE Trans. Image Process., 27 (3), 1390 –1404 (2018). https://doi.org/10.1109/TIP.2017.2778569 IIPRE4 1057-7149 Google Scholar

9.

Y.-J. Liu et al., “Intrinsic manifold SLIC: a simple and efficient method for computing content-sensitive superpixels,” IEEE Trans. Pattern Anal. Mach. Intell., 40 (3), 653 –666 (2018). https://doi.org/10.1109/TPAMI.2017.2686857 ITPIDJ 0162-8828 Google Scholar

10.

D. Xiang et al., “Superpixel generating algorithm based on pixel intensity and location similarity for SAR image classification,” IEEE Geosci. Remote Sens. Lett., 10 1414 –1418 (2013). https://doi.org/10.1109/LGRS.2013.2259214 Google Scholar

11.

H. Zou et al., “A likelihood-based SLIC superpixel algorithm for SAR images using generalized Gamma distribution,” Sensors, 16 1 –16 (2016). https://doi.org/10.1109/JSEN.2016.2616227 SNSRES 0746-9462 Google Scholar

12.

W. Yu et al., “Superpixel-based cfar target detection for high-resolution sar images,” IEEE Geosci. Remote Sens. Lett., 13 730 –734 (2016). https://doi.org/10.1109/LGRS.2016.2540809 Google Scholar

13.

J. Feng, Z. Cao and Y. Pi, “Polarimetric contextual classification of PolSAR images using sparse representation and superpixels,” Remote Sens., 6 7158 –7181 (2014). https://doi.org/10.3390/rs6087158 Google Scholar

14.

H. Song et al., “Unsupervised classification of polarimetric SAR imagery using large-scale spectral clustering with spatial constraints,” Int. J. Remote Sens., 36 2816 –2830 (2015). https://doi.org/10.1080/01431161.2015.1043759 IJSEDK 0143-1161 Google Scholar

15.

F. Qin, J. Guo and F. Lang, “Superpixel segmentation for polarimetric SAR imagery using local iterative clustering,” IEEE Geosci. Remote Sens. Lett., 12 13 –17 (2015). https://doi.org/10.1109/LGRS.2014.2322960 Google Scholar

16.

D. Xiang et al., “Adaptive superpixel generation for polarimetric sar images with local iterative clustering and SIRV model,” IEEE Trans. Geosci. Remote Sens., 55 3115 –3131 (2017). https://doi.org/10.1109/TGRS.2017.2662010 IGRSD2 0196-2892 Google Scholar

17.

T. Leung and J. Malik, “Contour continuity in region based image segmentation,” in European Conf. on Computer Vision, 544 –559 (1998). Google Scholar

18.

B. Liu et al., “Superpixel-based classification with an adaptive number of classes for polarimetric SAR images,” IEEE Trans. Geosci. Remote Sens., 51 907 –924 (2013). https://doi.org/10.1109/TGRS.2012.2203358 IGRSD2 0196-2892 Google Scholar

19.

B. Liu et al., “Edge extraction for polarimetric SAR images using degenerate filter with weighted maximum likelihood estimation,” IEEE Geosci. Remote Sens. Lett., 11 2140 –2144 (2014). https://doi.org/10.1109/LGRS.2014.2321629 Google Scholar

20.

C.-A. Deledalle, L. Denis and F. Tupin, “Iterative weighted maximum likelihood denoising with probabilistic patch-based weights,” IEEE Trans. Image Process., 18 2661 –2672 (2009). https://doi.org/10.1109/TIP.2009.2029593 IIPRE4 1057-7149 Google Scholar

21.

C.-A. Deledalle et al., “NL-SAR: a unified nonlocal framework for resolution-preserving (Pol)(In)SAR denoising,” IEEE Trans. Geosci. Remote Sens., 53 (4), 2021 –2038 (2015). https://doi.org/10.1109/TGRS.2014.2352555 IGRSD2 0196-2892 Google Scholar

22.

A. D. Nascimento, R. J. Cintra and A. C. Frery, “Hypothesis testing in speckled data with stochastic distances,” IEEE Trans. Geosci. Remote Sens., 48 (1), 373 –385 (2010). https://doi.org/10.1109/TGRS.2009.2025498 IGRSD2 0196-2892 Google Scholar

23.

W. B. Silva et al., “Classification of segments in PolSAR imagery by minimum stochastic distances between Wishart distributions,” IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens., 6 (3), 1263 –1273 (2013). https://doi.org/10.1109/JSTARS.2013.2248132 Google Scholar

24.

A. C. Frery, A. D. Nascimento and R. J. Cintra, “Analytic expressions for stochastic distances between relaxed complex Wishart distributions,” IEEE Trans. Geosci. Remote Sens., 52 (2), 1213 –1226 (2014). https://doi.org/10.1109/TGRS.2013.2248737 IGRSD2 0196-2892 Google Scholar

25.

M. A. Yerry and M. S. Shephard, “A modified quadtree approach to finite element mesh generation,” IEEE Comput. Graphics Appl., 3 39 –46 (1983). https://doi.org/10.1109/MCG.1983.262997 Google Scholar

26.

A. Brink and N. Pendock, “Minimum cross-entropy threshold selection,” Pattern Recognit., 29 (1), 179 –188 (1996). https://doi.org/10.1016/0031-3203(95)00066-6 Google Scholar

27.

J. S. Lee, M. R. Grunes and R. Kwok, “Classification of multi-look polarimetric sar imagery based on complex Wishart distribution,” Int. J. Remote Sens., 15 (11), 2299 –2311 (1994). https://doi.org/10.1080/01431169408954244 IJSEDK 0143-1161 Google Scholar

28.

D. Stutz, A. Hermans and B. Leibe, “Superpixels: an evaluation of the state-of-the-art,” Comput. Vision Image Understanding, 166 1 –27 (2018). https://doi.org/10.1016/j.cviu.2017.03.007 Google Scholar

Biography

Hao Hu received his MSc degree in electronics and communication engineering from Shanghai Jiao Tong University, Shanghai, China, in 2012, where he is currently working toward his PhD in information and communication engineering, Department of Electronic Engineering. His research interests are in the domain of synthetic aperture radar (SAR) image interpretation (segmentation, classification, and multitemporal analysis).

Bin Liu received his PhD in signal and information processing from Shanghai Jiao Tong University in 2015. Currently, he is a research assistant professor in Shanghai Key Laboratory of Intelligent Sensing and Recognition, Shanghai Jiao Tong University. His main research interests include SAR/PolSAR image understanding and information mining, particularly spatial information analysis, segmentation and classification, multitemporal image analysis, target detection and recognition, and multisensor data joint interpretation.

Zenghui Zhang received his PhD in information and communication engineering from the National University of Defense Technology (NUDT), Changsha, China, in 2008. From 2008 to 2012, he was a lecturer in the Department of Mathematics and System Science, NUDT. He is currently an associate professor in the School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University. His main research interests include radar signal processing and compressed sensing theory and applications.

Weiwei Guo received his PhD in information and communication engineering from the National University of Defense Technology in 2014. He was a joint PhD student at Queen Mary, University of London, London, United Kingom, from 2008 to 2010. Since 2015, he has been a postdoctorate researcher with Shanghai Jiao Tong University. His main research interests include areas of image and signal processing, computer vision, and pattern recognition.

Wenxian Yu He received his PhD in communication and information system from the National University of Defense Technology (NUDT) in 1993. He is currently in the School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, where he is a Yangtze River scholar distinguished professor and the head of research and was the executive dean from 2009 to 2011. His current research interests include radar target recognition, remote sensing information processing, multisensor data fusion, and integrated navigation system.

CC BY: © The Authors. Published by SPIE under a Creative Commons Attribution 4.0 Unported License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.

Citation Download Citation

Hao Hu, Bin Liu, Zenghui Zhang, Weiwei Guo, and Wenxian Yu "Superpixel generation for synthetic aperture radar imagery using edge-dominated local clustering," Journal of Applied Remote Sensing 12(4), 045006 (16 October 2018). https://doi.org/10.1117/1.JRS.12.045006

Received: 5 June 2018; Accepted: 20 September 2018; Published: 16 October 2018

Access the abstract

JOURNAL ARTICLE
15 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

CITATIONS

Cited by 8 scholarly publications.

Explore citations on Lens.org

KEYWORDS

Synthetic aperture radar

Distance measurement

Image segmentation

Statistical analysis

Roentgenium

Device simulation

Computer simulations

1.

Introduction

2.