Introduction

Image segmentation is the process of identifying areas in an image according to some criteria. Tree segmentation, which refers to the identification of trees in images, is required to identify trees in DEMs as these are the areas where ground height must be estimated, using image segmentation techniques.

The aim of this project was to identify trees in orchard DEMs. As no other data was used (such as visible spectrum aerial photographs) and as the DEMs were of low-fidelity, accurate segmentation was more difficult to achieve.

Project Goals

We aimed to investigate the usage of different segmentation techniques on DEMs. There are many characteristics of DEMs that affect segmentation accuracy, such as noise, tree spacing and land gradient, and these were looked at. A secondary aim was to compare the performance of the segmentation techniques with each other

Segmentation Techniques

Simple Linear Iterative Clustering (SLIC)

Superpixels are a collection of similar, neighbouring pixels. Simple Linear Iterative Clustering (SLIC) is a way of grouping pixels into superpixels. SLIC works by electing k centroids in the image, evenly spaced apart. The Euclidean distance between each pixel and each centroid is calculated - some optimisation is used to limit the number of calculations done. Pixels are assigned to their closet centroid, forming a cluster, seen in Fig. 1. The centre of each centroid is calculated and this becomes the new centroid. Pixels are assigned to centroids again and this process repeats until centroid positions no longer change or a certain iteration count has been reached.

Figure 1: An image showing SLIC cluster boundaries in cyan.

Watershed Segmentation

Watershed segmentation is an image processing technique that segments an image based on the concept of watersheds and catchment basins. Markers are placed in low-lying regions (it is up the user to decide what constitues as a low or high region) and a "flood" is performed where neighbouring pixels are assigned to the basin formed by the marker. When clusters meet (in high-lying regions), they form a watershed along their border. These watersheds segment the image.

Data

Two datasets were used. The first of which, provided by Aerobotics, consisted of greyscale, 32-bit float TIFF images (seen in Fig. 2). These DEMs presented a challenge to the segmentation algorithm due to their low fidelity (the height profile of a sample of the DEM can be seen in Fig. 3). The second set of DEMs were produced by the DEM Generation subproject (seen in Fig. 4). The aim of this was to create a variety of DEMs to test the segmentation algortihm on different types of terrain.

Figure 2: A DEM of an orchard.

Figure 3: The height profile from a sample of a DEM.

Figure 4: A synthetic DEM from the DEM Generation subproject,

System Framework

A series of steps was required to perform the tree segmentation operation. DEMs underwent preprocessing to increase to increase the accuracy of segmentation. A system diagram can be in Fig. 5.

Figure 5: System diagram showing the preprocessing, segmentation and evaluation frameworks.

Evaluation Framework

To evaluate segmentation, the results produced from this system must be compared with a ground truth, which is an image that has already been segmented with a higher accuracy than what can be produced by the system. There were 4 tests performed:

Tree centre detection - in this test, the Euclidean distance between each calculated tree centre and the actual tree centre was obtained.

Sorensen-Dice coefficient - this test produces the ratio between the number of correct tree identifications (true positives) and the number of incorrect tree identifications (false positives and negatives).

Intersection Over Union (IOU) - this test calculated the accuracy of tree segmentation, by considering the percentage of a tree thats lies outside the range of the ground truth (an illutration can be seen in Fig. 6). Overall IOU was done on a per pixel basis. IOU per tree was calculated using bounding boxes.

Qualitative evaluation - this was a visual inspection of the DEM.

Figure 6: The IOU equation.

Results

Marker Detection

Marker detection was better in disperse orchards. The window size of the local maxima dectection operation, compared to the spacing of the trees, would have affected this.

Terrain	Total Trees	% of Trees Identified	Mean Distance from Centre (in Pixels)	Standard Deviation
Flat	711	98.312	0.308	0.325
Gentle	530	95.849	0.350	0.440
Steep	540	95.926	0.376	0.430
Large-hilled	466	82.403	1.257	2.366
Small-hilled	568	94.014	0.403	0.479

Segmentation

The algorthm performed better in smooth trees (seen in Fig. 7) as watershed is sensitive noise. The gradient of terrain was also shown to have an impact on segmentation accuracy, as the algortihm produced more accurate segments in the flat terrain, where 73% of trees have an IOU of 70% or more.

Figure 7: A series of graphs showing the percentages of trees that obtained a certain IOU percentage (shown in the legend), when using different subcategories of terrains in the flat terrain (a), gentle terrain (b), steep terrain (c), large-hilled terrain (d) and small-hilled terrain (e).

From Fig. 8 it can be seen that the algotihm produces many false positive segments in small-hilled terrain, that is not shown in Fig. 7, that results in its overall IOU (24.3%) being much lower than its mean IOU per tree (80.2%), in one case.

a

b

c

d

Figure 8: A series of images showing the tree masks produced, and their IOU percentage (as shown in the legend above), for the flat terrain (a), steep terrain (b), large-hilled terrain (c) and small-hilled terrain (d).

SLIC versus Watershed

From Fig. 9, it can be seen that segmentation accuracy is best when using both SLIC and watershed. SLIC alone performs better than watershed alone, as watershed is more sensitive to noise. As SLIC has a much larger execution time than watershed, watershed alone can used in time sensitive circumstances.

Figure 9: A series of graphs showing the percentages of trees that obtained a certain IOU percentage (shown in the legend), when using a certain segmentation technique, for the flat terrain (a), gentle terrain (b) and small-hilled terrain (c).

Conclusions

The results of the segmentation varied depending on the DEMs used. In a best case scenario, where trees are smooth topped, a mean IOU per tree of 88.16% can be achieved while in the worst case scenario, in a DEM with noisy, compacted trees on a large hill, a mean IOU per tree of 54.3% can be achieved. As a result, this system may be usable on orchards with spaced-out identical trees. However, more work is required to make it robust enough to operate with uneven terrain. Using a combination of SLIC and watershed has been shown to be more effective than using only one of them.

PLANER

Tree Segmentation