3,148
Views
7
CrossRef citations to date
0
Altmetric
Articles

Nyström-based spectral clustering using airborne LiDAR point cloud data for individual tree segmentation

ORCID Icon, , , , , & show all
Pages 1452-1476 | Received 07 Feb 2021, Accepted 08 Jun 2021, Published online: 29 Jun 2021

ABSTRACT

The spectral clustering method has notable advantages in segmentation. But the high computational complexity and time consuming limit its application in large-scale and dense airborne Light Detection and Ranging (LiDAR) point cloud data. We proposed the Nyström-based spectral clustering (NSC) algorithm to decrease the computational burden. This novel NSC method showed accurate and rapid in individual tree segmentation using point cloud data. The K-nearest neighbour-based sampling (KNNS) was proposed for the Nyström approximation of voxels to improve the efficiency. The NSC algorithm showed good performance for 32 plots in China and Europe. The overall matching rate and extraction rate of proposed algorithm reached 69% and 103%. For all trees located by Global Navigation Satellite System (GNSS) calibrated tape-measures, the tree height regression of the matching results showed an value of 0.88 and a relative root mean square error (RMSE) of 5.97%. For all trees located by GNSS calibrated total-station measures, the values were 0.89 and 4.49%. The method also showed good performance in a benchmark dataset with an improvement of 7% for the average matching rate. The results demonstrate that the proposed NSC algorithm provides an accurate individual tree segmentation and parameter estimation using airborne LiDAR point cloud data.

1. Introduction

The research of tree segmentation helps gaining a comprehensive understanding of the complex structure as well as various biophysical processes of forests. Light Detection and Ranging (LiDAR), an active remote sensing technology, can obtain not only high-precision forest spatial structure information but also information under canopy topographic by transmitting laser energy and receiving return signals. This technology has unparalleled advantages of passive remote sensing and is therefore widely applied into forest management and ecosystem research (Hyyppä et al. Citation2000). Individual tree segmentation algorithms using LiDAR data can be mainly divided into two categories. One is surface model methods that utilize the canopy height model (CHM). The other is three-dimensional (3D) methods which utilize information of the whole point cloud (Zhao et al. Citation2014; Lindberg and Holmgren Citation2017). The 3D methods can be further categorized into point-based methods which directly utilize the original or normalized point cloud, and voxel-based methods which project the point cloud into the voxel space in order to increase computational efficiency.

The basic idea of surface model methods is to segment individual trees by deriving a surface model that represents the upper contour of the canopy and identifying the local maxima in the surface as treetops (Lindberg and Holmgren Citation2017). When searching the local maxima in CHM, fixed or variable window sizes that depend on factors such as crown width and stand density (Zimble et al. Citation2003) are used. Popescu, Wynne, and Nelson (Citation2002) determined the different sizes of window by calculating the crown width from the linear regression of tree height. Zimble et al. (Citation2003) utilized a circular search window with a fixed radius according to the measured minimum crown width. Chen et al. (Citation2006) applied marker-controlled watershed segmentation to isolate individual trees using the treetops detected in CHM by variable window sizes. Zhao et al. (Citation2014) used a morphological crown control-based watershed algorithm to isolate individual trees in the closed coniferous forest. Some other surface model methods were based on the existing 2D image segmentation methods (Brandtberg et al. Citation2003).

Clustering methods have been widely used for individual tree segmentation based on point methods. Morsdorf et al. (Citation2004) extracted the local maxima from the digital surface model (DSM) as seed points for k-means clustering analysis to segment individual trees. Gupta, Weinacker, and Koch (Citation2010) compared several improved k-means clustering and hierarchical clustering on crown extraction. It was found that using local maxima as seed points and scaling down the height value could improve results. Furthermore, the results of the hierarchical clustering method were not satisfactory compared to various k-means methods. In addition to clustering methods, there are many other 3D methods developed for individual tree segmentation. Lee et al. (Citation2010) utilized the region growing method from the treetops downward to the whole point cloud. Sačkov et al. (Citation2017) proposed a method that included several restriction rules on the tree height and crown width to increase the probability of detecting the actual tree. Williams et al. (Citation2020) developed a multiclass graph cut algorithm alongside knowledge of crown allometries to complete the individual tree crown detection.

Regarding voxel-based methods, cubes with a given resolution were usually used to construct the voxel cloud. Wang et al. (Citation2016) employed a voxel resolution of 0.5 m × 0.5 m × 0.5 m. They defined the centre of each voxel element within the voxel as the position and the number of points as the weight of that voxel. Ayrey et al. (Citation2017) proposed a layer stacking method, which included the main steps of horizontal layering, k-means clustering, and layer stacking. Shi and Malik (Citation2000) proposed the normalized cut (NCut) method, which divided data into groups based on the spatial similarity between points. On this basis, Reitberger et al. (Citation2009) implemented a new 3D segmentation of individual trees using NCut segmentation. The features were the distance between two voxels as well as the mean intensity and the mean pulse width for each voxel. Yao, Krzystek, and Heurich (Citation2012) used the segmentation results of the NCut method to classify tree species and complete the estimation of stem volume and DBH.

For the surface model methods, the utilization of upper canopy surface with a small data volume makes it efficient even for large scale individual tree segmentation. However, accurate results were projected only for simple, uniform forest types with the loss of understory information. Moreover, the results rely heavily on the priori information, such as the width for the window size to identify local maxima in CHM. 3D methods using the whole point cloud information are expected to produce more accurate individual tree segmentation results, especially for those stands with complex vertical structure. Nevertheless, there are still some challenges: (i) the reliance on the priori information, e.g. the assumed tree position for the initial seed point for k-means clustering; (ii) the restrictions on the data distribution, e.g. more stable performance with Gaussian distributed variables for k-means clustering; (iii) the limitation of algorithm popularity because of computational complexity for large-scale data, e.g. regional growth and NCut methods. Although voxel-based methods have a good performance for the preprocessing of point cloud data, it contributes less to the reduction of the data volume.

Based on the spectral graph theory, spectral clustering methods have recently shown attractive advantages in segmentation problems (Luxburg Citation2007). Unlike the classic k-means clustering approach, there are no restrictions on data distribution for spectral clustering methods (Fowlkes et al. Citation2004). For the iterative two-class cut method (Reitberger et al. Citation2009), spectral clustering has a similar theoretical foundation but offers better effectiveness in multiclass problems (Luxburg Citation2007). Heinzel and Huber (Citation2018) introduced spectral clustering in tree segmentation using dense terrestrial laser scanning data with a good performance. The biggest challenge of the spectral clustering method for LiDAR point cloud data is computational complexity. For a data set with N points, O(N2) memory is needed to construct the similarity matrix, and the complexity of its eigenvectors calculation is O(N3) (Lin and Cohen Citation2010; Ye et al. Citation2016). Besides time consuming, the procedure poses the risk of memory overflow when N is large for the large-scale dense LiDAR point cloud data.

Nyström approximation is an effective technique for the spectral clustering method in the application of large-scale data. It was originally designed to find a numerical solution of the eigenfunction problem (Sloan Citation1981). Using a sampling technique and the numerical approximation theory, the eigenvector calculated for sampled points are extended to unsampled points by interpolation weights (Fowlkes et al. Citation2004). Since different samples produce different approximations of the original similarity matrix, the choice of sampling method is crucial for the method performance (Bouneffouf and Birol Citation2015). Only when the sampled points can extract the inherent feature of the dataset, i.e. the samples are all landmark points, more ideal results can be obtained with less approximation error. Various sampling methods have been therefore proposed. Some are as simple as random sampling (RS) (Fowlkes et al. Citation2004; Cohen et al. Citation2015), which allows every point to have the same probability of being selected as a sample. Other alternative versions include k-means-based sampling (KS) (Zhang, Tsang, and Kwok Citation2008), incremental sampling (IS) (Zhang and You Citation2011), minimum similarity sampling (SS) (Zeng et al. Citation2014), etc. Bouneffouf and Birol (Citation2015) pointed out that minimizing the sum of the squared similarities between the new and the precedent sampling points would maximize the determinant of the sampling similarity matrix, and then minimize the approximation error. Considering the variability and similarity of sampling points, the minimum sum of squared similarities (MSSS) showed a good performance in previous study (Bouneffouf and Birol Citation2015).

In spite of Nyström method can be used to solve the computational burden of eigenvectors, however, the dilemma in big data storage still needs to be solved. In comparison to the traditional voxel-based approach, the super voxel based method provides a more efficient approach to process point cloud data. Li and Sun (Citation2018) used adaptive octree to acquire supporting points, and took the over segmentation results of the region growing as super voxels. Combined with the resampling process, the unorganized LiDAR point cloud was automatically refined.

In this paper, the Nyström-based spectral clustering (NSC) algorithm combined with a mean shift voxelization was applied to the individual tree segmentation for dense LiDAR point cloud data. Additionally, a more efficient sampling method based on the k-nearest neighbour relationship was proposed for the Nyström approximation. The assessment of the segmentation results was conducted in terms of the detection rate and the parameters of tree height. The remainder of this paper is structured as follows. Section 2 introduces the study area, airborne LiDAR data, and reference data. Section 3 describes the details of the proposed segmentation method, key control parameters, and accuracy assessment. Section 4 evaluates the results of individual tree segmentation and tree height accuracy. This section also shows the comparison of small- and large-scale segmentation as well as the analysis of parameter sensitivity. Section 5 discusses the effectiveness of the similarity function and sampling method as well as the effects of different matching methods for the accuracy assessment. Section 6 summarizes all the important findings of this study.

2. Study area and LiDAR data

2.1. Study area and field measurements

As shown in , the three study sites are located in Mengjiagang (130°32′–130°52′ E, 46°20′–46°30′ N), Dagujia (124°44′–125°16′ E, 42°14′–42°27′ N), and Saihanba (117°–117°36′ E, 42°18′–42°36′ N) from north to south. Their terrain is a gentle slope or hilly state. Dominant tree species are the typical larch plantations, including Larix olgensis in Mengjiagang, Larix Kaempferi in Dagujia, and larix principis-rupprechtii in Saihanba. In this study, 18 plots including 1 large plot and 17 standard plots were used to verify the proposed algorithm. In addition, we selected a tile of 1 km2 () in Mengjiagang Forest Farm for the assessment of large area. The tile segmentation results were compared with three plots inside this area.

Figure 1. The location of study sites, the distribution of field plots and testing tile. The green circles are the plots in each Forest Farm. Upper right panel is CHM of the testing tile. Lower right panel and lower left panel are the CHM and the LiDAR point cloud of the large plot in Saihanba Forest Farm, respectively.

Figure 1. The location of study sites, the distribution of field plots and testing tile. The green circles are the plots in each Forest Farm. Upper right panel is CHM of the testing tile. Lower right panel and lower left panel are the CHM and the LiDAR point cloud of the large plot in Saihanba Forest Farm, respectively.

In total, 2782 living trees with a diameter at breast height (DBH) 5 cm were recorded as references. The tree height varied from 4.2 to 28.8 m. summarizes the parameters of the field plots. DBH was measured using a PI-tape with circular object assumption. Crown sizes were measured using a tape on the ground in four directions (i.e. east, west, south, and north), and the mean value was calculated as the crown radius. The centre location of each plot was measured using a differential Global Navigation Satellite System (GNSS). For the plots from #1 to #11 in Mengjiagang Forest Farm, the stem positions were obtained by the tape and calibreated to the LiDAR CHM through post-processing method. The positioning accuracy of each single stem in plots #1–#11 was within 1 m (Liang, Pang, and Chen Citation2020). For the large plot #12 in Saihanba Forest Farm, the stem positions were obtained by Real Time Kinematic (RTK) GNSS system, and the tree heights were estimated using the allometric equation. For the plots #13–#18 in Dagujia Forest Farm, a number of trees were selected and their positions were measured with total station and GNSS, which have obtained the positioning accuracy within 0.5 m. Meanwhile, the tree heights in Mengjiagang and Dagujia Forest Farm were measured using ultrasonic altimeter and laser hypsometer, respectively. Besides, stem densities were categorized into three levels, i.e. high (H), medium (M) and low (L). These stem densities were resulted from selective logging activities during different growth stages of larch plantation management.

Table 1. Characteristics of 18 field plots in three study sites of China.

2.2. Airborne LiDAR data

The CAF-LiCHy airborne observation system was used for the data collection (Pang et al. Citation2016). The Riegl LMS-Q680i scanner was integrated in the CAF-LiCHy system. The wavelength of this system was 1550 nm, with a 3 ns pulse length and 0.5 mrad beam divergence. The max pulse repetition rate of the sensor is 400 kHz. The LiDAR data parameters in the three study areas are shown in , which including the point density, flight time, relative flight height, and average flight speed (). The point density varied from 10 to 35 pts/m2 after taking the overlapped flights into account.

Table 2. LiDAR data parameters of three study sites of China.

2.3. Benchmark dataset in the Alpine Space of Europe

In order to further examine the segmentation capability for different forest types and compare the proposed method with other approaches, we used 14 plots from the NEWFOR website (Eysn et al. Citation2015). These 14 plots are located in 6 different study areas in the Alpine region. The study data are heterogeneous and collected under different conditions for different purpose. The forest types in the study areas are diverse with the coniferous proportion ranging from 28% to 100%. The main tree species include spruce, fir, beech, larch, etc. The statistical description of the forest plots is shown in , where the Plot ID is consistent with the definition by Eysn et al. (Citation2015). More detailed parameters can be found in their work.

Table 3. Statistical description of the 14 available forest plots from the NEWFOR Alpine Space Programme.

3. Methods

3.1. Segmentation method

3.1.1. Overall of the workflow

Our algorithm was based on the spectral clustering method, and its computational dilemma was solved by the mean shift voxelization and Nyström approximation. The mean shift method transformed the LiDAR point cloud into the voxel space and consequently reduced the burden of calculations and storage. The Nyström method that used the sampling technique provided a fast approximation of the eigenvalues and eigenvectors of the similarity matrix. A new sampling procedure, K-nearest neighbour-based sampling (KNNS), was proposed for the Nyström method, which further improved the performance of the sampling process. shows the general workflow of the algorithm, which was implemented using Python programming language. The shaded rectangle in is the procedure of Nyström approximation. The overall framework consists of the following main steps of (i) LiDAR point cloud pre-processing, including mean shift voxelization; (ii) constructing a similarity graph; (iii) calculating the eigenvalues and eigenvectors of the similar matrix; (iv) clustering the rows of the eigenvector matrix; and (v) mapping the results back to LiDAR point clouds and generating individual tree clusters.

Figure 2. Flowchart of individual tree segmentation using Nyström-based spectral clustering method.

Figure 2. Flowchart of individual tree segmentation using Nyström-based spectral clustering method.

3.1.2. Mean shift voxelization

The traditional voxel-based approach transforms the point cloud into cube voxels with a given resolution. Its main advantage is the convenient access to the neighbourhood of each voxel point. However, a small resolution is usually chosen to obtain a forest structure that is highly close to the original LiDAR point cloud, which limited compression of the data volume. The super voxel based method provides alternatively a more flexible and efficient approach to constructing a tighter voxel space. Mean shift is a clustering algorithm that groups points by iteratively shifting each point towards the density maxima within a kernel. It is a robust method with fast and good results that does not need to assume the distribution of data points or the number of clusters. On account of the advantages of mean shift method, we used it to complete the voxelization procedure. As shown in , each voxel was represented by a cluster obtained based on the mean shift. The voxel position was determined by the centre point and the voxel weight is equal to the number of points within it. As a result, mean shift voxelization reduced the amount of data by approximately a factor of 10 ().

Figure 3. Examples of (a) the original LiDAR point cloud of plot 11 (29,671 pts), (b) the clustering result of mean shift method (each colour represents one voxel) and (c) the voxel cloud (each voxel is represented by voxel weight, 2802 pts).

Figure 3. Examples of (a) the original LiDAR point cloud of plot 11 (29,671 pts), (b) the clustering result of mean shift method (each colour represents one voxel) and (c) the voxel cloud (each voxel is represented by voxel weight, 2802 pts).

3.1.3. Constructing the similarity graph

The spectral clustering algorithm offers great flexibility in the definition of a similarity function and the similarity graph. Referring to the work of Luxburg (Citation2007), we selected the suggested K-nearest neighbour (KNN) graph, that is simple to construct a sparse adjacency matrix and is less susceptible to inappropriate parameter choices. Considering the shape of the trees and the properties of the voxels, we used the Gaussian similarity function as follows: (1) s(xi,xj)={eninj(DijXYσxy)2×eninj(DijZσz)2if(xjinKNNofxi)0otherwise(1) In this equation, DijXY and DijZ are the horizontal and vertical Euclidian distances between two voxels xi and xj, respectively. They were assigned different scale factors σxy and σz to satisfy the ellipsoid shape of the tree. The two weighting factors ni and nj are the weights of the voxel xi and xj, that are used to maintain the consistency of the voxel space with the original point cloud.

shows the similarity graphs of two trees. All the circle dots in the figure represent the voxels of the two trees. The big red dots are the base voxels, which are used to construct the similarity graphs. Each base voxel is connected with its K-nearest neighbours by line segment, and the thickness of line segment represents the similarity between the base voxel and the connected voxel. The blue and orange small circle dots represent the K-nearest neighbours of the two base voxels, while the grey small circle dots represent the points that are not their K-nearest neighbours. Different colours are used to render the connection lines for an enhanced display effect. Meanwhile, each similarity graph only connects the base voxel with odd connection points to display neatly. The even connection points have similar similarities with their neighbourhood points.

Figure 4. The similarity graph of different trees. Here, the number of KNN is 50. The big red circle dots represent the base voxels of different similarity graphs. The blue and orange small circle dots represent the K-nearest neighbours of each base voxel. The grey small circle dots represent the voxels that are not the K-nearest neighbours of each base voxel. Connecting lines with different thicknesses and colour represent the Gaussian similarity between the base voxel and the connected points.

Figure 4. The similarity graph of different trees. Here, the number of KNN is 50. The big red circle dots represent the base voxels of different similarity graphs. The blue and orange small circle dots represent the K-nearest neighbours of each base voxel. The grey small circle dots represent the voxels that are not the K-nearest neighbours of each base voxel. Connecting lines with different thicknesses and colour represent the Gaussian similarity between the base voxel and the connected points.

To improve the searching speed of the K-nearest neighbours for the similarity matrix construction of each voxel, we employed the kd-tree data structure to compensate because of the lack of the location index of the voxels under mean shift voxelization. It should be noted that the similarity matrix is not symmetric, since we constructed the unilateral connected K-nearest neighbour graph instead of a symmetrical graph.

3.1.4. Nyström approximation

The eigenvectors and eigenvalues of the similarity matrix were computed using the Nyström method approximatively. The workflow of the Nyström method is shown in . Its principle is illustrated in the following sections.

3.1.4.1. Nyström method

For a data set with N points and a similarity matrix W, the Nyström method divides all the points into the n sampling points and the m remaining points (m=Nn). Then, the similarity matrix W is partitioned as: (2) W=[ABBTC](2) where ARn×n represents the subblock of weights among the sampling points with diagonalization A=UΛUT, BRn×m represents the weights from the sampling points to the remaining points, and CRm×m represents the subblock of weights among the remaining points. Letting W^ and U¯ denote the approximation of W and its eigenvectors respectively, the Nyström extension gives (3) U¯=[UBTUΛ1](3) and (4) W^=U¯ΛU¯T=[UBTUΛ1]Λ[UTΛ1UTB]=[UΛUTBBTBTUΛ1UTB]=[ABBTBTA1B](4)

Thus, the Nyström extension approximates C using BTA1B implicitly. For the airborne LiDAR point cloud data, C is huge because of the large number of points (N). This technique avoids the direct use of matrix C, and greatly reduces the computing and storage burden.

Since the approximate eigenvector U¯ is not necessarily orthogonal, Fowlkes et al. (Citation2004) gave an expression of the orthogonal eigenvectors of W. If A is positively definite, let A1/2 denote the symmetric positive definite square root of A, define S=A+A1/2BBTA1/2, and diagonalize it as S=USΛUST. Then, W^ can be diagonalized as W^=VΛSVT, and (5) V=[ABT]A1/2USΛS1/2(5)

Thus, the computational complexity of eigendecomposition is reduced from the O(N3) for W to the O(n3) for S. To apply the Nyström approximation to the spectral clustering algorithm requires the normalization of D^1/2W^D^1/2, where D^ is the diagonal matrix with entries D^ii=jW^ij. Since (6) d^=W^1=[A1n+B1nBT1m+BTA1B1n]=[ar+brbc+BTA1br](6) where 1 denotes a column vector of ones, ar, brRn represent the row sums of A and B, respectively, bcRm is the column sum of B, the normalization can be given by (7) AijAijd^id^j,i,j=1,,n(7) and (8) BijBijd^id^j+n,i=1,,n,j=1,,m(8)

3.1.4.2. K-nearest neighbour-based sampling (KNNS) method

For each point, the KNNS method first calculates the sum of the similarities between it and its K-nearest neighbours. The sampling procedure starts by assigning the point with the largest sum, i.e. the one that is most similar to its neighbours. Its neighbours are assigned to the remaining point set directly since they can be considered as representation of the sample. The procedure is repeated until all points in the original data set are assigned. The matrices A and B can be constructed simultaneously during the sampling processing. This cannot be implemented by the MSSS sampling method. The reason for that is when a point is selected as a sample, its neighbours will have no chance to be selected again under the principle of the KNNS method. illustrates the sampling procedure using 2-neighbours as an example. Note that matrix A is asymmetric since the constructed KNNgraph is not symmetrical in this design.

Figure 5. Example of the K-nearest neighbour-based sampling (KNNS) method (K = 2). ID denotes the indices of the points; IDknn and SIMIknn denote the indices of the K-nearest neighbours of each point and the corresponding similarity between them respectively; IDsorted is obtained by the sorting ID according to the descending order of each row sum of SIMIknn; S is the indices set of sampling points; the set R contains the indices of the remaining points; the matrices A and B are defined in 3.1.4.1. The sampling process is performed according to the indices in IDsorted. Initially, all elements in IDsorted are coloured in black, indicating no allocation to S or R. In each step, the first black index in IDsorted is put into set S as a new sampling point and is coloured in red. The indices of its K-nearest neighbours and the corresponding similarities can be obtained directly in IDknn and SIMIknn. There are three cases for each K-nearest neighbour: (a) if it already belongs to S, then put the similarity in the relevant position in A (e.g. s42); (b) if it already belongs to R, then put the similarity in the relevant position in B (e.g.s51,s40); (c) if it is neither in S nor R, then add it to R and colour it blue and put the similarity in the relevant position in B (e.g. s53). This process ends until there is no black element in IDsorted.

Figure 5. Example of the K-nearest neighbour-based sampling (KNNS) method (K = 2). ID denotes the indices of the points; IDknn and SIMIknn denote the indices of the K-nearest neighbours of each point and the corresponding similarity between them respectively; IDsorted is obtained by the sorting ID according to the descending order of each row sum of SIMIknn; S is the indices set of sampling points; the set R contains the indices of the remaining points; the matrices A and B are defined in 3.1.4.1. The sampling process is performed according to the indices in IDsorted. Initially, all elements in IDsorted are coloured in black, indicating no allocation to S or R. In each step, the first black index in IDsorted is put into set S as a new sampling point and is coloured in red. The indices of its K-nearest neighbours and the corresponding similarities can be obtained directly in IDknn and SIMIknn. There are three cases for each K-nearest neighbour: (a) if it already belongs to S, then put the similarity in the relevant position in A (e.g. s42); (b) if it already belongs to R, then put the similarity in the relevant position in B (e.g.s51,s40); (c) if it is neither in S nor R, then add it to R and colour it blue and put the similarity in the relevant position in B (e.g. s53). This process ends until there is no black element in IDsorted.

This method can quickly identify sampling points that satisfy the principle of maximizing the determinant of the sampling similarity matrix mentioned in MSSS. After the construction of the KNN graph, the determinant of the sampling similarity matrix A delivered from KNNS is 1, because A is a lower triangular matrix with the diagonal elements of 1. This is the maximum possible determinant and can minimize the error of the Nyström approximation. Furthermore, our method reduces the computational complexity from theO(mN)of the MSSS method toO(KN), where K represents the number of K-nearest neighbours, and Km for a big data set. Another advantage is that the number of samples for the KNNS method is flexible and reliable, since it depends on the structure of the data itself instead of the empirical assumptions.

3.1.5. Individual tree segmentation

After computing the eigenvalues and eigenvectors of the similarity matrix W^, the number of clusters t was chosen by an eigengap heuristic (Luxburg Citation2007), which is a tool that was particularly designed for spectral clustering. Let λ1,,λn denote the eigenvalues of W^, i.e. the ΛS in Equation (5). The goal is to choose the number t so that all eigenvalues λ1,,λt are very small, and λt+1 is relatively large. Then, there is a gap between the tth and (t + 1)th eigenvalue, i.e. |λt+1λt| is relatively large. According to the eigengap heuristic, this gap indicates that the data set contains t clusters.

Then, the k-means method was applied for the normalized rows of the eigenvector matrix composed of the first t eigenvectors. The clusters of individual trees were obtained by mapping the results back to the original point cloud with reference to the indices of labels. For individual tree parameters, the location and height were defined by the spatial coordinate of the highest point, and the mean of the vertical projections of the clusters in x direction and y direction was taken as the crown radius.

3.1.6. Post-processing

In order to improve the segmentation results, the post-processing workflow was designed as . It includes segmentation using NSC method, result merging and filtering, re-segmentation of the filtering points, result filtering and combination of all tested individual trees. The merging and filtering of the segmentation results mainly take into account the distance and height difference of the adjacent trees, the crown shape and the number of points in each tree point cloud. The criteria are as follows:

  1. If the distance between two adjacent individual trees is less than the average crown diameter of the corresponding plot and the elevation of these two trees is less than 10 m (10 m here is empirical value through a number of experiments), they will be merged into an individual tree.

  2. A tree with a crown diameter greater than half value of the tree height will be empirically regarded as an abnormal crown shape, in view of the fact that it is likely under-segmented. The corresponding tree point cloud will be filtered out so that it can be further searched as potential trees in the re-segmentation.

  3. The absolute value of the difference between the East–West crown and the South–North crown is greater than the average of these two values is considered to be an unreasonable result. As the vertical projection of such a canopy usually differs greatly from circle, the corresponding tree point cloud will be filtered out.

Figure 6. Post-processing workflow.

Figure 6. Post-processing workflow.

These filtered trees are regarded as unreasonable results obtained by primary segmentation. It is very likely that the filtered points contain potential tree information that was not found. Therefore, the filtered points are re-segmented for further mining point cloud information. And the same filtering process is performed on the re-segmentation results to ensure that the obtained trees are in a reasonable shape. The final segmentation result is the tested individual trees of the primary segmentation and the re-segmentation. In the following, NSC are used to represent the primary results of the Nyström-based Spectral Clustering method.

3.2. Control parameters and sensitivity analysis

There are four parameters that control the performance of the proposed algorithm. The key parameter of the mean shift method is the bandwidth of the kernel function. Since the voxelization is to create local small cells inside each individual tree, we selected the bandwidth based on the local neighbour distance. The average K-nearest neighbour distance was calculated as the bandwidth, where the number of neighbours was determined as the point cloud density. Specifically, the Python sklearn.cluster.estimate_bandwidth function was used to adaptively determine the voxel resolution. Here, the value of the key parameter quantile of the estimate_bandwidth function represents the ratio of the number of neighbours to the number of samples for the nearest neighbour query. In this study, all points in the original cloud were used as samples, and the point density was regarded as an approximate representation of the number of local neighbours overall. Therefore, the ratio of point density and the number of points was calculated and used as the value of quantile parameter. The derived bandwidth value was further used to realize the adaptive mean shift voxelization.

For the width parameters σxy and σz of the Gaussian similarity function in Equation (1), i.e. the scale factors for the Euclidean distances, σz was set to six times of σxy based on the tree shape in order to make the horizontal and vertical distances the same scale. To find the optimal σxy, the root mean square of commission costs of the 11 plots in Mengjiagang Forest Farm was considered to evaluate the impact of the parameter. The definition of the commission cost is as follows (Wang et al. Citation2016) (9) CommissionCost=Rcommission/Rmatch(9) where the Rcommission and Rmatch are defined by formula (13) and (12) in Section 3.4, respectively. The commission cost is a detection indicator. The lower the value, the higher matching can be obtained. The root mean square of commission costs allows to check the impact of σxy2 on the overall detection accuracy and further determine the optimal parameter value. The results of σxy2 and the optimal value selection can be found in Section 4.4.

Another important parameter is the number of nearest neighbours in the similarity graph, which is defined as K. Since the KNNS method is based on the neighbourhood relationship, different K values will bring different numbers of sampling points in a given voxel space. Thus it will construct the eigenvector matrices with different dimensions. Considering that the number of clusters t is determined by the eigengap heuristic, the dimension of the eigenvector matrix is equal to the upper limit of the number of clusters. Therefore, the value of K ultimately determined the upper limit of the number of final clusters, i.e. the number of trees segmented by the algorithm. To adaptively calculate K in practical application, the relationship between the K values and one of the main attributes of the sampling points, namely the sampling density, was analysed and discussed in Section 4.4.

3.3. Large dataset segmentation

Usually, the airborne LiDAR data covers large area with huge point cloud data volume. Since the computational complexity is O(n3), when cutting n into multiple sub-blocks n1,,npwithni0fori=1,,p, it can be drawn that i=1pni3n3. However, the reprocessing of points near the cutting lines needs to be considered. Excessive cutting may depreciate the first segmentation and result in a heavy burden on reprocessing. Therefore, for tile segmentation, proper cutting can improve efficiency to some extent. In our study, we cut the tile into parts with approximately 300,000 points. The complete tile segmentation process is demonstrated in .

Figure 7. The process of tile segmentation. (a) Divide the tile into parts using a cutting line (the blue dashed line); (b) perform the segmentation method in each part to generate individual trees with different labels, and mark the labels near the cutting line in blue; (c) gather the points with blue labels (the grey points with label 0), and merge other labels in order (labels 3–5 below were updated to 4–6 to follow the upper label 1–3); (d) re-segment the grey points and merge their labels to the final results.

Figure 7. The process of tile segmentation. (a) Divide the tile into parts using a cutting line (the blue dashed line); (b) perform the segmentation method in each part to generate individual trees with different labels, and mark the labels near the cutting line in blue; (c) gather the points with blue labels (the grey points with label 0), and merge other labels in order (labels 3–5 below were updated to 4–6 to follow the upper label 1–3); (d) re-segment the grey points and merge their labels to the final results.

3.4. Accuracy assessment

The accuracy of the algorithm was assessed in terms of both detection rates and tree parameters. The matching method proposed by Eysn et al. (Citation2015) was performed, which mainly based on tree height and horizontal distance. illustrates an example of the matching process for one reference tree. The whole procedure started from the highest detected tree, and searched for the reference trees that satisfied the height and distance criterion in as match candidates. If a farther candidate showed a better height difference (ΔH) and its 2D distance between detection and reference (ΔD) was at a maximum of 2.5 m greater than the nearer one, then it became a better match. This process was repeated until all detected trees have been checked. Finally, the matched reference trees were checked against the surrounding detection trees. If the closest one with the smallest height difference is the matched detection tree previously, these two trees will be treated as a matched pair. Otherwise, the matched reference tree will be removed.

Figure 8. Matching procedure for a reference tree with height of 20 m. Here, the number represents the tree height. (a) The detected trees that are not within the distance criterion of the reference are removed (19 and 23 m). Here, the criterion is ΔD < 5 m since 15 m < H = 20 m ≤ 25 m. (b) The detected trees that do not satisfy the height criterion in are removed (12 m). Here, the criterion is ΔH < 4 m. (c) The detection with 21 m height became the best match since it shows a better ΔH and its distance from the reference (2.2 m) is 0.7 m (<2.5 m) greater than the near detection with 17 m height (1.5 m).

Figure 8. Matching procedure for a reference tree with height of 20 m. Here, the number represents the tree height. (a) The detected trees that are not within the distance criterion of the reference are removed (19 and 23 m). Here, the criterion is ΔD < 5 m since 15 m < H = 20 m ≤ 25 m. (b) The detected trees that do not satisfy the height criterion in Table 3 are removed (12 m). Here, the criterion is ΔH < 4 m. (c) The detection with 21 m height became the best match since it shows a better ΔH and its distance from the reference (2.2 m) is 0.7 m (<2.5 m) greater than the near detection with 17 m height (1.5 m).

Table 4. Criterions of tree height and 2D distance for the matching procedure (Eysn et al. Citation2015).

For a dataset with high stem density, several candidates may be searched using the distance criterion in . This phenomenon results in several more distant candidates initially becoming matches, and these matches are very likely to be removed when the references are final checked. In order to check the effect of the matching methods, we also matched the detection results using the crown radius as an adaptive distance criterion instead of the fixed threshold. Considering the insufficient positioning accuracy of the stems (Section 2.1), a constant value of 1 m is added to the crown radius.

According to the matching procedure above, the segmentation results can be divided into TP (true positive), FN (false negative) and FP (false positive), representing the match, omission and commission, respectively, where TP+FN=Nreference and TP+FP=Ndetection. The extraction rate (Rextraction), matching rate (Rmatch) and commission rate (Rcommission) can be calculated by (10) Rextraction=NdetectionNreference=TP+FPTP+FN(10) (11) Rmatch=NmatchNreference=TPTP+FN(11) (12) Rcommission=NcommissionNdetection=FPTP+FP=1RmatchRextraction(12) (13) R2=i=1n(HiHi¯)(Hi^Hi^¯)i=1n(HiHi¯)2i=1n(Hi^Hi^¯)2(13) (14) RMSE=i=1n(HiHi^)2(14) (15) rRMSE=RMSEWi^¯(15)

In order to estimate the segmentation performance, the R-squared measure (R2), root mean square error (RMSE) and relative root mean square error (rRMSE) of tree height regression were calculated. The calculation methods of these parameters are shown in formulas (13)–(15).Hi is tree height from reference, Hi¯ is the average of Hi, Hi^ is tree height from detection, Hi^¯ is the average of Hi^, n is the number of trees.

For the tile segmentation, the results were assessed by the plots within it. Considering the field data characteristics in Section 2.1, the tree height were evaluated using the field plots from #1 to # 11 in Mengjiagang and the field plots from #13 to #18 in Dagujia, which have field measured tree height for all the tally trees and selected trees. As the tree heights in plot #12 were estimated using DBH, we did not use these plots for the height evaluation. The extraction rate and matching rate were evaluated using field plots from #1 to #12, which have field measured stem positions. As only a part of stem positions were measured for plots from #13 to #18 in Dagujia, these plots were not used for extraction rate and matching rate evaluation.

4. Results

4.1. Individual tree segmentation

shows the segmentation results with different stem densities. Relative to the field-measured data, the best results were obtained for low stem density ((a)). The segmentation for a small tree covered by a large tree is quite difficult, for example the small referenced tree in (c). It is mainly because there is fewer airborne LiDAR point data in this situation. visually displays the detection rates of 11 standard plots and 1 large plot using the crown radius and distance criterion for matching procedure. Compared with distance criterion, the matching method using crown radius has a lower matching rate of each plot because of its rigorous matching rules, especially for the medium and high stem densities.

Figure 9. Examples of segmentation results for different stem densities. Each colour represents one segmented tree, and the black points are the stem positions of field-measured data with different height represents the inventory tree height. (a) Low stem density, (b) medium stem density, and (c) high stem density.

Figure 9. Examples of segmentation results for different stem densities. Each colour represents one segmented tree, and the black points are the stem positions of field-measured data with different height represents the inventory tree height. (a) Low stem density, (b) medium stem density, and (c) high stem density.

Figure 10. Extraction rates and matching rates for different forest plots. The annotations ‘match_cr’ and ‘match_dc’ represent the matching procedure using crown radius and distance criterion, respectively.

Figure 10. Extraction rates and matching rates for different forest plots. The annotations ‘match_cr’ and ‘match_dc’ represent the matching procedure using crown radius and distance criterion, respectively.

For NSC method, shows that the matching rates increase from 54% to 74% (crown radius for matching procedure) as stem density decreases. The lowest extraction rate is found in the medium stem density plot (Rextr.NSC: 102%), which was almost consistent with the overall performance. The matching procedure using crown criterion obtained lower matching rates with 8%, 7% and 7% lower than the matching procedure using distance radius for high, medium and low stem density plots respectively. Overall, NSC algorithm extracted 103% of the reference trees, 70% of which were correctly matched using distance criterion for matching procedure, and 65% of which were correctly matched using crown radius for matching procedure.

Figure 11. Extraction rates and matching rates for different stem density levels. The annotations ‘match_cr’ and ‘match_dc’ represent the matching procedure using crown radius and distance criterion, respectively.

Figure 11. Extraction rates and matching rates for different stem density levels. The annotations ‘match_cr’ and ‘match_dc’ represent the matching procedure using crown radius and distance criterion, respectively.

To observe the performance of the algorithm, we further examined the results by different tree height levels (). Considering the matching rates using crown radius, the best matching rate was obtained as 79% in the layer >20 m for NSC method. The matching rate for the layer 5–20 m is 78% which is also a relatively better result. For the layer 10–15 m and the layer 5–10 m, the matching rates are 44% and 15%. The matching rates are 15%, 54%, 84% and 87% when using distance criterion for the layer 5–10 m, 10–15 m, 15–20 m and >20 m. All are higher than the results obtained by the matching method using crown radius.

Figure 12. Extraction rates and matching rates for different tree height levels. The annotations ‘cr’ and ‘dc’ represent the matching procedure using crown radius and distance criterion, respectively.

Figure 12. Extraction rates and matching rates for different tree height levels. The annotations ‘cr’ and ‘dc’ represent the matching procedure using crown radius and distance criterion, respectively.

4.2. Tree height accuracy evaluation

High tree height accuracy was found for the tree parameters of the matches (). The large plot 12 was excluded because of its insufficient accuracy of the reference height of each stem which were obtained from empirical model rather than measured value. As shown in , the R2 value is 0.87 and the RMSE value is 1.26 m with rRMSE of 6.13% for the matching procedure using distance criterion. The matching procedure using crown radius obtained better results with a higher R2 value of 0.88andalowerRMSEvaluesof1.22mwithrRMSEof5.97%.

Figure 13. Precision of tree height of plots 1–11. (a) For NSC method using crown radius in matching procedure, (b) for NSC method using distance criterion in matching procedure.

Figure 13. Precision of tree height of plots 1–11. (a) For NSC method using crown radius in matching procedure, (b) for NSC method using distance criterion in matching procedure.

Furthermore, different matching procedures were also used to match the height of detected trees with the reference stems positioned by total station in the 13–18 plots (). Compared with the tree position measured by tape and RTK, there are more accurate stem locations obtained by total station which show better results in the tree height matching. For the matching procedure using distance criterion, the R2 is 0.87 and the RMSE is 0.90 m with rRMSE of 4.86%. The matching procedure using crown radius obtained better results with the R2 of 0.89, RMSE of 0.84 m and rRMSE of 4.49%. And the estimated tree height is very close to field measured height.

Figure 14. Precision of tree height of plots 13–18. (a) For NSC method using crown radius in matching procedure, (b) for NSC method using distance criterion in matching procedure.

Figure 14. Precision of tree height of plots 13–18. (a) For NSC method using crown radius in matching procedure, (b) for NSC method using distance criterion in matching procedure.

4.3. Comparison of plot- and tile-scale segmentation

For the tile segmentation (the testing tile in ), total 37,160 trees are obtained. shows the segmentation results of the three plots in plot scale and tile scale, which keeps almost the same height distribution. compares the results of the three plots at different height scales to view the accuracy of the algorithm in tile segmentation. Overall, the matching rate and parameter precision are almost the same at plot and tile scales. Even the plot scale can obtain a slightly higher matching rates compared with the tile scale. It implies that our algorithm is independent of scale, and can perform well for large-scale LiDAR point cloud data. shows segmentation results of 3# and 7# sample plot in tile scale.

Figure 15. Height distribution of the segmentation results for the three plots at plot scale and tile scale.

Figure 15. Height distribution of the segmentation results for the three plots at plot scale and tile scale.

Figure 16. Comparison of tile and plot segmentation results. The annotations ‘NSC’ represent the proposed method. The annotations ‘cr’ and ‘dc’ represent the matching procedure using crown radius and distance criterion. The annotations ‘Plot’ and ‘Tile’ represent the results in plot and tile scale.

Figure 16. Comparison of tile and plot segmentation results. The annotations ‘NSC’ represent the proposed method. The annotations ‘cr’ and ‘dc’ represent the matching procedure using crown radius and distance criterion. The annotations ‘Plot’ and ‘Tile’ represent the results in plot and tile scale.

Figure 17. Segmentation results of the 3# and 7# plots in tile scale. The raster image is part of the CHM of the tile shown in , which includes the 3# and 7# sample plots. The red triangles are the position of the individual trees after segmentation, and the yellow circles indicate the crown width of the trees. Two large blue boxes represent the extent of the 3# and 7# sample plots. The point cloud shows the final segmentation results of the two plots, and different colours represent different individual trees.

Figure 17. Segmentation results of the 3# and 7# plots in tile scale. The raster image is part of the CHM of the tile shown in Figure 1, which includes the 3# and 7# sample plots. The red triangles are the position of the individual trees after segmentation, and the yellow circles indicate the crown width of the trees. Two large blue boxes represent the extent of the 3# and 7# sample plots. The point cloud shows the final segmentation results of the two plots, and different colours represent different individual trees.

4.4. Comparison with other segmentation methods

The direct comparison results with other methods using the benchmark dataset are shown in . The descriptions of methods #1–#8 can be found in Eysn et al. (Citation2015). Compared with other 8 methods, the matching rate of the proposed NSC method (RMSmatch: 54%) is higher than the other methods. Meanwhile, although methods #5 and #6 have similar matching rates with the proposed NSC method, they have much larger extraction rate, which means these two methods have very high risk of commission errors.

Figure 18. Detection results of different segmentation methods using benchmark dataset in the Alpine Space of Europe. #1–#8 correspond to the methods described by Eysn et al. (Citation2015). NSC represents the proposed method.

Figure 18. Detection results of different segmentation methods using benchmark dataset in the Alpine Space of Europe. #1–#8 correspond to the methods described by Eysn et al. (Citation2015). NSC represents the proposed method.

In , the detection results are summarized by forest type. This comparison was only conducted with the overall results of other eight methods because only the quantitative value of the overall accuracy is available in the work of Eysn et al. (Citation2015). The best result of NSC method was found for single-layered mixed forests (SL/M), showing a lower RMSextr. (132%) and a higher RMSmatch (58%) than those values of other 8 methods. The performance for single-layered coniferous forests (SL/C) is also satisfactory, with a better RMSmatch (67%) and a worse RMSextr. (117%). The multi-layered mixed forests (ML/M) and multi-layered coniferous forests (ML/C) show similar results with higher extraction rates, while the RMSmatch values are 8% and 13% higher than those values of other eight methods.

Figure 19. Detection results by forest type of different segmentation methods using benchmark dataset in the Alpine Space of Europe. Overall_8 represents the overall result of other eight methods. The forest class corresponds to single or multi-layered (SL or ML)/mixed or coniferous (M or C).

Figure 19. Detection results by forest type of different segmentation methods using benchmark dataset in the Alpine Space of Europe. Overall_8 represents the overall result of other eight methods. The forest class corresponds to single or multi-layered (SL or ML)/mixed or coniferous (M or C).

4.5. Parameter sensitivity

and show the results of sensitivity analysis using the 11 plots in Mengjiagang Forest Farm. Regarding the width parameters σxy of the Gaussian similarity function in Equationequation (1), shows the root mean square value of the commission cost when varying the values of σxy2, and the optimal choice is σxy2=10m2, i.e. σxy3.16m. Accordingly, σz is taken as 18.96 m. With respect to the value of K in the similarity graph, shows the changes of sampling density values of the 11 plots in Mengjiagang when varying the K from 3 to 50. Regardless of the characteristics of plots, there is a fixed relationship between the value of K and the density of the sampling points (the red dotted line in ). The detail is as follows: (16) den=13,045.24K0.87(16) where den (num/ha) represents the density of the sampling points, K denotes the number of nearest neighbour in the KNN method. The main attributes of the point cloud data considered are the point density and plot area. The voxelization process removed the former effect using the bandwidth determined by the point density. The density of the sampling points, an area-independent attribute, eliminated the effect of the plot area. In practical application, the values of den can be calculated based on the number of local maxima in the CHM. Then the values of K can be adaptively obtained by Equationequation (16).

Figure 20. Influence of σxy2 on the values of root mean square of commission cost for the 11 plots in Mengjiagang Forest Farm. The optimal value (red dot) appears near σxy2=10m2.

Figure 20. Influence of σxy2 on the values of root mean square of commission cost for the 11 plots in Mengjiagang Forest Farm. The optimal value (red dot) appears near σxy2=10m2.

Figure 21. Influence of K (the number of neighbours) on the values of den (the density of the sampling points) for the 11 plots in Mengjiagang Forest Farm.

Figure 21. Influence of K (the number of neighbours) on the values of den (the density of the sampling points) for the 11 plots in Mengjiagang Forest Farm.

5. Discussion

5.1. Voxel weights for the similarity function

The voxel weights introduced in the similarity function have an important influence on the algorithm. In the comparison of , voxel weights produced better matching rates both in the overall results and in every stem density class for each matching method. Although the matching method using crown radius has lower matching rates, it can obtain higher precision of tree height.

Table 5. Comparison of the segmentation results for different similarity functions using different matching methods.

illustrates the reason for the improvement by introducing voxel weights in the similarity function. The similarity function without voxel weights produced unconnected clusters, such as the abnormal individuals coloured in yellow in (a). The distributions of the sampling points generated by different similarity functions reveal the essence of this problem ((c,d)). The density-based voxelization results in a few but heavy voxels in dense local areas. If the weights are neglected, these areas will become sparse in the voxel cloud (the nonuniform distribution in (c)). For the sampling process based on the neighbour relationship, these remote but representative voxels will not be remained, as they can be represented by other light but dense samples. Such unreasonable sampling ((c)) leads to the problem in (a). On the contrary, voxel weights produce reasonable sampling results (the uniform distribution in (d)) and thus eliminate the disconnection ((b)).

Figure 22. Segmentation results for the same part of plot 11 produced by (a) the similarity function without voxel weights, occurring an unconnected individual (the yellow cluster), and (b) the similarity function with voxel weights, producing no abnormal cluster. Different colours represent different segmented trees. (c) and (d) are the sampling points generated in plot 11 using the similarity function without and with voxel weights, respectively. Here, the grey cube represents the boundary of the plot.

Figure 22. Segmentation results for the same part of plot 11 produced by (a) the similarity function without voxel weights, occurring an unconnected individual (the yellow cluster), and (b) the similarity function with voxel weights, producing no abnormal cluster. Different colours represent different segmented trees. (c) and (d) are the sampling points generated in plot 11 using the similarity function without and with voxel weights, respectively. Here, the grey cube represents the boundary of the plot.

5.2. Comparison of different sampling methods

compares the detection results of different sampling methods of the plots #1–#12 using different matching methods. Due to the random characteristic of the MSSS method (Bouneffouf and Birol Citation2015), fixed parameters can also produce various outcomes. The results presented here are optimal for multiple experiments. The quantitative comparison in shows that KNNS obtained higher matching rates not only in the overall results but also in different stem density classes for every matching method. The matching tree parameters of the two methods are approximately the same for a certain matching method. In addition, the KNNS method can achieve more stable results with faster calculation (Section 3.1.4.2). Hence this method is more feasible in practical applications.

Table 6. Comparison of the segmentation results for different sampling methods using different matching methods.

5.3. Effects of matching methods

The matching method using crown radius can improve the tree height precision at the cost of a small reduction in matching rates in comparison with the matching method using distance criterion ( and ), especially in the plots with medium and high density. The problem is mainly due to the fact that the fixed threshold of might be too large for high stem density plots. It could search many candidates especially when the commission rate is relatively high. In the final check of the reference, the farther matched candidates are likely to be filtered out. The crown radius can get more reasonable match trees compared with the distance criterion, but a match may not be searched in a more rigorous range of crown radius which results in the lower matching rates. The matched trees in the range of crown radius are the most reasonable results, and the accuracy of the tree height can be approved. In addition, the insufficient accuracy of the field stem position may be another reason of lower matching rates using crown radius. It can be inferred that more satisfactory results could be expected with higher accurate field single stem positioning.

6. Conclusions

This study developed a new individual tree segmentation method based on airborne LiDAR point cloud data directly. We used the mean shift voxelization and Nyström approximation for the spectral clustering method to isolate individual trees. By using the similarity graph with voxel weight and the K-nearest neighbour-based sampling (KNNS) scheme, the proposed algorithm not only reduced the storage and calculation complexity of the spectral clustering algorithm but also maintain its good segmentation performance. In the study sites with various levels of LiDAR point density and tree stem density, our algorithm extracted 103% of the reference trees, 69% of which were correctly matched using crown radius criterion. The matching method using crown radius is a more adaptive choice than distance criterion because it provides more reasonable linkage, especially for the high stem density plots. For all trees from 11 plots with GNSS calibrated tape-measured locations in Mengjiagang Forest Farm, the tree height regression of the matching results shown an R2 of 0.88 and a RMSE of 1.22 m. For all trees from 6 plots with GNSS calibrated total-station measured locations in Dagujia Forest Farm, the R2 and RMSE values of tree height regression reached 0.89 and 0.84 m. The consistence between plot and tile segmentation indicates that the algorithm can provide stable and favourable results for large-scale airborne LiDAR data. In comparison with other sampling methods using a benchmark dataset, the proposed KNNS method also shows better performance. Moreover, the introduction of voxel weights in the similarity function effectively solved the problem of abnormal clustering.

Supplemental material

Supplementary_Material

Download Zip (576.6 KB)

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The relevant code of this research is available at https://github.com/limingado/NSC/tree/v1.0.0.

Additional information

Funding

This study was funded by the National Key Research and Development Program of China [grant number 2017YFD0600404] and Natural Science Foundation of China [grant number 41871278].

Notes

Note: The stem density classes correspond to high (H), medium (M) and low (L) levels.

a Only selected trees with total station location are included for these plots.

Note: Corresponding Latin names: spruce (Picea abies), fir (Abies alba), beech (Fagus sylvatica), Scots pine (Pinus sylvestris), larch (Larix decidua), sycamore (Acer pseudoplatanus), elm (Ulmus glabra), and poplar (Populus nigra). These plots include forests of single or multi-layered (SL or ML)/mixed or coniferous (M or C).

Note: ΔH and ΔD represent the height difference and 2D distance between detection and reference.

Note: NSCvw and NSC0 represent the Nyström-based spectral clustering with and without the voxel weights in the similarity function. Rmatch represents the overall performance and the Rmatch with H, M, L correspond to the stem density class in .

References