2,291
Views
2
CrossRef citations to date
0
Altmetric
Articles

Spectral Clustering on Spherical Coordinates Under the Degree-Corrected Stochastic Blockmodel

, &
Pages 346-357 | Received 09 Nov 2020, Accepted 04 Nov 2021, Published online: 10 Jan 2022

Figures & data

Fig. 1 Histogram of within-community degree distributions from three bipartite networks with size 439×60,635, obtained from (a) a simulation of a SBM, (b) a simulation of a DCSBM, and (c) a real-world computer network (ICL2, see Section 6.2).

Fig. 1 Histogram of within-community degree distributions from three bipartite networks with size 439×60,635, obtained from (a) a simulation of a SBM, (b) a simulation of a DCSBM, and (c) a real-world computer network (ICL2, see Section 6.2).

Fig. 2 Scatterplots of the two-dimensional ASE of a simulated DCSBM with (a) K = 4, and (b) K = 2. also highlights the true and estimated latent position for six nodes, with the corresponding 50%, 75%, and 90% contours from the ASE-CLT, and the estimated latent positions x̂1,l for x1 from simulated DCSBM adjacency matrices Al,l=1,,1000.

Fig. 2 Scatterplots of the two-dimensional ASE of a simulated DCSBM with (a) K = 4, and (b) K = 2. Figure 2(b) also highlights the true and estimated latent position for six nodes, with the corresponding 50%, 75%, and 90% contours from the ASE-CLT, and the estimated latent positions x̂1,l for x1 from simulated DCSBM adjacency matrices Al, l=1,…,1000.

Fig. 3 Plots of x̂i,x˜i=x̂i/||x̂i|| and θ̂i=f2(x̂i), obtained from the two-dimensional ASE of a simulated DCSBM. Joint (green) and community-specific (blue and red) marginal distributions with MLE Gaussian fit are also shown.

Fig. 3 Plots of x̂i, x˜i=x̂i/||x̂i|| and θ̂i=f2(x̂i), obtained from the two-dimensional ASE of a simulated DCSBM. Joint (green) and community-specific (blue and red) marginal distributions with MLE Gaussian fit are also shown.

Fig. 4 Boxplots for N=1000 simulations of a degree-corrected stochastic blockmodel with n=2000 nodes, K = 3, equal number of nodes allocated to each group, and B described in (9), corrected by parameters ρi sampled from a Uniform(0,1) distribution.

Fig. 4 Boxplots for N=1000 simulations of a degree-corrected stochastic blockmodel with n=2000 nodes, K = 3, equal number of nodes allocated to each group, and B described in (9), corrected by parameters ρi sampled from a Uniform(0,1) distribution.

Table 1 Estimated performance for N = 250 simulated DCSBMs and bipartite DCScBMs.

Fig. 5 Estimated performance for N = 250 simulated DCSBMs and DCScBMs, for n{100,200,500,1000,2000}. For bipartite DCScBMs, n{150,300,750,1500,3000}.

Fig. 5 Estimated performance for N = 250 simulated DCSBMs and DCScBMs, for n∈{100,200,500,1000,2000}. For bipartite DCScBMs, n′∈{150,300,750,1500,3000}.

Fig. 6 Boxplots of differences in ARI for N = 250 simulated DCSBMs and DCScBM.

Fig. 6 Boxplots of differences in ARI for N = 250 simulated DCSBMs and DCScBM.

Table 2 Summary statistics for the Imperial College London computer networks.

Fig. 7 ICL2: scatterplot of the leading two dimensions for X̂, X˜ and Θ̂.

Fig. 7 ICL2: scatterplot of the leading two dimensions for X̂, X˜ and Θ̂.

Table 3 Estimates of (d, K) and ARIs for the embeddings X̂,X˜ and Θ̂ for m{30,50} and alternative methodologies.

Supplemental material

Supplemental Material

Download Zip (225.1 KB)