Journal of the American Statistical Association Latest Articles

Open access

Theory and Methods

Latent Space Modeling of Hypergraph Data

Kathryn Turnbull, Department of Mathematics and Statistics, Lancaster University, Lancaster, UK

https://orcid.org/0000-0003-1107-3865

Simón Lunagómez, Departamento de Estadística, Instituto Tecnológico Autónomo de México, Ciudad de México, Mexico

Christopher Nemeth (corresponding author), Department of Mathematics and Statistics, Lancaster University, Lancaster, UK

https://orcid.org/0000-0002-9084-3866

Edoardo Airoldi, Fox School of Business, Temple University, Philadelphia, PA

Received 12 Mar 2020, Accepted 22 Sep 2023, Published online: 11 Dec 2023

https://doi.org/10.1080/01621459.2023.2270750

Figures & data

Fig. 1 Examples of hypergraph datasets. (b) shows co-tagging data and (c) shows a subsample of the coauthorship network of Ji and Jin (2016). The figures were made with R packages HyperG (Marchette 2021) and igraph (Csardi and Nepusz 2006).

Fig. 2 Example of a Čech complex. Left: $B_{r}(u_{i})$ for $\{u_{i} = (u_{i1}, u_{i2})\}_{i=1}^{7}$ in $\mathbb{R}^{2}$. Middle: the graph obtained by taking pairwise intersections. Right: the hypergraph obtained by taking intersections of arbitrary order. The shaded region indicates an order 3 hyperedge.

Fig. 3 Example of an nsRGH (see Definition 3.1) with $U = \{u_{i}\}_{i=1}^{7} = \{(u_{i1}, u_{i2})\}_{i=1}^{7}$. Left: $C_{r_{2}}(U)$. Middle: $C_{r_{3}}(U)$. Right: $\bigcup_{k=2}^{3} D_{r_{k}}^{(k)}(U)$.
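The constructions in Figs. 2 and 3 can be illustrated with a minimal sketch. It assumes points in $\mathbb{R}^{2}$ and closed balls, so that a set of balls of radius $r$ has a common point exactly when the smallest circle enclosing their centres has radius at most $r$. It also assumes that $D_{r_{k}}^{(k)}(U)$ denotes the order $k$ hyperedges of the Čech complex at radius $r_{k}$; Definition 3.1 is not reproduced on this page, so that reading is an assumption. The function names `enclosing_radius` and `hyperedges` are hypothetical and are not taken from the paper or its supplement.

```python
import itertools
import math


def enclosing_radius(points):
    """Radius of the smallest circle containing all points (brute force, R^2 only)."""
    pts = list(points)
    if len(pts) < 2:
        return 0.0
    candidates = []
    # Candidate circles with a pair of points as diameter.
    for p, q in itertools.combinations(pts, 2):
        centre = ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)
        candidates.append((math.dist(p, q) / 2, centre))
    # Candidate circumcircles through triples of non-collinear points.
    for a, b, c in itertools.combinations(pts, 3):
        d = 2 * (a[0] * (b[1] - c[1]) + b[0] * (c[1] - a[1]) + c[0] * (a[1] - b[1]))
        if abs(d) < 1e-12:
            continue  # collinear triple: a pair circle already covers it
        a2, b2, c2 = (x * x + y * y for x, y in (a, b, c))
        centre = (
            (a2 * (b[1] - c[1]) + b2 * (c[1] - a[1]) + c2 * (a[1] - b[1])) / d,
            (a2 * (c[0] - b[0]) + b2 * (a[0] - c[0]) + c2 * (b[0] - a[0])) / d,
        )
        candidates.append((math.dist(centre, a), centre))
    tol = 1e-9
    # The smallest candidate that contains every point is the enclosing circle.
    return min(r for r, ctr in candidates
               if all(math.dist(ctr, p) <= r + tol for p in pts))


def hyperedges(points, radii):
    """Index tuples of order k whose balls of radius radii[k] share a point.

    radii maps each order k to r_k. A constant radius gives the Cech complex
    truncated at the largest order; distinct radii follow the assumed reading
    of the nsRGH construction.
    """
    edges = []
    for k in sorted(radii):
        for idx in itertools.combinations(range(len(points)), k):
            if enclosing_radius([points[i] for i in idx]) <= radii[k]:
                edges.append(idx)
    return edges


# Equilateral triangle with side 1: pairs need r >= 0.5, the triple r >= 1/sqrt(3).
triangle = [(0.0, 0.0), (1.0, 0.0), (0.5, math.sqrt(3) / 2)]
cech = hyperedges(triangle, {2: 0.55, 3: 0.55})
ns = hyperedges(triangle, {2: 0.55, 3: 0.6})
```

In this toy example the single-radius construction yields all three pairwise edges but no order 3 hyperedge, whereas taking $r_{3} > r_{2}$ adds the order 3 hyperedge without changing the pairwise edges.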

Fig. 4 Comparison of theoretical (solid) and Monte Carlo (dashed) estimates of $p(y_{e_{k,i}}^{(g)} = 1 \mid u_{i}, r_{k}, \mu, \Sigma)$ for varying $r_{k}$. We take $\Sigma = \operatorname{diag}(1, 1)$, $\mu = (0, 0)$ and consider connection probabilities for $k = 2, 3, 4$. In the left plot $u_{i} = \mu$ and in the right plot $u_{i} = (1, 2)$. The same study with $\Sigma = \operatorname{diag}(1, 2)$ is provided in Supplement F.3.
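The kind of comparison shown in Fig. 4 can be sketched for the simplest case, $k = 2$. The sketch assumes that the other node's latent coordinate is drawn from $N(\mu, \Sigma)$ with diagonal $\Sigma$, and that an order 2 hyperedge is present when the two balls of radius $r_{2}$ intersect, that is, when $\lVert u_{i} - u_{j} \rVert \le 2 r_{2}$. Under those assumptions, with $u_{i} = \mu$ and $\Sigma = \operatorname{diag}(1, 1)$, the squared distance is $\chi^{2}_{2}$ distributed and the probability is $1 - \exp(-2 r_{2}^{2})$. The function names are hypothetical, and the paper's own theoretical expressions, which also cover $k = 3, 4$, are not reproduced here.

```python
import math
import random


def mc_pair_probability(u_i, r2, mu, sigma_diag, n_samples=20000, seed=0):
    """Monte Carlo estimate of P(||u_i - u_j|| <= 2 * r2) with u_j ~ N(mu, diag(sigma_diag))."""
    rng = random.Random(seed)
    sds = [math.sqrt(v) for v in sigma_diag]  # standard deviations per coordinate
    hits = 0
    for _ in range(n_samples):
        # Diagonal covariance: coordinates are sampled independently.
        u_j = tuple(rng.gauss(m, s) for m, s in zip(mu, sds))
        if math.dist(u_i, u_j) <= 2 * r2:
            hits += 1
    return hits / n_samples


def exact_pair_probability_at_mean(r2):
    """Closed form for u_i = mu and Sigma = diag(1, 1) only (chi-square with 2 d.o.f.)."""
    return 1.0 - math.exp(-2.0 * r2 ** 2)


mu = (0.0, 0.0)
sigma_diag = (1.0, 1.0)
estimate_at_mean = mc_pair_probability(mu, 0.5, mu, sigma_diag)
exact_at_mean = exact_pair_probability_at_mean(0.5)
estimate_off_mean = mc_pair_probability((1.0, 2.0), 0.5, mu, sigma_diag)
```

With enough samples the estimate at $u_{i} = \mu$ should sit close to the closed form, and the estimate at $u_{i} = (1, 2)$ should be smaller because that point lies in a lower-density region of the latent distribution.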

Fig. 5 Comparison between empirical (dashed line) and Poisson approximation (points) of the order $k = 3$ degree distribution conditional on the latent coordinate $u_{i}$. We take $N = 10$, $\mu = (0, 0)$, $\Sigma = \operatorname{diag}(1, 1)$ and evaluate the distribution for $r_{3} \in \{0.1, 0.4, 1.0\}$. The left plot shows $u_{i} = \mu$ and the right plot shows $u_{i} = (1, 2)$. The equivalent figures with $\Sigma = \operatorname{diag}(1, 2)$ and $N = 20$ are given in Supplement F.3.

Fig. 6 Comparison of true and posterior predictive degree distributions for hyperedges of order k = 2 (left) and k = 3 (right). Vertical lines and black dots correspond to the range and median of the probabilities of observing each degree as calculated via the posterior predictive. The observed degree is shown as green triangles.

Fig. 7 Summary of average per-iteration costs after 200 iterations for an MCMC implemented with and without delayed acceptance (DA) for 10 datasets sampled from data regimes (R1) (black, circle) and (R2) (red, triangle). Left: average per-iteration cost in seconds for a DA scheme. Right: ratio of average per-iteration cost for a DA scheme and a scheme without DA. Numbers less than 1 imply DA offers a computational speed-up.

Fig. 8 Visualization of the datasets considered in Section 7. The hyperedges correspond to the observations for each dataset. Each figure shows the full hypergraph (left) and a subset sampled according to a random walk (right).

Fig. 9 Summary of predictive distributions for $N^{*}$ additional nodes for the Grocery dataset. For each measure we report the proportion of predictions which are distance D from the truth, where D is the absolute difference between the predictive and the truth.

Fig. 10 Summary of predictive distributions for $N^{*}$ additional nodes for the coauthorship dataset. For each measure we report the proportion of predictions which are distance D from the truth, where D is the absolute difference between the predictive and the truth.

Fig. 11 Comparison between latent positions for a subset of the coauthorship network on nodes {22, 27, 31}. Left: observed interactions. Middle: traceplot of latent positions for the model with $r_{k} = r$. Right: traceplot of latent positions for our model with $r_{k} > r_{k-1}$.

Ji, P., and Jin, J. (2016), “Coauthorship and Citation Networks for Statisticians,” The Annals of Applied Statistics, 10, 1779–1812. DOI: 10.1214/15-AOAS896.

Marchette. (2021), HyperG: Hypergraphs in R, R package version 1.0.0.

Csardi, G., and Nepusz, T. (2006), “The igraph Software Package for Complex Network Research,” InterJournal, complex systems, 1695.5, 1–9.
