798
Views
0
CrossRef citations to date
0
Altmetric
Articles

Posterior contraction rate of sparse latent feature models with application to proteomics

, ORCID Icon, , &
Pages 29-39 | Received 13 Mar 2020, Accepted 08 Aug 2021, Published online: 05 Sep 2021

Figures & data

Figure 1. An example of a tree structure T, which is a directed graph with random variables at the nodes (marked as circles). Entries of the kth column of Z, zjk's, are at the leaves. The lengths of all edges of T, ti's and ηl's, are marked on the figure. In particular, ηl's represent the lengths between each leaf (zjk, shaded nodes) and its parent node (ζl, dotted nodes). The total edge lengths S(T) is the summation of the lengths of all edges of T. In this example, S(T)=1i6ti+(2η1+η2+3η3+η4+4η5+η6). The condition in case (2) of Lemma 3.3 in Section 3 means inf1l6ηlη0 for some η0>0.

Figure 1. An example of a tree structure T, which is a directed graph with random variables at the nodes (marked as circles). Entries of the kth column of Z, zjk's, are at the leaves. The lengths of all edges of T, ti's and ηl's, are marked on the figure. In particular, ηl's represent the lengths between each leaf (zjk, shaded nodes) and its parent node (ζl, dotted nodes). The total edge lengths S(T) is the summation of the lengths of all edges of T. In this example, S(T)=∑1≤i≤6ti+(2η1+η2+3η3+η4+4η5+η6). The condition in case (2) of Lemma 3.3 in Section 3 means inf1≤l≤6ηl≥η0 for some η0>0.

Table 1. Simulation results for different combinations of n and p.

Figure 2. Simulation results for different combinations of n and p. Plot of ZnZnTZnZnT/ϵn versus n, where ZnZnTZnZnT is the spectral norm for the residual of the similarity matrix, and ϵn is the posterior contraction rate defined in Theorem 3.4. The ratio converges to zero as n increases, demonstrating the theoretical results. The vertical error bars represent one standard error.

Figure 2. Simulation results for different combinations of n and p. Plot of ‖ZnZnT−Zn∗Zn∗T‖/ϵn versus n, where ‖ZnZnT−Zn∗Zn∗T‖ is the spectral norm for the residual of the similarity matrix, and ϵn is the posterior contraction rate defined in Theorem 3.4. The ratio converges to zero as n increases, demonstrating the theoretical results. The vertical error bars represent one standard error.

Table 2. Simulation results for different s.

Figure 3. The inferred binary feature matrix Z^ for the TCGA RPPA dataset. The dataset consists of 100 patients, with 20 patients for each of the 5 cancer type, BRCA, DLBC, GBM, KIRC and LUAD. A shaded gray rectangle indicates the corresponding patient j possesses feature k, i.e., the corresponding matrix element Z^jk=1. The columns are in descending order of the number of objects possessing each feature. The rows are reordered for better display.

Figure 3. The inferred binary feature matrix Z^ for the TCGA RPPA dataset. The dataset consists of 100 patients, with 20 patients for each of the 5 cancer type, BRCA, DLBC, GBM, KIRC and LUAD. A shaded gray rectangle indicates the corresponding patient j possesses feature k, i.e., the corresponding matrix element Z^jk=1. The columns are in descending order of the number of objects possessing each feature. The rows are reordered for better display.
Supplemental material

Supplemental Material

Download PDF (275.5 KB)