Figures & data
FIGURE 1 Clustering vs. representative selection. (a) When applying k-medoids, k = 77 clusters are required to satisfy the distance condition. (b) A better representative set does so with only 13 representatives.
![FIGURE 1 Clustering vs. representative selection. (a) When applying k-medoids, k = 77 clusters are required to satisfy the distance condition. (b) A better representative set does so with only 13 representatives.](/cms/asset/020a6d66-b626-4783-9cea-7a59b3047490/uaai_a_1071092_f0001_oc.jpg)
Table
Table
Table
FIGURE 2 Three musical segments as pitch (in MIDI format) over time, along with the musical notation of the first segment (1stViolinSeg).
![FIGURE 2 Three musical segments as pitch (in MIDI format) over time, along with the musical notation of the first segment (1stViolinSeg).](/cms/asset/ead0098e-db1f-4a22-9657-8132ec19c1e8/uaai_a_1071092_f0002_oc.jpg)
FIGURE 3 Representative set size percentage from entire set and average representative set distance for three different composers, ten different pieces each, and five different distance criteria. Each column represents data for a different composer; δ-medoids yields the most compact representative set overall while still obtaining a smaller average distance than the k-centers heuristic.
![FIGURE 3 Representative set size percentage from entire set and average representative set distance for three different composers, ten different pieces each, and five different distance criteria. Each column represents data for a different composer; δ-medoids yields the most compact representative set overall while still obtaining a smaller average distance than the k-centers heuristic.](/cms/asset/e2841346-ddbe-4fee-96ed-bb05083419ba/uaai_a_1071092_f0003_oc.jpg)
FIGURE 4 The RoboCup 2D Simulation. Three potential movement trajectories for a specific agents are marked.
![FIGURE 4 The RoboCup 2D Simulation. Three potential movement trajectories for a specific agents are marked.](/cms/asset/bf127d48-1bf5-44f4-a722-7001420b3125/uaai_a_1071092_f0004_oc.jpg)
FIGURE 5 Representative set size percentage from entire set for four different teams, five different game logs each, and five distance criteria. Each column represents game data for a different team. Axes denoting distance are in log-scale.
![FIGURE 5 Representative set size percentage from entire set for four different teams, five different game logs each, and five distance criteria. Each column represents game data for a different team. Axes denoting distance are in log-scale.](/cms/asset/be57abb6-162c-4b75-b732-cdf04de38f47/uaai_a_1071092_f0005_oc.jpg)
FIGURE 6 The histograms (plotted as density functions, i.e., counts normalized as percentages) of the average overlap between representative sets found for each method for the same data under different permutations (overlap measured in %). For k-medoids, δ-medoids, and the k-centers heuristic, in more than 90% of the datasets, there was a > 90% average overlap. Spectral clustering yields drastically less consistent representation sets. The overlaps observed are almost exactly the same, implying that the expected extent of overlap depends more on the structure of the data than on the type of randomization the algorithm employs.
![FIGURE 6 The histograms (plotted as density functions, i.e., counts normalized as percentages) of the average overlap between representative sets found for each method for the same data under different permutations (overlap measured in %). For k-medoids, δ-medoids, and the k-centers heuristic, in more than 90% of the datasets, there was a > 90% average overlap. Spectral clustering yields drastically less consistent representation sets. The overlaps observed are almost exactly the same, implying that the expected extent of overlap depends more on the structure of the data than on the type of randomization the algorithm employs.](/cms/asset/e012d164-32b6-4e44-8a8e-e0e3ff167d8a/uaai_a_1071092_f0006_oc.jpg)
FIGURE 7 Representative set size percentage from entire set and average representative set distance for four different multivariate Gaussian distributions from which the samples are drawn, 20 different experiments each, and four different distribution values. Each column represents data for a different distribution; δ-medoids yields the most compact representative set overall while still obtaining a smaller average distance than the k-centers heuristic.
![FIGURE 7 Representative set size percentage from entire set and average representative set distance for four different multivariate Gaussian distributions from which the samples are drawn, 20 different experiments each, and four different distribution values. Each column represents data for a different distribution; δ-medoids yields the most compact representative set overall while still obtaining a smaller average distance than the k-centers heuristic.](/cms/asset/1fa41b69-5e76-42f2-a3a4-f89c67cf6b49/uaai_a_1071092_f0007_oc.jpg)