Search in:

Applied Artificial Intelligence

An International Journal

Volume 29, 2015 - Issue 8

Submit an article Journal homepage

Free access

291

Views

CrossRef citations to date

Altmetric

Original Articles

Representative Selection in Nonmetric Datasets

Elad LiebmanComputer Science Department, The University of Texas at Austin, Austin, Texas, United StatesCorrespondence[email protected]
View further author information

Benny ChorThe Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, IsraelView further author information

Peter StoneComputer Science Department, The University of Texas at Austin, Austin, Texas, United StatesView further author information

Pages 807-838 | Published online: 24 Sep 2015

Cite this article
https://doi.org/10.1080/08839514.2015.1071092
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF

Figures & data

FIGURE 1 Clustering vs. representative selection. (a) When applying k-medoids, k = 77 clusters are required to satisfy the distance condition. (b) A better representative set does so with only 13 representatives.

Table

Display Table

Table

Display Table

Table

Display Table

FIGURE 2 Three musical segments as pitch (in MIDI format) over time, along with the musical notation of the first segment (1stViolinSeg).

FIGURE 3 Representative set size percentage from entire set and average representative set distance for three different composers, ten different pieces each, and five different distance criteria. Each column represents data for a different composer; δ-medoids yields the most compact representative set overall while still obtaining a smaller average distance than the k-centers heuristic.

FIGURE 4 The RoboCup 2D Simulation. Three potential movement trajectories for a specific agents are marked.

FIGURE 5 Representative set size percentage from entire set for four different teams, five different game logs each, and five distance criteria. Each column represents game data for a different team. Axes denoting distance are in log-scale.

FIGURE 6 The histograms (plotted as density functions, i.e., counts normalized as percentages) of the average overlap between representative sets found for each method for the same data under different permutations (overlap measured in %). For k-medoids, δ-medoids, and the k-centers heuristic, in more than 90% of the datasets, there was a > 90% average overlap. Spectral clustering yields drastically less consistent representation sets. The overlaps observed are almost exactly the same, implying that the expected extent of overlap depends more on the structure of the data than on the type of randomization the algorithm employs.

FIGURE 7 Representative set size percentage from entire set and average representative set distance for four different multivariate Gaussian distributions from which the samples are drawn, 20 different experiments each, and four different distribution values. Each column represents data for a different distribution; δ-medoids yields the most compact representative set overall while still obtaining a smaller average distance than the k-centers heuristic.

FIGURE 8 Two segments for example, in musical notation.

FIGURE 9 Same two segments, plotted as midi pitch over time.

FIGURE 10 Two movement segments. Each coordinate in the trajectory is labeled with its timestamp in the trajectory . Both segments begin with a long sprint toward one direction and then a sequence of small steps in the opposite direction (scales are × 10).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Representative Selection in Nonmetric Datasets

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

Representative Selection in Nonmetric Datasets

Figures & data

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date