Search in:

Journal of New Music Research Volume 37, 2008 - Issue 2: From Genres to Tags: Music Information Retrieval in the Age of Social Tagging

Submit an article Journal homepage

409

Views

CrossRef citations to date

Altmetric

Original Articles

Autotagger: A Model for Predicting Social Tags from Acoustic Features on Large Music Databases

Thierry Bertin-Mahieux University of Montreal, CanadaCorrespondence[email protected]

Douglas Eck University of Montreal, Canada

François Maillet University of Montreal, Canada

Paul Lamere Sun Microsystems, USA

Pages 115-135 | Published online: 26 Nov 2008

Cite this article
https://doi.org/10.1080/09298210802479250

Sample our Arts journals, sign in here to start your access, latest two volumes FREE to you for 14 days

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
Read this article /doi/full/10.1080/09298210802479250?needAccess=true

Abstract

Social tags are user-generated keywords associated with some resource on the Web. In the case of music, social tags have become an important component of “Web 2.0” recommender systems, allowing users to generate playlists based on use-dependent terms such as chill or jogging that have been applied to particular songs. In this paper, we propose a method for predicting these social tags directly from MP3 files. Using a set of 360 classifiers trained using the online ensemble learning algorithm FilterBoost, we map audio features onto social tags collected from the Web. The resulting automatic tags (or autotags) furnish information about music that is otherwise untagged or poorly tagged, allowing for insertion of previously unheard music into a social recommender. This avoids the “cold-start problem” common in such systems. Autotags can also be used to smooth the tag space from which similarities and recommendations are made by providing a set of comparable baseline tags for all tracks in a recommender system. Because the words we learn are the same as those used by people who label their music collections, it is easy to integrate our predictions into existing similarity and prediction methods based on web data.

Acknowledgement

Many thanks to the members of the CAL group, in particular Luke Barrington, Gert Lanckriet and Douglas Turnbull, for publishing the CAL500 data set and answering our numerous questions. Thanks to the many individuals that provided input, support and comments including James Bergstra, Andrew Hankinson, Stephen Green, the members of LISA lab, BRAMS lab and CIRMMT. Thanks to Joseph Turian for pointing us to the phrase “There's no data like more data” (originally from speech recognition, we believe).

Notes

¹ www.last.fm

²Audioscrobbler. Web Services described at http://www.audio scrobbler.net/data/webservices/.

³Music Information Retrieval Evaluation eXchange; yearly contest pages found at www.music-ir.org.

⁴ www.musicbrainz.org

⁵Of course, real recommenders deal with a more complex situation, caring about novelty of recommendations, serendipity and user confidence among others [see Herlocker et al. (2004) for more details]. However, similarity is essential. We do it on the artist level because the data available to build a ground truth would be too sparse on the album or song level.

⁶ www.allmusic.com

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Related Research Data

Human Computation

Source: Morgan & Claypool Publishers LLC

Random House Business Books, 2006, ISBN 9781905211210

Source: Springer Science and Business Media LLC

Source: Institute of Electrical and Electronics Engineers (IEEE)

A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting

Source: Elsevier BV

Social Tagging and Music Information Retrieval

Source: Informa UK Limited

Labeling images with a computer game

Source: Association for Computing Machinery (ACM)

Aggregate features and ADABOOST for music classification

Source: HAL CCSD

FOLKSONOMY-BASED RECOMMENDER SYSTEMS

Source: Wiley

Multimodal Deep Learning for Music Genre Classification

Source: Ubiquity Press

Hierarchical attentive deep neural networks for semantic music annotation through multiple music representations

Source: Springer Science and Business Media LLC

Music Recommender Systems

Source: Springer US

Music classification by low-rank semantic mappings

Source: Springer Science and Business Media LLC

Evaluating collaborative filtering recommender systems

Source: Association for Computing Machinery (ACM)

Musical genre classification of audio signals

Source: Institute of Electrical and Electronics Engineers (IEEE)

Linking provided by

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Autotagger: A Model for Predicting Social Tags from Acoustic Features on Large Music Databases

Related Research Data

Information for

Open access

Opportunities

Help and information

Autotagger: A Model for Predicting Social Tags from Acoustic Features on Large Music Databases

Abstract

Acknowledgement

Notes

Reprints and Corporate Permissions

Academic Permissions

Related Research Data

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature