Million Song Dataset

Submitted by on Dec 11 2019 } Suggest Revision
By: Thierry Bertin-Mahieux, Daniel P.W. Ellis, Brian Whitman, Paul Lamere
Resource Type:
Data Format:


The Million Song Dataset is a freely-available collection of audio features and metadata for a million contemporary popular music tracks. Its purposes are: To encourage research on algorithms that scale to commercial sizes To provide a reference dataset for evaluating research As a shortcut alternative to creating a large dataset with APIs (e.g. The Echo Nestís) To help new researchers get started in the MIR field The core of the dataset is the feature analysis and metadata for one million songs. The dataset does not include any audio, only the derived features. The sample audio can be fetched from services like 7digital, using code provided by Columbia University.
Categorized in: Applications | Music
Post comment