Million Song Dataset

Million Song Dataset is a collection of audio features and metadata for a million contemporary pop songs. Instead of storing any audio, the dataset consists of features derived from the audio, user-song profile data, and genres of songs. We extract L = 10k most popular songs from this dataset, as measured by the number of song-listening events; and m = 400k most active users, as measured by the number of song-listening events.

Data and Resources

Cite this as

Naoto Ohsaka, Riku Togashi (2024). Dataset: Million Song Dataset. https://doi.org/10.57702/4gtshr3o

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.1145/3539618.3591659
Citation
  • https://doi.org/10.48550/arXiv.1511.00792
  • https://doi.org/10.48550/arXiv.1904.07154
  • https://doi.org/10.48550/arXiv.1603.05359
  • https://doi.org/10.48550/arXiv.1706.03993
  • https://doi.org/10.1145/3125486.3125492
  • https://doi.org/10.48550/arXiv.1810.01807
  • https://doi.org/10.1145/3273024.3273035
  • https://doi.org/10.48550/arXiv.1911.04827
  • https://doi.org/10.48550/arXiv.1609.04243
  • https://doi.org/10.48550/arXiv.2001.10102
  • https://doi.org/10.48550/arXiv.1606.00298
Author Naoto Ohsaka
More Authors
Riku Togashi
Homepage http://www.music.info/million-song-dataset/