13 datasets found

Tags: music information retrieval

Filter Results
  • Realbook dataset

    The Realbook dataset contains 2,846 jazz songs based on band-in-a-box files, with time-aligned beat and chord information.
  • Chinese Pop Songs Dataset

    A dataset of 18,451 Chinese pop songs with precise syllable-note alignment.
  • Lyrics-Melody Parallel Corpus

    A large-scale lyrics-melody dataset with 18,451 Chinese pop songs and 644,472 lyrics-context-melody triples.
  • Million Song Dataset

    Million Song Dataset is a collection of audio features and metadata for a million contemporary pop songs. Instead of storing any audio, the dataset consists of features derived...
  • Audio Set

    The Audio Set dataset contains information of over 2 million audio soundtracks drawn from general YouTube videos.
  • Wikifonia Lead Sheets and Jazz Solos

    The dataset consists of 4,235 lead sheets from the Wikifonia database containing melodies from genres including (but not limited to) jazz, folk, pop, and classical, and 120...
  • MSD

    The dataset used in this paper is a collaborative filtering dataset, specifically the Million Song Dataset (MSD), which contains listening counts from 1 million users on 50,000...
  • Impro-Visor corpus

    The Impro-Visor corpus of 2,612 chord progressions, mainly jazz standards, but also including some blues, jazz-blues, modal jazz, and pop tunes.
  • MoisesDB

    MoisesDB: A Dataset for Source Separation beyond 4-stems
  • Clotho

    Automated audio captioning is a cross-modal translation task for describing the content of audio clips with natural language sentences.
  • EFSC dataset

    The EFSC dataset is transformed into a piano-roll representation with a resolution of 1/8th note.
  • Mozart piano music dataset

    The Mozart piano music dataset ( [29], comprising 13 piano sonatas with more than 106,000 notes) is used for pre-training the GAE portion of the RGAE.
  • EFSC subset

    The EFSC subset (comprising a total of 54,308 note events) of the Essen Folk Song Collection (EFSC) constitutes the data for the actual training and evaluation.
You can also access this registry using the API (see API Docs).