-
Realbook dataset
The Realbook dataset contains 2,846 jazz songs based on band-in-a-box files, with time-aligned beat and chord information. -
Chinese Pop Songs Dataset
A dataset of 18,451 Chinese pop songs with precise syllable-note alignment. -
Lyrics-Melody Parallel Corpus
A large-scale lyrics-melody dataset with 18,451 Chinese pop songs and 644,472 lyrics-context-melody triples. -
Million Song Dataset
Million Song Dataset is a collection of audio features and metadata for a million contemporary pop songs. Instead of storing any audio, the dataset consists of features derived... -
Wikifonia Lead Sheets and Jazz Solos
The dataset consists of 4,235 lead sheets from the Wikifonia database containing melodies from genres including (but not limited to) jazz, folk, pop, and classical, and 120... -
Impro-Visor corpus
The Impro-Visor corpus of 2,612 chord progressions, mainly jazz standards, but also including some blues, jazz-blues, modal jazz, and pop tunes. -
EFSC dataset
The EFSC dataset is transformed into a piano-roll representation with a resolution of 1/8th note. -
Mozart piano music dataset
The Mozart piano music dataset ( [29], comprising 13 piano sonatas with more than 106,000 notes) is used for pre-training the GAE portion of the RGAE. -
EFSC subset
The EFSC subset (comprising a total of 54,308 note events) of the Essen Folk Song Collection (EFSC) constitutes the data for the actual training and evaluation.