C2CGit

A large dataset collected from open-source projects on GitHub, more than 20× larger than existing datasets.

BibTeX: