Dataset Groups Activity Stream C2CGit A large dataset from open projects in GitHub, which is more than 20× larger than existing datasets. BibTex: @dataset{Wenhao_Zheng_and_Hong-Yu_Zhou_and_Ming_Li_and_Jianxin_Wu_2024, abstract = {A large dataset from open projects in GitHub, which is more than 20× larger than existing datasets.}, author = {Wenhao Zheng and Hong-Yu Zhou and Ming Li and Jianxin Wu}, doi = {10.57702/3iop4b10}, institution = {No Organization}, keyword = {'code', 'comments', 'dataset', 'translation'}, month = {dec}, publisher = {TIB}, title = {C2CGit}, url = {https://service.tib.eu/ldmservice/dataset/c2cgit}, year = {2024} }