URL pre-training dataset

A dataset of 20 million unlabeled URLs for pre-training.

BibTeX: