Wikipedia2 and Aozorabunko datasets

Wikipedia2 and Aozorabunko datasets used for pre-training of PnG BERT model.

Data and Resources

Cite this as

Yusuke Yasuda, Tomoki Toda (2024). Dataset: Wikipedia2 and Aozorabunko datasets. https://doi.org/10.57702/argob6go

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Author Yusuke Yasuda
More Authors
Tomoki Toda
Homepage https://dumps.wikimedia.org/