DocBank

DocBank consists of 500K document layouts by weak supervision of articles available on the arXiv.com.

Data and Resources

Cite this as

Minghao Li, Yiheng Xu, Lei Cui, Shaohan Huang (2024). Dataset: DocBank. https://doi.org/10.57702/k9rwozlf

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.1007/978-3-031-41676-7_21
Author Minghao Li
More Authors
Yiheng Xu
Lei Cui
Shaohan Huang
Homepage https://coling.org/coling2020/datasets/docbank/