4 datasets found

Tags: document layout analysis

Filter Results
  • DiT

    DiT: Self-supervised pre-training for document image Transformer.
  • DocBank

    DocBank consists of 500K document layouts by weak supervision of articles available on the arXiv.com.
  • D4LA

    A new benchmark named D4LA, which is the most diverse and detailed manually-labeled dataset for document layout analysis.
  • PubLayNet dataset

    The PubLayNet dataset is the largest dataset ever for document layout analysis.