CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset

CORU is a dataset for post-OCR parsing and receipt understanding, designed to improve OCR and information extraction from receipts in multilingual settings involving Arabic and English.

Cite this as

Abdelrahman Abdallah, Mahmoud Abdalla, Mahmoud SalahEldin Kasem, Mohamed Mahmoud, Ibrahim Abdelhalim, Mohamed Elkasaby, Yasser ElBendary, Adam Jatowt (2024). Dataset: CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset. https://doi.org/10.57702/qf25gc3z

DOI retrieved: December 16, 2024

Additional Info

Field         Value
Created       December 16, 2024
Last update   December 16, 2024
Defined In    https://doi.org/10.48550/arXiv.2406.04493
Author        Abdelrahman Abdallah
More Authors  Mahmoud Abdalla, Mahmoud SalahEldin Kasem, Mohamed Mahmoud, Ibrahim Abdelhalim, Mohamed Elkasaby, Yasser ElBendary, Adam Jatowt
Homepage      https://github.com/Update-For-Integrated-Business-AI/CORU