Web2Text: Deep Structured Boilerplate Removal
Data and Resources
-
Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Cite this as
Thijs Vogels, Octavian-Eugen Ganea, Carsten Eickhoff (2024). Dataset: Web2Text: Deep Structured Boilerplate Removal. https://doi.org/10.57702/phtx73hb
DOI retrieved: December 16, 2024
Additional Info
Field | Value |
---|---|
Created | December 16, 2024 |
Last update | December 16, 2024 |
Defined In | https://doi.org/10.48550/arXiv.1801.02607 |
Author | Thijs Vogels |
More Authors |
|
Homepage | https://arxiv.org/abs/1704.07813 |