Pipeline for Arabic Legal Text Summarization Dataset
Data and Resources
- Gold Standard (JSON)
  The gold standard, built using GPT-4, in JSON format. It contains the summaries...
- The extracted text of the cases (TXT)
  This file contains the text extracted from the PDFs of the legal cases.
- Generated output of the fine-tuned model (JSONL)
  This file contains the output generated by the fine-tuned model.
- Generated output of the base model (JSONL)
  This file contains the output generated by the base model.
- Training dataset (JSONL)
  This file contains the training dataset used for fine-tuning the model.
- Evaluation dataset (JSONL)
  This file contains the evaluation dataset used in the experiments.
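
Several of the resources above are distributed as JSONL, that is, one JSON value per line. A minimal sketch of how such a file could be read in Python is given below. The helper name `read_jsonl` is hypothetical, and nothing is assumed about the fields inside each record, since this page does not document them.

```python
import json
from pathlib import Path


def read_jsonl(path):
    """Return a list with one parsed JSON value per non-empty line.

    Hypothetical helper: the record schema is not documented on this
    page, so each line is returned exactly as parsed.
    """
    records = []
    # UTF-8 is assumed here so that Arabic text is decoded correctly.
    with Path(path).open(encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines between records
            try:
                records.append(json.loads(line))
            except json.JSONDecodeError as exc:
                raise ValueError(
                    f"{path}: invalid JSON on line {line_number}"
                ) from exc
    return records
```

Calling `read_jsonl` on a downloaded resource (for instance a local copy saved under a name of your choosing) returns the records in file order, which makes it straightforward to inspect the first entry before relying on any particular field.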
Cite this as
Ahmad Sakor (2024). Dataset: Pipeline for Arabic Legal Text Summarization Dataset. https://doi.org/10.57702/djfcf0oa
DOI retrieved: December 21, 2024
Additional Info
Field | Value |
---|---|
Created | December 21, 2024 |
Last update | December 21, 2024 |
License | Not specified |
Author | Ahmad Sakor |