NLPContributionGraph Trial Dataset - Trial data from the NLPContributionGraph scheme

Research contributions in NLP annotated as structured data using the NLPContributionGraph scheme for structuring scholarly contributions in the ORKG.

The repository is organized as follows:

[task-name-folder]/                                # machine-translation, named-entity-recognition, question-answering, relation-classification, text-classification
    ├── [article-counter-folder]/                  # ranges from 0 to 9 since we annotated 10 articles per task
    │   └── research-problem.json              # `research problem` mandatory information unit in json format
    │   └── model.json                         # `model` information unit in json format; in some articles it is called `approach`
    │   └── ...                                # there are 8 main information units in all and each article may be annotated by 3 or 6
    │   └── triples/                               # the folder containing information unit triples one per line
    │   │   └── research-problem.txt               # `research problem` triples (one research problem statement per line)
    │   │   └── model.txt                          # `model` triples (one statement per line)
    │   │   └── ...                                # there are 8 main information units in all and each article may be annotated by 3 or 6
    │   └── ...                                    # there are ten articles annotated for each task, so this repeats nine more times
    └── ...                                        # there are five tasks selected overall, so this repeats four more times

Field	Value
Created	July 3, 2020
Last updated	August 4, 2023
Format	JSON

Field

Value

Created

July 3, 2020

Last updated

August 4, 2023

Format

JSON

Trial data from the NLPContributionGraph scheme

Cite this as

Additional Information

Trial data from the NLPContributionGraph scheme

Embed resource view

Cite this as

Additional Information