<?xml version='1.0' encoding='utf-8'?>
<resource xmlns="http://datacite.org/schema/kernel-4" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.2/metadata.xsd">
  <identifier identifierType="DOI">10.71694/4tut8jmy</identifier>
  <creators>
    <creator>
      <creatorName>Philipp D. Rohde</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID">0000-0002-9835-4354</nameIdentifier>
    </creator>
    <creator>
      <creatorName>Enrique Iglesias</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID">0000-0002-8734-3123</nameIdentifier>
    </creator>
  </creators>
  <titles>
    <title>Multi-Repository Metadata Assessment</title>
  </titles>
  <publisher>TIB</publisher>
  <publicationYear>2025</publicationYear>
  <subjects>
    <subject>Metadata Quality</subject>
    <subject>OEMetadata</subject>
  </subjects>
  <contributors>
    <contributor>
      <contributorName>Philipp D. Rohde</contributorName>
    </contributor>
  </contributors>
  <dates>
    <date dateType="Created">2025-12-13</date>
    <date dateType="Updated">2026-02-27</date>
    <date dateType="Issued">2025-12-13</date>
  </dates>
  <resourceType resourceTypeGeneral="Dataset">Dataset</resourceType>
  <sizes>
    <size>157863 kB</size>
  </sizes>
  <formats>
    <format>ZIP</format>
  </formats>
  <descriptions>
    <description descriptionType="Other">This multi-repository assessment evaluates how well the metadata of energy-related datasets conforms to [OEMetadata v2](https://github.com/OpenEnergyPlatform/oemetadata). The metadata was collected from the following repositories:&#13;
&#13;
- Energy Data eXchange: https://edx.netl.doe.gov/&#13;
- EU Open Data Portal: https://data.europa.eu/data/datasets&#13;
- Eurostat: https://ec.europa.eu/eurostat&#13;
- FfE Open Data Portal: https://opendata.ffe.de/&#13;
- Inspire-HEP: https://inspirehep.net&#13;
- Open Energy Data Initiative: https://data.openei.org/&#13;
- Open Energy Platform: https://openenergyplatform.org/database/&#13;
- OpenAIRE: https://explore.openaire.eu/&#13;
- Open Power System Data (OPSD): https://open-power-system-data.org/&#13;
- Open Access Power-Grid Frequency Database: https://osf.io/m43tg/&#13;
&#13;
Unfortunately, we cannot share the scripts used to download the metadata from these repositories.&#13;
However, we do share the preprocessing scripts for those repositories whose metadata needed preprocessing before it could be mapped to OEMetadata.&#13;
These scripts are meant to be run directly after the metadata has been retrieved from the repository.&#13;
&#13;
To map the raw metadata (see `metadata_raw`) to OEMetadata, run&#13;
&#13;
```bash&#13;
pip install -r requirements.txt&#13;
python parser.py&#13;
```&#13;
&#13;
The transformed metadata is then stored in a directory called `metadata_mapped`.&#13;
To fix some reporting issues with nested fields, execute the cleanup script for the OEP:&#13;
&#13;
```bash&#13;
python cleanup_oep.py&#13;
```&#13;
&#13;
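Before generating the report, it can help to confirm that the mapping step actually produced output. The following is only a minimal sketch, not part of the shared scripts: it assumes that `metadata_raw` and `metadata_mapped` are directories in the current working directory, and the helper name `count_files` is hypothetical. Since the file layout inside these directories is not described here, it simply counts all files recursively.&#13;
&#13;
```python
from pathlib import Path


def count_files(directory):
    """Return the number of regular files below `directory`, or 0 if it is missing."""
    root = Path(directory)
    if not root.is_dir():
        return 0
    # rglob("*") walks all subdirectories; directories themselves are not counted
    return sum(1 for path in root.rglob("*") if path.is_file())


if __name__ == "__main__":
    # Directory names are taken from the description above
    for name in ("metadata_raw", "metadata_mapped"):
        print(f"{name}: {count_files(name)} file(s)")
```
&#13;
A count of zero for `metadata_mapped` would suggest that the mapping step has not been run yet or did not finish.&#13;
&#13;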
Now you are ready to reproduce our report by running&#13;
&#13;
```bash&#13;
python report.py&#13;
```&#13;
&#13;
The report for each repository is then stored in the directory `report`.</description>
  </descriptions>
</resource>
