Dataset - LDM

GTFS-Madrid-Bench

A benchmark to evaluate declarative KG construction engines that can be used to provide access mechanisms to (virtual) knowledge graphs. Our proposal introduces several...
- Dataset
- ZIP
WN18RR Benchmark

WN18RR is a link prediction dataset created from WN18, which is a subset of WordNet. WN18 consists of 18 relations and 40,943 entities. However, many text triples are obtained...
- Dataset
- ZIP
SPaRKLE: Symbolic caPtuRing of knowledge for Knowledge graph enrichment with ...

SPaRKLE is a hybrid method that combines symbolic and mathematical methodologies while leveraging Partial Completness Assumption (PCA) heuristics to capture implicit information...
- Dataset
- nt
- TSV
- ZIP
The Family KG

Statistical predicate invention is considered a key problem in statistical relational learning. SPI involves discovering new concepts, properties, and relations within...
- Dataset
- nt
The French Royalty KG

The French Royalty KG is created by extracting information about French royal families from DBpedia, and for each person, we added the class dbo:Person as well as different...
- Dataset
- .zip
FB15k-237 Benchmark

FB15k-237 is a link prediction dataset created from FB15k. While FB15k consists of 1,345 relations, 14,951 entities, and 592,213 triples, many triples are inverses that cause...
- Dataset
- ZIP
Berlin SPARQL Benchmark (BSBM)

The SPARQL Query Language for RDF and the SPARQL Protocol for RDF are implemented by a growing number of storage systems and are used within enterprise and open web settings. As...
- Dataset
- ZIP
The Lehigh University Benchmark (LUBM)

The Lehigh University Benchmark is developed to facilitate the evaluation of Semantic Web repositories in a standard and systematic way. The benchmark is intended to evaluate...
- Dataset
- ZIP
- TXT
Waterloo SPARQL Diversity Test Suite (WatDiv) Benchmark

WatDiv is a benchmark designed to measure how an RDF data management system performs across a wide spectrum of SPARQL queries with varying structural characteristics and...
- Dataset
- ZIP
WN18 (WordNet18)

The WN18 dataset has 18 relations scraped from WordNet for roughly 41,000 synsets, resulting in 141,442 triplets. It was found out that a large number of the test triplets can...
- Dataset
- tar.gz
- tgz
FB15k (Freebase 15K)

The FB15k dataset contains knowledge base relation triples and textual mentions of Freebase entity pairs. It has a total of 592,213 triplets with 14,951 entities and 1,345...
- Dataset
- ZIP
YAGO3-10 (Yet Another Great Ontology 3-10)

YAGO3-10 is benchmark dataset for knowledge base completion. It is a subset of YAGO3 (which itself is an extension of YAGO) that contains entities associated with at least ten...
- Dataset
- TSV
Entity Summarization Benchmark (ESBM)

ESBM (short for Entity Summarization BenchMark) is a benchmark for evaluating algorithms for entity summarization, aka entity summarizers. The latest version is on 2019-12-08.
- Dataset
- .zip
Monash Time Series Forecasting Repository

All datasets contain univariate time series and they are available in a new format that we name as .tsf, pioneered by the sktime .ts format.
- Dataset
- ZIP
SDM-Genomic-Dataset

This benchmark is created by randomly sampling data records from somatic mutation data collected in COSMIC (https://cancer.sanger.ac.uk/cosmic). SDM-Genomic-Datasets include...
- Dataset
- .zip
- .csv
KGQA-Datasets

This dataset is a collection of existing KGQA datasets in the form of the huggingface datasets library, aiming to provide an easy-to-use access to them.
- Dataset
- ZIP

You can also access this registry using the API (see API Docs).

16 datasets found