TIB - Organizations

PALADIN: Benchmarks, Experimental Settings, and Evaluation

This collection includes all the data and scripts necessary to reproduce the results from the experimental study of PALADIN. Data The data is generated using the Synthetic Data...
- Dataset
- ZIP
WN18RR Benchmark

WN18RR is a link prediction dataset created from WN18, which is a subset of WordNet. WN18 consists of 18 relations and 40,943 entities. However, many text triples are obtained...
- Dataset
- ZIP
SPaRKLE: Symbolic caPtuRing of knowledge for Knowledge graph enrichment with ...

SPaRKLE is a hybrid method that combines symbolic and mathematical methodologies while leveraging Partial Completness Assumption (PCA) heuristics to capture implicit information...
- Dataset
- nt
- TSV
- ZIP
The Family KG

Statistical predicate invention is considered a key problem in statistical relational learning. SPI involves discovering new concepts, properties, and relations within...
- Dataset
- nt
The French Royalty KG

The French Royalty KG is created by extracting information about French royal families from DBpedia, and for each person, we added the class dbo:Person as well as different...
- Dataset
- .zip
FB15k-237 Benchmark

FB15k-237 is a link prediction dataset created from FB15k. While FB15k consists of 1,345 relations, 14,951 entities, and 592,213 triples, many triples are inverses that cause...
- Dataset
- ZIP
A Benchmark Suite for Federated Semantic Data Query Processing (FedBench)

A comprehensive benchmark suite for testing and analyzing the performance of federated query processing strategies on semantic data. This benchmark is flexible enough to cover a...
- Dataset
- ZIP
Berlin SPARQL Benchmark (BSBM)

The SPARQL Query Language for RDF and the SPARQL Protocol for RDF are implemented by a growing number of storage systems and are used within enterprise and open web settings. As...
- Dataset
- ZIP
The Lehigh University Benchmark (LUBM)

The Lehigh University Benchmark is developed to facilitate the evaluation of Semantic Web repositories in a standard and systematic way. The benchmark is intended to evaluate...
- Dataset
- ZIP
- TXT
Waterloo SPARQL Diversity Test Suite (WatDiv) Benchmark

WatDiv is a benchmark designed to measure how an RDF data management system performs across a wide spectrum of SPARQL queries with varying structural characteristics and...
- Dataset
- ZIP
WN18 (WordNet18)

The WN18 dataset has 18 relations scraped from WordNet for roughly 41,000 synsets, resulting in 141,442 triplets. It was found out that a large number of the test triplets can...
- Dataset
- tar.gz
- tgz
FB15k (Freebase 15K)

The FB15k dataset contains knowledge base relation triples and textual mentions of Freebase entity pairs. It has a total of 592,213 triplets with 14,951 entities and 1,345...
- Dataset
- ZIP
YAGO3-10 (Yet Another Great Ontology 3-10)

YAGO3-10 is benchmark dataset for knowledge base completion. It is a subset of YAGO3 (which itself is an extension of YAGO) that contains entities associated with at least ten...
- Dataset
- TSV
Entity Summarization Benchmark (ESBM)

ESBM (short for Entity Summarization BenchMark) is a benchmark for evaluating algorithms for entity summarization, aka entity summarizers. The latest version is on 2019-12-08.
- Dataset
- .zip

14 datasets found