-
PALADIN: Benchmarks, Experimental Settings, and Evaluation
This collection includes all the data and scripts necessary to reproduce the results from the experimental study of PALADIN. Data: The data is generated using the Synthetic Data... -
WN18RR Benchmark
WN18RR is a link prediction dataset created from WN18, which is a subset of WordNet. WN18 consists of 18 relations and 40,943 entities. However, many test triples are obtained... -
SPaRKLE: Symbolic caPtuRing of knowledge for Knowledge graph enrichment with ...
SPaRKLE is a hybrid method that combines symbolic and mathematical methodologies while leveraging Partial Completeness Assumption (PCA) heuristics to capture implicit information... -
The Family KG
Statistical predicate invention (SPI) is considered a key problem in statistical relational learning. SPI involves discovering new concepts, properties, and relations within... -
The French Royalty KG
The French Royalty KG is created by extracting information about French royal families from DBpedia, and for each person, we added the class dbo:Person as well as different... -
FB15k-237 Benchmark
FB15k-237 is a link prediction dataset created from FB15k. While FB15k consists of 1,345 relations, 14,951 entities, and 592,213 triples, many triples are inverses that cause... -
A Benchmark Suite for Federated Semantic Data Query Processing (FedBench)
A comprehensive benchmark suite for testing and analyzing the performance of federated query processing strategies on semantic data. This benchmark is flexible enough to cover a... -
Berlin SPARQL Benchmark (BSBM)
The SPARQL Query Language for RDF and the SPARQL Protocol for RDF are implemented by a growing number of storage systems and are used within enterprise and open web settings. As... -
The Lehigh University Benchmark (LUBM)
The Lehigh University Benchmark is developed to facilitate the evaluation of Semantic Web repositories in a standard and systematic way. The benchmark is intended to evaluate... -
Waterloo SPARQL Diversity Test Suite (WatDiv) Benchmark
WatDiv is a benchmark designed to measure how an RDF data management system performs across a wide spectrum of SPARQL queries with varying structural characteristics and... -
WN18 (WordNet18)
The WN18 dataset has 18 relations scraped from WordNet for roughly 41,000 synsets, resulting in 141,442 triplets. It was found that a large number of the test triplets can... -
FB15k (Freebase 15K)
The FB15k dataset contains knowledge base relation triples and textual mentions of Freebase entity pairs. It has a total of 592,213 triplets with 14,951 entities and 1,345... -
YAGO3-10 (Yet Another Great Ontology 3-10)
YAGO3-10 is a benchmark dataset for knowledge base completion. It is a subset of YAGO3 (which itself is an extension of YAGO) that contains entities associated with at least ten... -
Entity Summarization Benchmark (ESBM)
ESBM (short for Entity Summarization BenchMark) is a benchmark for evaluating algorithms for entity summarization, also known as entity summarizers. The latest version is dated 2019-12-08.
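
The WN18RR and FB15k-237 entries above mention triples that are inverses of other triples. As a rough illustration of what that means for a link prediction split, the Python sketch below flags test triples whose two entities are already linked in the opposite direction in the training split. It is not part of any of the listed benchmarks' tooling; the function name `inverse_leaks` and the toy triples are hypothetical and chosen only for this example.

```python
def inverse_leaks(train, test):
    """Return test triples (h, r, t) for which some training triple links
    the same two entities in the opposite direction, i.e. (t, r2, h)."""
    # Ordered (head, tail) entity pairs that are connected in the training split.
    train_pairs = {(h, t) for h, _, t in train}
    # A test triple is flagged when its reversed entity pair occurs in training.
    return [(h, r, t) for h, r, t in test if (t, h) in train_pairs]


# Toy, made-up triples for illustration only (not taken from any dataset).
train = [
    ("dog", "hypernym", "animal"),
    ("paris", "capital_of", "france"),
]
test = [
    ("animal", "hyponym", "dog"),         # reverse of a training pair
    ("berlin", "capital_of", "germany"),  # no reversed pair in training
]

leaks = inverse_leaks(train, test)
print(leaks)
```

This check only looks at entity pairs, so it flags any reversed pair regardless of relation. A stricter variant would first identify which relation pairs behave as inverses and only then filter.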