
Changes



On April 11, 2024 at 7:22:59 AM UTC, philipp Philipp D. Rohde:
  • Updated description of PALADIN: Benchmarks, Experimental Settings, and Evaluation from

    This collection includes all the data and scripts necessary to reproduce the results from the experimental study of [PALADIN](https://github.com/SDM-TIB/PALADIN).

    __Data__

    The data is generated using the [Synthetic Data Generator](https://github.com/SDM-TIB/Synthetic-Data-Generator), which generates process-based breast cancer treatment data following the distribution in a real population of breast cancer patients. The collection comprises a total of 18 data sets, nine for relational databases and nine for RDF-based knowledge graphs. For each data format, there are three different sizes of data sets:

    + _Small_ models 1,000 patients
    + _Medium-sized_ models 10,000 patients
    + _Large_ models 100,000 patients

    There are three data sets of each size. They differ in the parameter used for the mutation probability of the data generator. The lower this value is, the more closely the data follows the treatment guideline for breast cancer patients with an amplified HER2 gene.

    The data is available for download in the following formats:

    * Turtle format: `synth_data_ttl.zip`
    * Preloaded for use with Virtuoso 7.20.3237: `synth_data_virtuoso.zip`
    * MySQL 8.1 dump: `synth_data_sql.zip`

    __PALADIN Schemas__

    The file `paladin_schemas.zip` contains the different PALADIN schemas used in the experimental study. There are mainly four different schemas. One of them represents the treatment guideline for breast cancer patients with an amplified HER2 gene. The remaining three schemas are used in the study of the scalability. They divide the patients based on ranges over their IDs. They comprise 64, 256, and 1024 nodes, respectively.

    __Experimental Environment__

    In order to reproduce the results, download the file `experiments.zip`. Once unzipped, execute the file `run_experiments.sh`. Note that you need to have Docker installed.
    The script `run_experiments.sh` should be executed with sudo permissions so that it can automatically transfer the ownership of the files created with Docker to your user.
    to
    This collection includes all the data and scripts necessary to reproduce the results from the experimental study of [PALADIN](https://github.com/SDM-TIB/PALADIN).

    __Data__

    The data is generated using the [Synthetic Data Generator](https://github.com/SDM-TIB/Synthetic-Data-Generator), which generates process-based breast cancer treatment data following the distribution in a real population of breast cancer patients. The collection comprises a total of 18 data sets, nine for relational databases and nine for RDF-based knowledge graphs. For each data format, there are three different sizes of data sets:

    + _Small_ models 1,000 patients
    + _Medium-sized_ models 10,000 patients
    + _Large_ models 100,000 patients

    There are three data sets of each size. They differ in the parameter used for the mutation probability of the data generator. The lower this value is, the more closely the data follows the treatment guideline for breast cancer patients with an amplified HER2 gene.

    The data is available for download in the following formats:

    * Turtle format: `synth_data_ttl.zip`
    * Preloaded for use with Virtuoso 7.20.3237: `synth_data_virtuoso.zip`
    * MySQL 8.1 dump: `synth_data_sql.zip`

    __PALADIN Schemas__

    The file `paladin_schemas.zip` contains the different PALADIN schemas used in the experimental study. There are mainly seven different schemas. One of them represents the treatment guideline for breast cancer patients with an amplified HER2 gene. The remaining six schemas are used in the study of the scalability. They divide the patients based on ranges over their IDs. They comprise 16, 32, 64, 128, 256, 512, and 1024 nodes, respectively.

    __Experimental Environment__

    In order to reproduce the results, download the file `experiments.zip`. Once unzipped, execute the file `run_experiments.sh`. Note that you need to have Docker installed.
    The script `run_experiments.sh` should be executed with sudo permissions so that it can automatically transfer the ownership of the files created with Docker to your user.


