Supplementary data for a study on a data driven learning approach for the assessment of data quality

Abstract: This data is 100% artificial (no real patients involved). This dataset was generated to explore the application of simple machine learning to learn knowledge about how simple statistical measures about a dataset (e.g. mean value for variable, value counts etc.) can indicate data quality issues. The generated dummy data, outcome data and MM-results are available in the folder "dummy data, outcome data, MM-results". The numbers of triggered DQ-issue rules are available in folder "triggered issue rules". The MM-results used for machine learning can be found in file "MM-results_export_for_machine_learningresult_exports.csv".

Data and Resources

This dataset has no data

Cite this as

Tute, Erik (2021). Dataset: Supplementary data for a study on a data driven learning approach for the assessment of data quality. https://doi.org/10.24355/dbbs.084-202107061200-0

DOI retrieved: 2021

Additional Info

Field Value
Imported on January 8, 2025
Last update January 8, 2025
License CC-BY-4.0
Source https://doi.org/10.24355/dbbs.084-202107061200-0
Author Tute, Erik
Given Name Erik
Family Name Tute
Source Creation 2021
Publication Year 2021
Resource Type Dataset - research_data
Subject Areas
Name: data driven learning approach

Name: dummy data

Name: data qualitiy assessment

Name: 61

Related Identifiers
Identifier: https://leopard.tu-braunschweig.de/receive/dbbs_mods_00069626?XSL.Transformer=mods
Type: URL
Relation: HasMetadata