- Dataset for Configurable Software Systems
The dataset used in this paper is a collection of configurable software systems, including Apache, BDBC, BDBJ, LLVM, SQLite, and x264.
- Using Bug Report Discussions to Guide Fixing Bugs in Software
We propose various input context representations, encompassing different natural language components that are tied to the discussions and likely to capture their meaningful...
- Stack Overflow Performance Discussions
The dataset used for the study contains 2,304 posts related to the performance of software components.
- Program Merge Conflict Resolution via Neural Transformers
The dataset is used to train and evaluate the MergeBERT model for merge conflict resolution.
- Source Code Graphs for Just-In-Time Bug Prediction
The dataset contains 7 different commit categories: None, Merge, Corrective, Preventive, Addition, Non-Functional, and Perfective.
- ESBMC-AI dataset
A dataset of 1000 C code samples, each consisting of 20 to 50 lines of code, used to evaluate the proposed method for automatic program repair.
- LIME and SHAP explanations for issue type predictions
The dataset contains 3092 issues, each with the machine learning models' prediction of whether it is a bug, along with the corresponding LIME and SHAP explanations.