Malware Detection Dataset

The dataset used in this paper is a large-scale regression dataset, containing 2,342,274 Portable Executable (PE) files, each represented with almost 100,000 features.

BibTeX:

@dataset{Martin_Jureček_and_Olha_Jurečková_2024,
  abstract = {The dataset used in this paper is a large-scale regression dataset, containing 2,342,274 Portable Executable (PE) files, each represented with almost 100,000 features.},
  author = {Martin Jureček and Olha Jurečková},
  doi = {10.57702/lfif0eg0},
  institution = {No Organization},
  keyword = {'Benign Files', 'Benign programs', 'Cloud', 'Detection', 'Features', 'Flow Behavior', 'Large-Scale Regression', 'Machine Learning', 'Malware', 'Malware Detection', 'Malware Families', 'Network Flow', 'Performance Metrics', 'Portable Executable', 'Windows programs'},
  month = {dec},
  publisher = {TIB},
  title = {Malware Detection Dataset},
  url = {https://service.tib.eu/ldmservice/dataset/malware-detection-dataset},
  year = {2024}
}