TREC05 spam corpus

The dataset used in the paper is the TREC05 spam corpus (the TREC 2005 Public Spam Corpus), which contains 39,399 legitimate (ham) emails and 52,790 spam emails, 92,189 messages in total.
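As a quick sanity check on these class counts, the labels can be tallied from the corpus index. The following is a minimal sketch, assuming an index in which each line has the form `<label> <relative path>` (for example `spam ../data/000/000`). The function name `count_labels` and the sample lines are hypothetical and not part of the paper's code.

```python
from collections import Counter


def count_labels(index_lines):
    """Tally labels from lines assumed to look like '<label> <path>'."""
    counts = Counter()
    for line in index_lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        label, _path = parts
        counts[label.lower()] += 1
    return counts


# Hypothetical sample lines in the assumed index format.
sample = [
    "spam ../data/000/000",
    "ham ../data/000/001",
    "spam ../data/000/002",
]
print(count_labels(sample))
```

To run it on the real corpus, pass an open file handle for the index file in place of `sample`; the path to that file depends on where the corpus was unpacked.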

BibTeX: