You're currently viewing an old version of this dataset. To see the current version, click here.

RCV1 dataset

The RCV1 dataset is used for predicting categories of newswire stories recently collected by Reuters. Ltd. The RCV1 can be naturally partitioned based on news category and used for federated learning experiments, since readers may only be interested in one or two categories of news and the model training process will mimic the personalized privacy-preserving news recommender system, for which reader history is located on a user’s personal devices.

Data and Resources

This dataset has no data

Cite this as

Shlomo E. Chazan, Sharon Gannot, Jacob Goldberger (2024). Dataset: RCV1 dataset. https://doi.org/10.57702/jffbfebe

Private DOI This DOI is not yet resolvable.
It is available for use in manuscripts, and will be published when the Dataset is made public.

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.48550/arXiv.2101.00052
Citation
  • https://doi.org/10.48550/arXiv.1606.04838
  • https://doi.org/10.48550/arXiv.1812.06535
Author Shlomo E. Chazan
More Authors
Sharon Gannot
Jacob Goldberger
Homepage https://www.cs.cornell.edu/~cst2118/RCV1/