USPTO-50k
The USPTO-50k dataset is a curated subset of chemical reaction examples from patent literature, where each reaction is labeled with one of ten reaction classes, focusing on the task of predicting likely reactants for given target compounds.
BibTex: