-
Reinforcement Learning with Combinatorial Actions: An Application to Vehicle ...
The Capacitated Vehicle Routing Problem (CVRP) dataset, a combinatorial optimization problem in which a set of locations must be covered by a single vehicle with limited capacity.