Cite this as

Alexander Bukharin, Yixiao Li, Pengcheng He, Tuo Zhao (2024). Dataset: HERON: Hierarchical Preference-based Reinforcement Learning. Resource: Original Metadata. https://doi.org/10.57702/sffb16ea

DOI retrieved: December 16, 2024

Additional Information

Field         Value
Created       unknown
Last updated  December 16, 2024
Format        application/json