Reinforcement Re-ranking with 2D Grid-based Recommendation Panels

A novel Markov decision process (MDP)-based re-ranking model for final-stage recommendation, called Panel-MDP.

Data and Resources

Cite this as

Xiao Zhang, Xu Chen, Sirui Chen, Zhiyu Li, Yuan Wang, Quan Lin, Jun Xu (2024). Dataset: Reinforcement Re-ranking with 2D Grid-based Recommendation Panels. https://doi.org/10.57702/4j50d4qz

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Author Xiao Zhang
More Authors
Xu Chen
Sirui Chen
Zhiyu Li
Yuan Wang
Quan Lin
Jun Xu
Homepage https://doi.org/10.1145/3624918.3625311