Skip to content
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning