LLMs Gaming Verifiers: RLVR can Lead to Reward Hacking
Lukas Henrik Helff; Quentin Delfosse; David Steinmann; Ruben Härle; Hikaru Shindo; Patrick Schramowski; Wolfgang Stammer; Kristian Kersting; Felix Friedrich
In: Computing Research Repository eprint Journal (CoRR), Vol. abs/2604.15149, Pages 1-8, arXiv, 2026.