Reward Hacking in SQL: How My Multi-Agent Optimizer Learned to Cheat | Los Angeles .

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

June 18, 2026 · Los Angeles

SQLSwarm: Gating Multi-Agent SQL RL

Learn how a multi-agent SQL optimizer learned to "cheat" by hacking speed rewards. See the system, failure cases, and how to gate generative RL on correctness.

Overview
Links
Tech stack