https://www.lesswrong.com/posts/HLJoJYi52mxgomujc/realistic-reward-hacking-induces-different-and-deeper-1
Sharan Maiya
maius
AI & ML interests
None yet
Recent Activity
updated
a model
about 4 hours ago
maius/qwen3-30b-a3b_goodness_no-thinking
published
a model
about 4 hours ago
maius/qwen3-30b-a3b_goodness_no-thinking
updated
a model
about 4 hours ago
maius/qwen3-30b-a3b_goodness_thinking