AI agent developers have an internal metric they are trying to improve. That alone runs into Goodhart's law: once a measure becomes a target, it stops being a good measure.