Hacker News new | ask | show | jobs
by ckrapu 492 days ago
Identifying scheming in the latent streams would be harder as you would have an extra layer of obfuscation between you and the model’s reasoning.