Hacker News
by rileyphone 743 days ago
In that case there are two attractors: one towards the Golden Gate Bridge and one towards the harmless, helpful, honest assistant persona. Techniques like this probably give weirder results as models scale, but there's no reason to think they get wiped out.
1 comment

What if the Golden Gate Bridge is Mein Kampf or something like that?