Hacker News
by throwawaymaths 376 days ago
not 100% hard, but if you're unconvinced that some level of alignment can be achieved by brute-forcing it into the weights, download DeepSeek, ask it some sensitive questions, and see what it says