Hacker News
Robustly identifying concepts introduced during chat fine-tuning with crosscoders (arxiv.org)
6 points by veryluckyxyz 440 days ago