Hacker News
Test your interpretability techniques by de-censoring Chinese models (alignmentforum.org)
2 points by allenleee 72 days ago