Hacker News new | ask | show | jobs
Test your interpretability techniques by de-censoring Chinese models (lesswrong.com)
2 points by allenleee 135 days ago