Hacker News new | ask | show | jobs
Every attention weight matrix in GPT-2, visualized (amanvir.com)
1 points by venusgirdle 426 days ago