Hacker News
by alansaber 185 days ago
This model is more comparable to GPT-2 than to anything we use now.