Hacker News new | ask | show | jobs
by yeldarb 2227 days ago
I was very surprised how well it did mimicking the StackOverflow archives when I trained GPT-2 on them last year: https://stackroboflow.com (Only the 345M weights were released back then; now I'm curious how much better 1.5B would do.)