Hacker News new | ask | show | jobs
by scottydog51834 1201 days ago
GPT-2

Here is one cool (and recent) research project using GPT-2 model weights.

https://www.lesswrong.com/posts/cgqh99SHsCv3jJYDS/we-found-a...

1 comments

TFA claims GPT-2 isn't open. Is it that the trained weights are open but the project that did the training isn't?
Realistically the most difficult part of training a LLM is curating a good data set. OpenAI never published their training data.