|
|
|
|
|
by ahmedhawas123
325 days ago
|
|
Exciting as this is to toy around with... Perhaps I missed it somewhere, but I find it frustrating that, unlike most other open weight models and despite this being an open release, OpenAI has chosen to provide pretty minimal transparency regarding model architecture and training. It's become the norm for LLama, Deepseek, Qwenn, Mistral and others to provide a pretty detailed write up on the model which allows researchers to advance and compare notes. |
|
[0] https://cdn.openai.com/pdf/419b6906-9da6-406c-a19d-1bb078ac7...