|
|
|
|
|
by manav
277 days ago
|
|
We're all training similarly large base++; near same data, just pricing it differently... with grok removing a few filters and maybe some safeguards? For that matter, many of the benchmarks are flawed and are just easily gamed and whatnot. iykyk. |
|