|
|
|
|
|
by israrkhan
424 days ago
|
|
Agreed. Also their name make it seem like it is totally new model. If they needed to assign their own name to it, at least they could have included the parent (and grant parent) model names in the name. Just like the name DeepSeek-R1-Distill-Qwen-7B clearly says that it is a distilled Qwen model. |
|
Otoh, there aren't many frontier labs that have actually done finetunes.