|
|
|
|
|
by andix
490 days ago
|
|
Its llama/quen with some additional training to add reasoning. In a similar way deep seeks v3 was trained into r1. It also looks to me like there was some Chinese propaganda trained into llama/quen too, but that’s just my observation. |
|