Y
Hacker News
new
|
ask
|
show
|
jobs
by
_3u10
5 days ago
No, they need the same arch, but you can distill them into a single model. And yes, if you use the API directly Claude will often say it’s an open weight model (likely the ones it was distilled from)