Hacker News new | ask | show | jobs
by bradhilton 517 days ago
These are distillation fine-tunes of two different models:

- Qwen2.5 7B - Llama3.1 8B

Though the sizes are similar, they will probably have different strengths and weaknesses based on their lineage.

1 comments

thanks.

I'm running the qwen distillation right now and it's amazing.