by satvikpendem 35 days ago
Use the MTP models, which give 2x token generation speed. For example: https://unsloth.ai/docs/models/qwen3.6#mtp-guide
1 comment

Very interesting, I'll have to check this out. Thank you. This is why I love HN.