| HN Mirror


by ejpir 90 days ago

Unfortunately, the bigger models are pretty slow in tokens per second. The memory bandwidth is just not that high.
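A rough back-of-the-envelope sketch of why that is, assuming a dense model where every generated token has to stream all the weights from memory once. The 256 GB/s figure and the model sizes below are assumptions for illustration, not measurements; plug in your own numbers.

```python
def max_tokens_per_second(bandwidth_gb_s: float, weights_gb: float) -> float:
    """Upper bound on decode speed when generation is memory-bound:
    each token requires reading all model weights once."""
    if weights_gb <= 0:
        raise ValueError("weights_gb must be positive")
    return bandwidth_gb_s / weights_gb


if __name__ == "__main__":
    # Assumed peak memory bandwidth in GB/s (hypothetical, adjust for your machine).
    bandwidth = 256.0
    # Hypothetical in-memory weight sizes in GB (after quantization).
    for weights in (8.0, 40.0, 70.0):
        bound = max_tokens_per_second(bandwidth, weights)
        print(f"{weights:5.0f} GB of weights -> at most {bound:.1f} tok/s")
```

Real numbers come in lower than this bound, and MoE models do better than their total size suggests since only the active experts are read per token.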

You can check what each model does on AMD Strix Halo here: