Hacker News
by JKCalhoun 718 days ago
"Call my broker, tell him to sell all my NVDA!"

Combined with the paper from earlier this year that claimed LLMs work fine (and faster) with trinary numbers (rather than floats? or long ints?), the idea of running a fast LLM locally is looking better and better.

1 comment

This is the same paper (or an extension) — using ternary weights means you can replace multiplication with addition/subtraction.
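
To make the addition/subtraction point concrete, here is a rough sketch in plain Python, not taken from the paper. The function name `ternary_matvec` and the toy values are made up for illustration, and it assumes every weight is exactly -1, 0, or +1. A real implementation would presumably pack the weights and vectorize the loop, and may also apply a scaling factor that this sketch leaves out.

```python
def ternary_matvec(weights, x):
    """Matrix-vector product for weights restricted to {-1, 0, +1}.

    Hypothetical helper for illustration only. No multiplication is
    performed: each weight either adds the input, subtracts it, or
    skips it.
    """
    out = []
    for row in weights:
        acc = 0.0
        for w, xi in zip(row, x):
            if w == 1:
                acc += xi        # +1 weight: add the activation
            elif w == -1:
                acc -= xi        # -1 weight: subtract the activation
            elif w != 0:
                raise ValueError(f"non-ternary weight: {w!r}")
            # 0 weight: contributes nothing, so it is skipped
        out.append(acc)
    return out


# Toy example (values are arbitrary).
W = [[1, 0, -1],
     [-1, 1, 1]]
x = [0.5, 2.0, -1.5]
y = ternary_matvec(W, x)

# Reference result using ordinary multiply-accumulate, for comparison.
y_ref = [sum(w * xi for w, xi in zip(row, x)) for row in W]
```

Here `y` and `y_ref` come out the same, but the first version only ever adds or subtracts the activations, and zero weights cost nothing at all.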