Hacker News new | ask | show | jobs
by 2ndorderthought 51 days ago
Woah. How is this working? It's stupid fast.
1 comments

The weights are mapped directly to transistors. It's not a generic processor, it's literally a dedicated Llama 8B chip that can't be used for anything else. When you specialize in hardware you get faster - Taalas is pushing that to the limit.

They seem to be doing well. I checked recently and their API is closed to signups due to overwhelming demand.

I want to buy a chip not API access!!!