Hacker News new | ask | show | jobs
by cjbprime 592 days ago
Right, the nvidia card maxes out at 24GB.
1 comments

A 24gb model is fast and ranks 3. A 70b model is slow and 8.

A top tier hosted model is fast and 100.

Past what specialized models can do, it's about a mixture/agentic approach and next level, nuclear power scale. Having a computer with lots of relatively fast RAM is not magic.