by thundergolfer
486 days ago
“Pure garage-energy” is a great phrase. I'm most interested in seeing their inference stack, and I hope that's one of the 5. I think most people are running R1 on a single H200 node, but DeepSeek had much less RAM per GPU for their inference, so they ran some kind of cluster-based MoE deployment.