Hacker News new | ask | show | jobs
DeepSeek-R1 at 3,872 tokens / second on a single Nvidia HGX H200 (blogs.nvidia.com)
13 points by moondistance 499 days ago
1 comments

https://news.ycombinator.com/item?id=42879864

this is cerebras' 70B number, 1600 tokens / sec, not sure about the costs.