Hacker News
DeepSeek-R1 at 3,872 tokens / second on a single Nvidia HGX H200
(blogs.nvidia.com)
13 points by moondistance 499 days ago | 1 comment
billconan 499 days ago
https://news.ycombinator.com/item?id=42879864
This is Cerebras' 70B number: 1,600 tokens/sec. Not sure about the costs.