|
|
|
|
|
by indeed30
502 days ago
|
|
So, can somebody in the know speculate about how Deepseek (or OpenAI, or whoever really) is actually running their API? If I wanted to run a production-grade service using the full Deepseek model, with good tokens/sec and the ability to serve concurrent requests, what sort of hardware are we looking at? |
|