|
|
|
|
|
by spps11
490 days ago
|
|
Thanks for sharing, enjoyed reading it! I have a slightly tangential question: Do you have any insights into what exactly DeepSeek did by bypassing CUDA that made their run more efficient? I always found it surprising that a core library like Cuda, developed over such a long time, still had room for improvement—especially to the extent that a seemingly new team of developers could bridge the gap on their own. |
|