Hacker News
by osti 455 days ago
I think this is the one where they train an LLM without NVIDIA GPUs.
1 comment

They talk about CUDA-level tracing in their framework. I assume it's just consumer GPUs that Nvidia says aren't meant to be used in datacenters.