Hacker News
by ein0p 663 days ago
You don’t need to write your own LLM to know how it works. And unlike, say, a browser, it doesn’t do anything even remotely impressive unless you have at least a few tens of thousands of dollars to spend on training. Source: my day job is to do precisely what I’m telling you not to bother doing, but I do have access to a large pool of GPUs. If I didn’t, I’d be doing what I suggest above.
2 comments

Good points. For learning purposes, just understanding what a neural network is and how it works covers it all.
But I mean, people can always rent GPUs too. And they're getting pretty ubiquitous as we ramp up from the AI hype craze. I am just an IT monkey at the moment, and even I have on-demand access to a server with something like 4x192GB GPUs at work.
Have you tried renting a few hundred GPUs in public clouds? Or TPUs for that matter? For weeks or months on end?