| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by specproc 1201 days ago
	Nice, that sounds like it's worth exploring. Much appreciated. Again though, it's the zero-effort part that's appealing. I'm on a very small team and getting that to close to the same standard will take time for a ham-fisted clod like myself. Worth giving a shot all the same though, thanks again.

2 comments

leobg 1201 days ago

The zero shot ability is convenient. But for tasks that you need to get done millions of times, I’d much rather spend $10 on GPU compute and maybe a day of training data generation to train a T5 which I then “own”.

Also, running your own specialized model locally can be much faster than using someone’s API.

link

specproc 1200 days ago

Sure, purely a time issue for me. I'm not the most skilled in this area, and I've got a load of core stuff I need to keep on top of.

I think we're not far off having something equivalent that can be pulled from Huggingface and run on a near consumer grade GPU.

For now, I'll hang tight and see how things progress. Don't disagree.

link

leobg 1200 days ago

Maybe one day you’ll be able to tell ChatGPT what kind of model you need and it’ll automatically select the right architecture, gather the training data, and commission the training using the cheapest and/or fastest provider. :)

link

pfdietz 1201 days ago

It's interesting what you can do with ChatGPT with few shot learning. It generalizes at the drop of a hat, often correctly.

link