| HN Mirror



	by txtai 1268 days ago
	Starting with HF models and moving to a large model like GPT-3 only when the task calls for it is a good approach for almost all tasks.
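
A minimal sketch of that escalation strategy, assuming each model is wrapped in a plain Python callable and that you already have a task-specific scoring function. The names `escalate`, `score`, `small_model`, and `large_model` are hypothetical stand-ins for illustration, not part of any library; in practice the callables would wrap a local HF model and a hosted API.

```python
from typing import Callable, Optional, Sequence, Tuple

Model = Callable[[str], str]


def escalate(
    models: Sequence[Tuple[str, Model]],
    prompts: Sequence[str],
    score: Callable[[str, str], float],
    threshold: float,
) -> Optional[str]:
    """Return the name of the first model whose mean score meets the threshold.

    `models` must be ordered from cheapest to most expensive.
    Returns None if no model is good enough.
    """
    if not prompts:
        raise ValueError("need at least one evaluation prompt")
    for name, model in models:
        mean = sum(score(p, model(p)) for p in prompts) / len(prompts)
        if mean >= threshold:
            return name
    return None


# Hypothetical stand-ins: the "task" is to upper-case the prompt.
def small_model(prompt: str) -> str:
    return prompt.lower()  # pretend the small model fails the task


def large_model(prompt: str) -> str:
    return prompt.upper()  # pretend the large model solves it


def score(prompt: str, output: str) -> float:
    return 1.0 if output == prompt.upper() else 0.0


candidates = [("small-hf-model", small_model), ("large-hosted-model", large_model)]
chosen = escalate(candidates, ["hello", "world"], score, threshold=0.9)
print(chosen)
```

With these stand-ins the small model scores 0.0, so the loop moves on to the larger one. The paid model is only called once the cheaper option has been shown to fall short.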

1 comments

joedevon 1268 days ago

How about fine-tuning and testing with Davinci first, then scaling down to the other models or HF once you've proven it works? I believe the OpenAI docs propose this approach (minus HF, of course).


txtai 1268 days ago

That's up to you. Many don't want to open an account and pay in order to explore what's possible. There are LLMs available on the HF Hub, such as google/flan-t5-xl.


rozgo 1268 days ago

This works well. Starting with bigger models lets us explore what's possible. I find it easier to scale down from there.
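
A rough sketch of the scale-down direction, assuming the candidate models are plain Python callables ordered from largest to smallest and that a task-specific scoring function is available. `scale_down`, `score`, and the three stand-in models are hypothetical names for illustration only.

```python
from typing import Callable, Optional, Sequence, Tuple

Model = Callable[[str], str]


def scale_down(
    models: Sequence[Tuple[str, Model]],
    prompts: Sequence[str],
    score: Callable[[str, str], float],
    threshold: float,
) -> Optional[str]:
    """Walk from the largest model downward and return the smallest one
    that still meets the threshold, stopping at the first failure.

    Returns None if even the largest model is not good enough.
    """
    if not prompts:
        raise ValueError("need at least one evaluation prompt")
    chosen = None
    for name, model in models:
        mean = sum(score(p, model(p)) for p in prompts) / len(prompts)
        if mean < threshold:
            break  # quality dropped; keep the previous (larger) model
        chosen = name
    return chosen


# Hypothetical stand-ins: the "task" is to upper-case the prompt.
def large(prompt: str) -> str:
    return prompt.upper()


def medium(prompt: str) -> str:
    return prompt.upper()


def small(prompt: str) -> str:
    return prompt  # pretend the smallest model fails the task


def score(prompt: str, output: str) -> float:
    return 1.0 if output == prompt.upper() else 0.0


ordered = [("large", large), ("medium", medium), ("small", small)]
smallest_ok = scale_down(ordered, ["hello", "world"], score, threshold=0.9)
print(smallest_ok)
```

Here the largest model proves the task is feasible, and the loop then keeps stepping down until quality drops below the threshold.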
