| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by airocker 808 days ago
	Are tools fine tuned in both or just prompt based? For a fair comparison

2 comments

fudoshin2596 808 days ago

No fine tuning. Looks like he's do raw model capabilities with simple prompt. repo: https://github.com/parea-ai/tool-use-benchmark

link

Joschkabraun 808 days ago

All models got the same prompt fed which was essentially "Question: {question}". And then the API's accept the function call definition

link

airocker 808 days ago

But what if there is a bad model that fine tunes with only tools?

link

Joschkabraun 808 days ago

What do you mean with being a bad model?

If the model is really good at tool use, then it will broadly useful as it needs capabilities to generate the tool definition. So, there should be some transferability.

link

ec109685 808 days ago

If the exact tools test was part of the training of the model, wouldn’t that throw off the results and not generalize?

link