| HN Mirror



	by valine 869 days ago
	This is cool. Was looking for model weights, but it seems like maybe it will work with a variety of different models. This is like a RAG/agent app built on top of your typical llama. Am I reading that right?


DanyWin 869 days ago

You are exactly right! Since I wanted a solution that works with many LLMs out of the box, I focused on chain-of-thought prompting and few-shot learning.

Many papers show that fine-tuning only helps with steerability and form (https://arxiv.org/abs/2402.05119), so I thought it would be sufficient to provide just the right examples, and it did work!
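For readers unfamiliar with the idea, here is a minimal sketch of few-shot chain-of-thought prompting in general. It is not the project's actual code: the `Example` class, the `build_few_shot_prompt` function, and the "Question / Reasoning / Answer" layout are all hypothetical names and conventions chosen for illustration. The point is only that worked examples are placed in the prompt, so no model weights need to change.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Example:
    """One worked demonstration shown to the model (hypothetical structure)."""
    question: str
    reasoning: str
    answer: str


def build_few_shot_prompt(examples: List[Example], question: str) -> str:
    """Concatenate worked examples, then the new question.

    The prompt ends with an open "Reasoning:" line so the model is nudged
    to write its reasoning before giving an answer.
    """
    parts = [
        f"Question: {ex.question}\nReasoning: {ex.reasoning}\nAnswer: {ex.answer}"
        for ex in examples
    ]
    parts.append(f"Question: {question}\nReasoning:")
    return "\n\n".join(parts)
```

The resulting string can be sent to any text-completion model, which is why this style of prompting carries over between different LLMs without retraining.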

We do intend to create a decentralized dataset to further train models, and maybe get a 2B or 7B model working well.


valine 869 days ago

What kinds of problems are you seeing that you think could be improved with a fine-tune?


msp26 869 days ago

Thank you for linking that paper!
