| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by eurekin 539 days ago
	That's exactly what everybody advised me against doing - finetuning on own projects. Got really discouraged and stopped. So glad someone has done it!

4 comments

redeux 539 days ago

Almost no one knows if a project/business idea will be successful or not, so it's not much use asking. It's more productive to ask smart, experienced people how to best validate and execute an idea. People generally give useful and actionable feedback based on their experiences. Just make sure you understand who you're talking to when evaluating someone's advice.

link

mentalgear 538 days ago

"understand who you're talking to when evaluating someone's advice." Good you mentioned this, found out to this is a crucial part as well: Always perceive the advice you get depending on that person's background and interests (e.g. your target group, or domain-foreign expert).

link

menaerus 539 days ago

> That's exactly what everybody advised me against doing - finetuning on own projects

Why would someone advise against it? IMHO that sounds as the end game to me. If it weren't so darn expensive, I'd try this for myself for sure.

link

freehorse 539 days ago

I think that people suggest RAG, also because the models develop so fast that very probably the base model you finetune on will be obsolete in a year or so.

If we are approaching diminishing returns it makes more sense to finetune. As the recent advances seem to happen by throwing more compute to CoT etc maybe the time is close or has already come.

link

eru 538 days ago

What's CoT?

link

vlabakje90 538 days ago

Chain of Thought. When I see people using abbreviations like this I sometimes jokingly wonder what they do with all this time they're saving.

link

zitterbewegung 538 days ago

There are so many chain types it is easier to do the abbreviations. Basically extend a RAG to have a graph to influence how to either critisize itself or perform different actions. It has gotten to the point where there are libraries for define them. https://langchain-ai.github.io/langgraph/tutorials/introduct...

link

plagiarist 538 days ago

Perhaps they're preemptively reducing several tokens into one, for the machines' benefit.

link

freehorse 538 days ago

I post in twitter and invest in crypto

link

scosman 538 days ago

Fine tuning to a specific codebase is a bit strange. It's going to learn some style/tool guidance which is good (but there are other ways of getting), at the risk of unlearning some generalization it learned from looking at 1,000,000x more code samples of varied styles.

In general I'd suggest trying this first:

- Large context: use large context models to load relevant files. It can pickup your style/tool choices fine this way without fine tuning. I'm usually manually inserting files into context, but a great RAG solution would be ideal.

- Project specific instructions (like .cursorrules): tell it specific things you want. I tell it preferred test tools/strategies/styles.

I am curious to see more detailed evals here, but the claims are too high level to really dive into.

In generally: I love fine tuning for more specific/repeatable tasks. I even have my own fine-tuning platform (https://github.com/Kiln-AI/Kiln). However coding is very broad. Good use case for foundation large models with smart use of context.

link

QuesnayJr 539 days ago

Other people have spent a lot of time on it and gotten nowhere, so I suspect there is some art to it.

link

eurekin 538 days ago

They have? Is there a write up about that?

link