| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by factorymoo 827 days ago
	This might be an unfair statement but it really feels like all of these blogs don't know why. They copy/paste each other (you often seem the same errors in multiple notebooks/blogs) and I have a feeling no one really deeply understands what they're doing.

3 comments

unraveller 827 days ago

Found my answer for why thanks to the issues in latest dolphin fine-tune. They do these types of fine tunes mainly to reduce refusal rates and increase intelligence. They did the knee-jerk rerun of the same old data this time, as I suspected, just for lols to see where open-source is at.

Spoiler alert, fine-tunes won't be better until the data quality is better than meta's instruction fine-tune. Give it some weeks.

Why does [doplin-l3-8B] perform substantially worse in some tests?

Essentially, it's trained like this:

  LLama-3-8B-base_model --> LLama-3-8B-Instruct
  LLama-3-8B-base_model --> dolphin-2.9-llama3-8B

And not like this:

  LLama-3-8B-Instruct --> dolphin-2.9-llama3-8B

https://huggingface.co/cognitivecomputations/dolphin-2.9-lla...

link

jackblemming 827 days ago

Most of the entire field of machine learning is “try shit and see what works”. So it seems like they’re par for the course.

link

v3ss0n 827 days ago

Same as software engineering field too.

link

littlestymaar 827 days ago

It's even worse for AI given that nobody really understands why anything works.

link

sinuhe69 827 days ago

I wonder what we don’t understand from the SE POV?

link

ijk 827 days ago

One additional problem with people who write breathless tutorials about doing things with AI is that they are more likely than average to have been written with ChatGPT. Which, given the knowledge cutoff for most models, is not where I'd personally turn for data on recent technical developments, but is par for the course for the kind of low-effort copy-paste bloggers doing it for attention.

This particular one seems to be from someone who is documenting their learning process, which is a valuable contribution but, obviously, not a source of great authority on the how's and why's.

link