| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by vintagedave 597 days ago

It's a fine-tuned model over Llama 3.1, which is the kind of thing I -- a non-expert -- would want to do and have though of doing if I trained a LLM for a specific programming language for private (non-cloud-hosted) use. So this is both interesting and yet I lack to knowledge to really understand its impact in the LLM world.

> the model displays significant improvements in judgment and reward modeling.

This seems significant?

Technical report: https://nousresearch.com/wp-content/uploads/2024/08/Hermes-3...

1 comments

grudg3 597 days ago

Funny, I had the same thought today and looked into it, but it seems like getting a properly curated dataset is the most difficult part.

link