by vintagedave
597 days ago
It's a fine-tuned model over Llama 3.1, which is the kind of thing I -- a non-expert -- would want to do, and have thought of doing, if I trained an LLM for a specific programming language for private (non-cloud-hosted) use. So this is interesting, yet I lack the knowledge to really understand its impact in the LLM world.

> the model displays significant improvements in judgment and reward modeling.

This seems significant?

Technical report: https://nousresearch.com/wp-content/uploads/2024/08/Hermes-3...