Hacker News new | ask | show | jobs
by harrisoned 795 days ago
I'm playing with the llama 3 8b instruct model out of curiosity, and it is insanely better than the llama 2 on that regard. It's almost like a fully uncensored model. it did refused to make pentest scripts when i asked, which is fine. But it made the scripts after i changed the system prompt to something more 'permissive'. The model seem to adhere more to user commands, and it's more useful overall. It's even good at complex math, which is insane considering even GPT4 is bad at it.

I wasn't sure if meta would release the model to the public, i'm glad they did.

1 comments

I was trying to get it to code some general web scraping code, and it just repeatedly refused even given some convincing, what kind of system prompt are you having luck with?
If your intention is coding something complex, you should try a model finetuned for that, i don't think llama 3 is that good for coding. But i used standart prompt engineering stuff, same as from llama 2. Instead of chat, you use a completion mode, where you just need to give it a text to continue writing from.
> If your intention is coding something complex, you should try a model finetuned for that

Is there any consensus as to the models most suitable for coding complex things?

I personally don't use LLMs to code, besides a few snippets or when i'm out of ideas what/how do to. But models like Starcoder and Code Llama is what i see people often using for this purpose. There are benchmarks for various languages, you can find those on Hugging Face.