Hacker News new | ask | show | jobs
by lopkeny12ko 848 days ago
Gemma, despite being developed by a company worth billions of dollars, is a phenomonally poor model.

I tried the open source release yesterday. I started with the input string "hello" and it responded "I am a new user to this forum and I am looking for 100000000000000..." with zeros repeating forever.

Ok, cool I guess. Looks like I'll be sticking with GPT-4.

2 comments

Did you use the raw model or the instruction tuned one? 2B or 7B? You didn't give it much to go on.
The Mistral model I tried when it came out produced "blog posts" as responses. I assume this somehow depends on where those models get much of their training data from (please correct me if I'm wrong).