| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by lopkeny12ko 848 days ago

Gemma, despite being developed by a company worth billions of dollars, is a phenomonally poor model.

I tried the open source release yesterday. I started with the input string "hello" and it responded "I am a new user to this forum and I am looking for 100000000000000..." with zeros repeating forever.

Ok, cool I guess. Looks like I'll be sticking with GPT-4.

2 comments

fsmv 848 days ago

Did you use the raw model or the instruction tuned one? 2B or 7B? You didn't give it much to go on.

link

SunlitCat 848 days ago

The Mistral model I tried when it came out produced "blog posts" as responses. I assume this somehow depends on where those models get much of their training data from (please correct me if I'm wrong).

link