Hacker News new | ask | show | jobs
by dooraven 1062 days ago
> Bard release, it would have made more sense for them to have a more limited release of a better model for PR reasons than what actually happened.

Yes I would agree with you if Google wasn't set on to full on panic mode by their investors about releasing something vs Open AI due to Chat GPT's buzz.

Bard was just a "hey we can do this too" thing, it was released half assed, had next to no marketing or hype.

Vertex AI is their real proper offering, and I want to see how PaLM 2 does in comparison.

1 comments

I can already tell you that PaLM is not anywhere near as good and PaLM-2 is at least not as good before RLHF.

Not going to keep replying, believe what you want about Google's capabilities

@dooraven - I also work in ML (including recently working at Google) and I agree with @whimsicalism.

You seem to be under the mistaken belief that: 1. Google has competent high-level organization that effectively sets and pursues long term goals. 2. There is some advantage to developing a highly capable LLM but not releasing it.

(2) could be the case if Google had built an extremely large model which was too expensive to deploy. Having been privy to what they had been working on up until mid-2022 and knowing how much work, compute and planning goes into extremely large models, this would very much surprise me.

Note: I did not have much visibility into what deepmind was up to. Maybe they had something.

ok now I am confused, as Meta themselves say Palm-2 is better than Llama 2?

> Llama 2 70B results are on par or better than PaLM (540B) (Chowdhery et al., 2022) on almost all benchmarks. There is still a large gap in performance between Llama 2 70B and GPT-4 and PaLM-2-L.

https://scontent.fsyd7-1.fna.fbcdn.net/v/t39.2365-6/10000000...

If Google's publically available model is better Llama 2 already then why is it so inconceivable that they'd have private models that are better than their public ones which are better than LLama already.

Palm-2 isn't better than GPT-4 but the convo was about better than Llama models no?