Hacker News new | ask | show | jobs
Noam Shazeer Joins OpenAI (twitter.com)
141 points by lukasgross 20 hours ago
https://xcancel.com/NoamShazeer/status/2067400851438932297

https://www.reuters.com/technology/googles-gemini-co-lead-no...

17 comments

Some context for people who haven’t followed the full loop: Shazeer was a long-time Google researcher, joined Google in 2000, and was one of the co-authors of “Attention Is All You Need.”

He left Google in 2021 to co-found Character.AI. In 2024, Google brought him and some Character.AI researchers back via a licensing/talent deal with Character.AI (reportedly around $2.7B). He was then made a Gemini co-lead.

Now he’s leaving Google again for OpenAI.

Exciting times!

Very exciting indeed! The teenager -> suicide pipeline is one of the final frontiers for SV and it's nice to see Google enter in competition with Meta.
Sounds like yet another Scam Altman, perfect match indeed.
Well in terms of employers loyalty, they were hired to solve AI problems, and in this case they siphoned all the resources at Google, built all their tests on Google's money, time, resources, brains, crawled content, and then once it started to work left Google, leaving Google empty-handed.

Nice guys.

If you got your girlfriend/boyfriend by sneakily convincing them to cheat on their partner, don't be surprised if you are the next one to be cheated on.

What a waste of money.

I think it becomes somewhat more defensible when considering that the alternative was operatiny Google's policy (before the advent of competition) of "these models would bring unknown dangers in the hands of the public, we shouldn't release them until we better understand the implications" (or perhaps more selfishly "these effectively nullify all our detectors of generated text, if released they would instantly lose us the war on SEO").

(recall that OpenAI thought GPT-2 was too powerful to release for approximately tantamount reasons)

Companies are not your friend who you need to be loyal to. There's a reason noncompetes are illegal in California.
talent poaching is something pretty common in tech, google knows something like this can and will happen, so does openAI

also "empty handed" is just unnecesarily dramatic, he left all the knwoledge base he helped build, that's google's IP and is worth m(b?)illions

i dont keep up on this stuff so maybe i am missing some context.

should he have been obligated to stay at google for the rest of his career?

Oh no, he wasn't loyal to the soulless trillion dollar mega corp :( what a terrible person
> Sounds like yet another Scam Altman, perfect match indeed.

Not really.

Altman couldn't code his way out of a wet paper bag.

Noam is OTOH and IIUC the real deal.

https://old.reddit.com/r/singularity/comments/1u8xc9m/most_l...

Seems like there are some insights here!

edit: it seems the post has been removed but comments are viewable.

1 liner summary:

To put it lightly, the dude was politically outspoken and held strong beliefs.

Alright. OpenAI feels like a better fit for him after all
sigh how are so many brilliant people this stupid?
It gets to their head.

I had had a boss (from a YC-funded company, no less) that behaved in this way. Talked down on me with the g-slur, used language barriers to alienate his peers, and demanded religious sensitivity whenever we met after work. His entire life was defined by this religiously insecure identity, and several meetings were derailed by him thinking he was slighted by the rest of the team. That led to team members avoiding him, which reaffirmed his perception of being discriminated against. In reality we were all just baffled by his inability to adopt a secular work ethic.

As a queer person I could partially empathize with his behavior. Some of the smartest queer people I know are also frustrated, downtrodden and crass in protest of their mistreatment. But they're also generally grounded people that buckle down at work and get things done. They don't accuse people of being bigoted, lash out at coworkers or use slurs in the office. Perhaps it helps that queer identity isn't eschatological in nature, but that's only my best guess.

Noam Shazeer was one of the lead authors of the seminal paper "Attention Is All You Need", which introduced the transformer architecture. (From Wikipedia)
This understates his criticality. The author list was randomized, but the critical idea was truly his. Wonder what this says about GDM …
Source for this? The notion of attention dates to a content-addressable lookup during sequence alignment (as well as, concurrently, memory lookups in neural Turing machines). Attention had been used in other models, like GRUs and LSTMs with attention. The Vaswani et. al. paper did not introduce attention, just removed everything _but_ attention (and FFW) from the network. Are you claiming the "critical idea" of removing the GRU and LSTM parts and just keeping attention was "truly" Noam's?
I don't know we can just say things now. Ah we're on the internet
Is this a generally well known thing?
Even more important, I wonder what it says about HBW...
Even if we knew, we’d still fail to understand GHO
Wow. What could possibly have caused him to quit so soon after coming back?

I hope this is not accurate but I'm afraid it is: https://x.com/signulll/status/2067446889956430273

https://nitter.net/signulll/status/2067446889956430273 for those who don't want to click the above
signull is more of an anonymous sh*tposter than a known industry insider, but I think this does capture the sama contribution to OpenAI very well. At least from an outsider who follows this stuff based on vibes.
That twitter story isn't anything unique to OpenAI or Google, it's just classic "big public corp vs private startup" culture. Once you have to worry about the SEC, shareholders, antitrust, regulations, lawsuits, etc. it's very, very difficult to avoid turning into "big corp" culture.

Sama, and any other founder, will always have a difficult fight against bureaucracy, and once you let a little bit in, the bureaucracy's sole purpose becomes to grow itself.

Google and Apple both need a culling similar to what Elon did with Twitter after taking over.
I disagree. It's not about the culling, it has never been, and actually, it makes things worse. You spend countless hours and tons of money recruiting talented people not to lay them off because you don't want a bureaucratic org.

If the issue is inefficiency, tons of meetings, too much team alignment etc, then that's the issue that you need to tackle, and these issues can already appear in a 50-100 employee company. Sure, that's an easy problem to solve with a smaller size but unless you hired people for no reason, these people have a very specific set of problems to tackle and are often, in these companies, the best in class to tackle them, culling half of the company isn't going to make things better.

(And X rehired part of the laid-off engineers)

Google bloat gave us transformers. Apple bloat gave us a usable touchscreen only, pocket computer (famously an entire org within Apple had developed an iPod-based approach that was competing with what was released)

The leaps forward need bloat. A startup can execute on specific vector direction way better.

Now back to your point, what did X deliver with its lean ops? It seems that it needed 2 bailouts (one from xAI, and one from space X)

Google is facing a legitimate innovators dilemma here. It makes sense to have all this process when youre protecting a $4.5 trillion golden goose. The tragedy here is that one predictable outcome of this situation is google deciding to considerably cut research funding when they figure out it just serves to bootstrap future competitors.
This is when it makes sense to split your business up into multiple smaller businesses. The government should be doing this via anti-trust but they have dropped the ball there so, at this point, the corps really need to just do it to themselves to better compete.
Wasn't that what the whole Alphabet re-org was supposed to do?
If I had to make a guess, money played a role lol.
He is close or already a billionaire, not sure much more money will be do much heavy-lifting
going to go with "money" and a lot of BS from altman
This reads like an episode of Silicon Valley. I wish that show was rebooted, they'd have so much funny material nowadays.
Very bad news for Gemini - the brief comeback with 2.5 Pro last year looked to be driven by Noam
Don't think it matters in the long run to be honest. The models have no moat, they are becoming a commodity.

Besides that, Google is in a pretty good position, they're not bleeding money on AI like Anthropic/OpenAI, and they own product verticals where they can integrate it. Plus they have a mature ads-model which is what might actually drive a bit of revenue for LLMs.

I think the 'models have no moat' thing is overblown. Only like 3-4 companies in the entire world have cutting edge models, that means there is some kind of moat...
Money.

That's their moat.

Maybe also stolen copyrighted content that cannot be found anywhere else now, so they are the only ones who can train on it.

money. but it eventually runs out
A little IPO is the solution.

Don't we all want to (automatically) and passively invest in a company losing billions of dollars ?

At least we can diversify our portfolio from SpaceX.

Pre-Quote: "We are all going to lose, hundreds of billions"
Their moat is running in my basement right now, on a server plugged into a 220V 30A circuit.
yeah, sure, look at anthropic revenue, what is it if not the moat? you can argue for how long but for them good model = the fastest growing company ever.
Revenue is not a metric of success at all.

Grabbing market-share if you have investors that are ready to burn cash infinetely. Find a hot niche, buy a banana 1 USD, sell it for 0.10 USD.

Example: Cursor, they became popular because they were selling ChatGPT unlimited for 20 USD / month.

When they launched, just a reskinned VS Code, "fastest growing AI company"

No coincidence they were bought by SpaceX, who wants to consolidate revenue even if non-sense as long it helps other investors to exit. It shows rapid growth.

Profit is the real moat.

One example: Nvidia. Proprietary tooling, proprietary IP, proprietary hardware, no alternative, expensive.

> models have no moat

Possibly true. Any smart innovations developed by one organization will be smuggled into others.

Training, inferring, and data collection, infrastructures are definitely moats. High-volume usage feedback is also hard to come by for new entrants.

And Google has all of those. Custom silicon, more data than anyone else and probably the most comprehensive data collection system, and phones in the hands of 73% of the global smartphone using population to push gemini into to get high volume usage feedback and even more telemetry and data.
And they've had some initial success with TPUs which could be a major differentiator in the future.
Yup, and they have the Apple partnership for now as well. Much better position generally than OpenAI in my opinion.
I don't think you're honestly accounting for the engineering behind the progress models are making. If it was just a matter of compute on hand and iterating, Meta would be neck and neck with Ant, OAI, and Google, but clearly you've gotta have more.

Noam has a deep expertise in these systems at every level, both algorithmically and at production scale, and knows how to leverage things at different levels.

It's not like Google won't have anyone else that can do what he does, but at the same time, it's an implicit criticism of Google's culture, operations, development, and overall AI program. Shazeer is well past the point where the paycheck is the deciding factor, although I'm certain he is very well paid. Having the freedom to innovate and build free from the corporate fuckery of Google and Facebook is probably more valuable than the pay raise he got with the move, and OAI has the advantage of not having to cope with decades of corporate cruft and inertia. They'll get there - all corporations do - but they're relatively young enough to still be nimble.

I honestly don't think that matters for multiple reasons:

1. There are already multiple "sota" models on the market that compete with only marginal gains between them (OpenAI, Anthropic, Google/Gemini) and some that are catching up (DeepSeek, Qwen,..).

2. The fact that something is a hard engineering problem does not mean it's generating revenue. So while what you said is true, deep expertise is required to push the industry forward, I don't think that is going to matter for the bottom line of these companies. Hence why I think the models don't give a company any 'moat' in a capitalist economy.

Silver lining: given the leaked financials of OpenAI, he might very well be joining a sinking ship.

Also, why didn't they nail him down contractually when they bought character.ai ... isn't that pretty standard with these type of superstat (re)hires?

Looks like Google is leaking both AI talent and know-how something fierce ... and since the very day the transformer paper was written.

As an outsider, I'd be really curious to understand why, given how well positioned they seem to be in the AI battle:

- huge, quasi unmatched data war chest

- huge, quasi unmatched, planet-scale infrastructure

- native AI chip design and production (TPU)

- the core ideas for what we now know as "AI" were invented there

- deepmind, enough said

- pretty much the deepest pocket of all the AI players with the possible exception of MSFT

- a massively large user base and reach to deploy AI to (Android, YT, Cloud, Search, Email, ...)

- supposedly one the best engineering culture of the valley

Why do the best people leave ?

Why do their AI product always come in 3rd place ?

Why can't they seem to take the lead, both in terms of product design or in term of raw LLM performance?

The only answer I can think of is:

- culture is completely broken

- management sucks something fierce

- company is so fat and rich no one is actually interested in winning anymore

Wow - Google paid a couple billion dollars to bring Noam back. Really impressive by OAI if this reporting is accurate!
It is accurate. Confirmed by Noam himself on X https://x.com/i/status/2067400851438932297
Love the choice of words by Noam- exceptional team for OpenAI, amazing team for GDM.
Love this type of detailed textual analysis.
I hope this doesn't impact Google's progress on open models.
Is Shazeer known to be opposed to open-weight releases?
OpenAI hasn't released open weights since GPT-OSS-20/120B. Google has the Gemma line.

I wouldn't expect OpenAI to start releasing open weight competitive models again, but I could be wrong.

Surprised to not see more comments on this, especially given the popularity of the Anthropic/Karpathy article. What a win for OpenAI - and what a loss for Google, just 2 years after paying $2.7bn to bring Noam back into the fold. Does not bode well for Gemini long-term... Or could be a signal for how deeply they are leaning into world models.
I think nobody they acquired from Character.AI is at Google anymore.
is it partly due to alleged antisemitism at google?
This does suck for Google. Noam will take a lot of Google trade secrets with him to OpenAi. Google's bench is deeper than this one guy though.
From the excited comments and fanboyism, I have to say KRAZAM predicted the cult of personality that has infected the AI space.

https://www.youtube.com/watch?v=KIZt9YPAPZo&ra=m

I guess this means Google is nowhere close, to even discern a hint of an AGI? So when Demis Hassabis says AGI...could arrive in just 3 years he has learned the best from Larry Ellison?
I would guess it means Sam Altman gave him more money.
Its getting pretty lame that we talk about the these guys like they're football players transferring teams.
Speak for yourself, my Fantasy Developer League is crushing it this season
How do I ̷g̷a̷m̷b̷l̷e̷ sports bet on this
I feel like there was a scene in Silicon Valley about a developer fantasy league.
Krazam already has a video covering this exact idea.
In this case, it's not a new thing ... back in 2005 (yes 21 years ago), people talked about the achievements of Noam Shazeer at Google (and Jeff Dean and Sanjay, etc)
It could be the opposite. Those are really useful people, they deserve this more than football players
Idk, football players actually make a bunch of people happy and entertained. 80% of the United States wishes this tech never existed.

What they're working on is just making peoples jobs, skills obsolete and trying to invent machines that will concentrate the worlds wealth into the hands of the people who own those machines.

Very few people interpret football so much that the actual frontier work of the best players matter. Out of 30 friends I know who like football only 1 of them could explain what’s going on in the field technically. For most people, pro players are replaceable.

Popular entertainment and unique progress of human civilization can’t be really compared either

This "guy" is worth on the order of all football players put together.
What's the AI equivalent of NIL?
Have you seen the Krazam fantasy FAANGball sketch? https://www.youtube.com/watch?v=KIZt9YPAPZo

It's funny, but with the AI hires/moves it feels more like satire now.

wait this is kinda brilliant tho
We're a community of geeks. We admire Tesla, Feynman, Linus and such. For me they are far greater than football players
Tell me open ai are in emergency mode without telling me they are in emergency mode
[flagged]
We've banned this account.