Some context for people who haven’t followed the full loop: Shazeer was a long-time Google researcher, joined Google in 2000, and was one of the co-authors of “Attention Is All You Need.”
He left Google in 2021 to co-found Character.AI. In 2024, Google brought him and some Character.AI researchers back via a licensing/talent deal with Character.AI (reportedly around $2.7B). He was then made a Gemini co-lead.
Very exciting indeed! The teenager -> suicide pipeline is one of the final frontiers for SV and it's nice to see Google enter in competition with Meta.
Well in terms of employers loyalty, they were hired to solve AI problems, and in this case they siphoned all the resources at Google, built all their tests on Google's money, time, resources, brains, crawled content, and then once it started to work left Google, leaving Google empty-handed.
Nice guys.
If you got your girlfriend/boyfriend by sneakily convincing them to cheat on their partner, don't be surprised if you are the next one to be cheated on.
I think it becomes somewhat more defensible when considering that the alternative was operatiny Google's policy (before the advent of competition) of "these models would bring unknown dangers in the hands of the public, we shouldn't release them until we better understand the implications" (or perhaps more selfishly "these effectively nullify all our detectors of generated text, if released they would instantly lose us the war on SEO").
(recall that OpenAI thought GPT-2 was too powerful to release for approximately tantamount reasons)
I had had a boss (from a YC-funded company, no less) that behaved in this way. Talked down on me with the g-slur, used language barriers to alienate his peers, and demanded religious sensitivity whenever we met after work. His entire life was defined by this religiously insecure identity, and several meetings were derailed by him thinking he was slighted by the rest of the team. That led to team members avoiding him, which reaffirmed his perception of being discriminated against. In reality we were all just baffled by his inability to adopt a secular work ethic.
As a queer person I could partially empathize with his behavior. Some of the smartest queer people I know are also frustrated, downtrodden and crass in protest of their mistreatment. But they're also generally grounded people that buckle down at work and get things done. They don't accuse people of being bigoted, lash out at coworkers or use slurs in the office. Perhaps it helps that queer identity isn't eschatological in nature, but that's only my best guess.
Noam Shazeer was one of the lead authors of the seminal paper "Attention Is All You Need", which introduced the transformer architecture. (From Wikipedia)
Source for this? The notion of attention dates to a content-addressable lookup during sequence alignment (as well as, concurrently, memory lookups in neural Turing machines). Attention had been used in other models, like GRUs and LSTMs with attention. The Vaswani et. al. paper did not introduce attention, just removed everything _but_ attention (and FFW) from the network. Are you claiming the "critical idea" of removing the GRU and LSTM parts and just keeping attention was "truly" Noam's?
signull is more of an anonymous sh*tposter than a known industry insider, but I think this does capture the sama contribution to OpenAI very well. At least from an outsider who follows this stuff based on vibes.
That twitter story isn't anything unique to OpenAI or Google, it's just classic "big public corp vs private startup" culture. Once you have to worry about the SEC, shareholders, antitrust, regulations, lawsuits, etc. it's very, very difficult to avoid turning into "big corp" culture.
Sama, and any other founder, will always have a difficult fight against bureaucracy, and once you let a little bit in, the bureaucracy's sole purpose becomes to grow itself.
I disagree. It's not about the culling, it has never been, and actually, it makes things worse. You spend countless hours and tons of money recruiting talented people not to lay them off because you don't want a bureaucratic org.
If the issue is inefficiency, tons of meetings, too much team alignment etc, then that's the issue that you need to tackle, and these issues can already appear in a 50-100 employee company. Sure, that's an easy problem to solve with a smaller size but unless you hired people for no reason, these people have a very specific set of problems to tackle and are often, in these companies, the best in class to tackle them, culling half of the company isn't going to make things better.
Google bloat gave us transformers. Apple bloat gave us a usable touchscreen only, pocket computer (famously an entire org within Apple had developed an iPod-based approach that was competing with what was released)
The leaps forward need bloat. A startup can execute on specific vector direction way better.
Now back to your point, what did X deliver with its lean ops? It seems that it needed 2 bailouts (one from xAI, and one from space X)
Google is facing a legitimate innovators dilemma here. It makes sense to have all this process when youre protecting a $4.5 trillion golden goose. The tragedy here is that one predictable outcome of this situation is google deciding to considerably cut research funding when they figure out it just serves to bootstrap future competitors.
This is when it makes sense to split your business up into multiple smaller businesses. The government should be doing this via anti-trust but they have dropped the ball there so, at this point, the corps really need to just do it to themselves to better compete.
Don't think it matters in the long run to be honest. The models have no moat, they are becoming a commodity.
Besides that, Google is in a pretty good position, they're not bleeding money on AI like Anthropic/OpenAI, and they own product verticals where they can integrate it. Plus they have a mature ads-model which is what might actually drive a bit of revenue for LLMs.
I think the 'models have no moat' thing is overblown. Only like 3-4 companies in the entire world have cutting edge models, that means there is some kind of moat...
yeah, sure, look at anthropic revenue, what is it if not the moat? you can argue for how long but for them good model = the fastest growing company ever.
Grabbing market-share if you have investors that are ready to burn cash infinetely. Find a hot niche, buy a banana 1 USD, sell it for 0.10 USD.
Example: Cursor, they became popular because they were selling ChatGPT unlimited for 20 USD / month.
When they launched, just a reskinned VS Code, "fastest growing AI company"
No coincidence they were bought by SpaceX, who wants to consolidate revenue even if non-sense as long it helps other investors to exit. It shows rapid growth.
Profit is the real moat.
One example: Nvidia. Proprietary tooling, proprietary IP, proprietary hardware, no alternative, expensive.
And Google has all of those. Custom silicon, more data than anyone else and probably the most comprehensive data collection system, and phones in the hands of 73% of the global smartphone using population to push gemini into to get high volume usage feedback and even more telemetry and data.
I don't think you're honestly accounting for the engineering behind the progress models are making. If it was just a matter of compute on hand and iterating, Meta would be neck and neck with Ant, OAI, and Google, but clearly you've gotta have more.
Noam has a deep expertise in these systems at every level, both algorithmically and at production scale, and knows how to leverage things at different levels.
It's not like Google won't have anyone else that can do what he does, but at the same time, it's an implicit criticism of Google's culture, operations, development, and overall AI program. Shazeer is well past the point where the paycheck is the deciding factor, although I'm certain he is very well paid. Having the freedom to innovate and build free from the corporate fuckery of Google and Facebook is probably more valuable than the pay raise he got with the move, and OAI has the advantage of not having to cope with decades of corporate cruft and inertia. They'll get there - all corporations do - but they're relatively young enough to still be nimble.
I honestly don't think that matters for multiple reasons:
1. There are already multiple "sota" models on the market that compete with only marginal gains between them (OpenAI, Anthropic, Google/Gemini) and some that are catching up (DeepSeek, Qwen,..).
2. The fact that something is a hard engineering problem does not mean it's generating revenue. So while what you said is true, deep expertise is required to push the industry forward, I don't think that is going to matter for the bottom line of these companies. Hence why I think the models don't give a company any 'moat' in a capitalist economy.
Surprised to not see more comments on this, especially given the popularity of the Anthropic/Karpathy article. What a win for OpenAI - and what a loss for Google, just 2 years after paying $2.7bn to bring Noam back into the fold. Does not bode well for Gemini long-term... Or could be a signal for how deeply they are leaning into world models.
I guess this means Google is nowhere close, to even discern
a hint of an AGI? So when Demis Hassabis says AGI...could arrive in just 3 years he has learned the best from Larry Ellison?
In this case, it's not a new thing ... back in 2005 (yes 21 years ago), people talked about the achievements of Noam Shazeer at Google (and Jeff Dean and Sanjay, etc)
Idk, football players actually make a bunch of people happy and entertained. 80% of the United States wishes this tech never existed.
What they're working on is just making peoples jobs, skills obsolete and trying to invent machines that will concentrate the worlds wealth into the hands of the people who own those machines.
Very few people interpret football so much that the actual frontier work of the best players matter. Out of 30 friends I know who like football only 1 of them could explain what’s going on in the field technically. For most people, pro players are replaceable.
Popular entertainment and unique progress of human civilization can’t be really compared either
He left Google in 2021 to co-found Character.AI. In 2024, Google brought him and some Character.AI researchers back via a licensing/talent deal with Character.AI (reportedly around $2.7B). He was then made a Gemini co-lead.
Now he’s leaving Google again for OpenAI.
Exciting times!