Hacker News new | ask | show | jobs
by GlitchInstitute 279 days ago
Gemini is very fast because it runs on TPUsV7 mostly
1 comments

It is definitely because it's a smaller model. TPUv7 has ~10% lower flops at FP8 and 33% lower memory bandwidth than Nvidia Blackwell cards. Add CUDA to the comparison and they'll probably be even worse at real world utilization. Grok is already running on Blackwell cards and although there's little info on GPT5, I doubt they are behind.