Hacker News
by rileyphone 705 days ago
Gemini is probably using ring attention. But scaling to that size takes more engineering effort on the interconnect, which goes beyond the scope of this release from Mistral.