Y
Hacker News
new
|
ask
|
show
|
jobs
by
rileyphone
705 days ago
Gemini is probably using ring attention. But scaling to that size requires more engineering effort in terms of interlink that goes beyond the purpose of this release from Mistral.