Hacker News new | ask | show | jobs
by v3ss0n 1137 days ago
https://deepai.org/publication/scaling-transformer-to-1m-tok...

Can this be implemented in current opensource models?