Y
Hacker News
new
|
ask
|
show
|
jobs
by
sosodev
73 days ago
I don't know how well it performs, but you can extend Qwen3.5 to 1 million token context using YaRN. Also, Nemotron 3 Super was recently released and scales up to 1 million token context natively.