by kingsleyopara 703 days ago
The biggest win here has to be the context length increase from 8k to 128k tokens. Until now, my understanding was that there haven't been any open models anywhere close to that.

It is notable, but it's not alone. Mistral NeMo was released just last week with a 128k context window:

https://news.ycombinator.com/item?id=40996058

Thanks! Not sure how I missed that :)
It's easy to miss things. Trying to keep up with the latest in AI news is like drinking from the firehose -- it's never-ending.
Phi 3