by
ComputerGuru
853 days ago
Is that the sliding context window size? I ask because I didn't get good results with sliding context windows in the regular Mistral models.
ajcp
853 days ago
Yeah, I think they fine-tune without targeting a specific window size, then keep expanding the context until it starts falling over.