Hacker News new | ask | show | jobs
by ComputerGuru 853 days ago
Is that the sliding context window size? Because I didn't have good results with sliding context windows in the regular Mistral models.
1 comments

Yeah, I think they fine-tune without a specific window size target to achieve and then keep expanding context until it starts falling over.