I think so many people (including me) effectively ignored Mistral 0.1's sliding window that few realized 0.2 instruct is native 32K.
Mistral 7b instruct 0.2 is just an instruct fine tune of Mistral 7b and stays with a 8k context.
I think so many people (including me) effectively ignored Mistral 0.1's sliding window that few realized 0.2 instruct is native 32K.