Hacker News new | ask | show | jobs
by tarruda 851 days ago
Mixtral 8x7B has 32k context.

Mistral 7b instruct 0.2 is just an instruct fine tune of Mistral 7b and stays with a 8k context.