by brucethemoose2
876 days ago
I've had less luck with Mixtral, but I run Yi 34B finetunes for general personal use, including quick queries for work. It's kinda like GPT-3.5 with no internet access and slightly less reliable responses, but unrestrained, much faster, and with a huge usable context (up to 75K tokens on my Nvidia 3090). Mixtral is extremely fast though, at least at a batch size of 1.