|
|
|
|
|
by onion2k
753 days ago
|
|
I run Mistral 7b and Llama 3 locally using jani.ai on a 32GB Dell laptop and get about 6 tokens per second with a context window of 8k. It's definitely usable if you're patient. I'm glad I also have a Hugging Face account though. |
|
https://jan.ai/
https://github.com/janhq/jan