|
|
|
|
|
by geuis
900 days ago
|
|
I have a 2020 16in MacBook Pro. I think it's the last generation of Intel chips. I've been struggling to get some of the LLM models like Mixtral to run on it. I hate the idea of needing to buy another $3k laptop less than 4 years after spending that much on my current machine. But if I want to get serious about developing non-chatgpt services, do I need a new M2 or M3 chip to get this stuff running locally? |
|
It’s a no contest option 2 for me.
Just use llama.cpp with any of the available UIs. It will be usable with 4 but quantization on CPU. You can use any of the “Q4_M” “GGUF” models that TheBloke puts out on Huggingface.
https://github.com/ggerganov/llama.cpp
UI projects in description.
https://huggingface.co/TheBloke
A closed source option is LMStudio.
https://lmstudio.ai/