Hacker News new | ask | show | jobs
by fy20 1045 days ago
You can probably run it locally with llama.cpp using CPU only, but it will be slow. I have a couple year old laptop with a RTX 3060 and it runs pretty well split across the CPU and GPU.