by fy20, 1045 days ago
You can probably run it locally with llama.cpp on the CPU alone, but it will be slow. I have a couple-year-old laptop with an RTX 3060, and it runs pretty well with the model split across the CPU and GPU.
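For anyone wondering how to pick the split: llama.cpp has an `--n-gpu-layers` (`-ngl`) option that sets how many layers go to the GPU, with the rest staying on the CPU. Below is a rough back-of-the-envelope sketch for guessing a starting value. The function name, the 1 GB reserve, and the example numbers (an 8 GB model file, 32 layers, 6 GB of VRAM) are all my own assumptions for illustration, not anything llama.cpp computes for you, and it treats every layer as the same size, which is only approximately true.

```python
def layers_to_offload(model_size_gb: float, n_layers: int,
                      vram_gb: float, reserve_gb: float = 1.0) -> int:
    """Estimate how many layers fit in GPU memory (hypothetical helper).

    Assumes all layers are roughly equal in size and keeps `reserve_gb`
    of VRAM free for the context cache and scratch buffers.
    """
    if model_size_gb <= 0 or n_layers <= 0:
        raise ValueError("model_size_gb and n_layers must be positive")
    per_layer_gb = model_size_gb / n_layers      # average size of one layer
    usable_gb = max(vram_gb - reserve_gb, 0.0)   # VRAM left after the reserve
    fit = int(usable_gb // per_layer_gb)         # whole layers that fit
    return min(n_layers, fit)                    # never exceed the layer count


# Illustrative numbers only: 8 GB model file, 32 layers, 6 GB of VRAM.
print(layers_to_offload(8.0, 32, 6.0))  # prints 20
```

Treat the result as a first guess for `-ngl`, then nudge it down if you run out of VRAM or up if there is headroom left.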