Hacker News new | ask | show | jobs
by vczf 1041 days ago
70B llama.cpp works now. You need the temporary `-gqa 8` flag for 70B.

You can even extend context with RoPE!