Hacker News new | ask | show | jobs
by btbuildem 500 days ago
Largest R1, as in the 671B? How do you accomplish that feat?
1 comments

Just do it? Llama.cpp doesn't load the entire thing into ram. It mmaps the file and the kernel takes care of the rest.