Hacker News new | ask | show | jobs
by gertop 617 days ago
Llamafile is great if you don't want to run any meaningful models because it's limited to 4GB.
1 comments

That's a Windows limitation, though.
Even on Windows, you just run the binary separate from the model file. I actually run a single binary separate from the model files because I run it with multiple of them, so I kind of forgot that that was even the default way it kind of expects you to hold it.