Hacker News new | ask | show | jobs
by zozbot234 63 days ago
Try running CPU-only inference to troubleshoot that. GPU layers will likely just ignore mmap.