Hacker News
by tjchear 112 days ago
If you’re using it with a local model, you need a lot of GPU memory to load the model. Unified memory is great here, since the GPU can use almost all of the system RAM to hold the model.
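
For a rough sense of "a lot", here is a back-of-the-envelope sketch in Python. It only counts the weights (parameter count times bytes per weight) and ignores the KV cache, activations, and runtime overhead, so real usage will be higher. The function names and the `usable_fraction` default are made up for illustration; how much unified memory the GPU can actually claim depends on the OS and hardware.

```python
def weights_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits(params_billion: float, bits_per_weight: float,
         memory_gib: float, usable_fraction: float = 0.75) -> bool:
    """Check whether the weights fit in the memory the GPU can use.

    usable_fraction is an assumed share of total memory available to
    the GPU, not a measured or documented value.
    """
    budget_gib = memory_gib * usable_fraction
    return weights_gib(params_billion, bits_per_weight) <= budget_gib


if __name__ == "__main__":
    # Example: a 70B-parameter model on a machine with 64 GiB of unified memory.
    for bits in (16, 8, 4):
        size = weights_gib(70, bits)
        print(f"{bits}-bit: {size:.1f} GiB, fits: {fits(70, bits, 64)}")
```

Under these assumptions, quantization matters as much as the memory size: the same model can be far over budget at 16 bits per weight and comfortably under it at 4 bits.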