Hacker News new | ask | show | jobs
by kken 806 days ago
Try Ollama or LM Studio. Mixtral and its finetunes work perfectly for me on my RTX3090 with offloading.