Hacker News
by wkat4242 456 days ago
Hmm yeah but it needs 30 seconds to give an answer. And that's with the LLM running on a GPU with HBM2 memory.