Hacker News new | ask | show | jobs
by wtallis 37 days ago
It should only need to load one copy of the weights, but each tab/site will need a separate context and KV cache.