Hacker News
by hashmap 1 day ago
You can, yes. The db becomes model weights, so retrieval is just lookups, and attention lives in VRAM.
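
One possible reading of this, as a rough sketch rather than anything the parent described: store the db rows as key and value matrices that behave like fixed weights, and treat retrieval as an attention-style lookup over them. Everything here (`KEYS`, `VALUES`, `attention_lookup`, `hard_lookup`, the `scale` parameter) is made up for illustration, and plain Python lists stand in for tensors that would sit in GPU memory in a real setup.

```python
import math

# Hypothetical toy "database": row i is the pair (KEYS[i], VALUES[i]).
# Held as fixed matrices, these play the role of model weights.
KEYS = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
VALUES = [
    [10.0, 0.0],
    [0.0, 20.0],
    [30.0, 30.0],
]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def attention_lookup(query, keys, values, scale=1.0):
    """Soft lookup: blend all values, weighted by how well each key matches."""
    weights = softmax([scale * dot(query, key) for key in keys])
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]


def hard_lookup(query, keys, values):
    """Hard lookup: return the value whose key has the largest dot product."""
    scores = [dot(query, key) for key in keys]
    best = max(range(len(keys)), key=lambda i: scores[i])
    return values[best]


if __name__ == "__main__":
    q = [0.0, 1.0, 0.0]
    print(hard_lookup(q, KEYS, VALUES))
    print(attention_lookup(q, KEYS, VALUES, scale=50.0))
```

With a large `scale` the soft lookup approaches the hard one, which is roughly the sense in which attention over stored keys can act as retrieval.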