Hacker News new | ask | show | jobs
by choilive 657 days ago
Because this doesn't prompt cache? Prompt caching is dumping out the calculated values from vRAM onto disk and reloading them back into memory as necessary.