Hacker News new | ask | show | jobs
by edunteman 256 days ago
An interesting alternative product to offer is injecting prompt cache tokens into requests where they could be helpful; not bypassing generations but at least low hanging fruit for cost savings