Hacker News new | ask | show | jobs
Low-Rank KV Attention: 50% Less Memory, Better Models (fin.ai)
2 points by destraynor 64 days ago