by george_123 1042 days ago
1) When the blog post was released, TGI didn't support paged attention. 2) Many people don't even know about TGI as a way to reduce inference costs.