by gvd 1043 days ago
So what? TGI also supports this.
1 comments

1) When the blog post was released, TGI didn't support paged attention. 2) Many people don't even know that TGI can be used to reduce inference costs.