Hacker News new | ask | show | jobs
Google's TurboQuant offers LLMs up to 6x compression (arstechnica.com)
4 points by cwt137 80 days ago