Hacker News new | ask | show | jobs
by conshama 16 days ago
Somehow I never got the hype around Groq. They served models with fast inference speed - that sounded great in theory - and as a user I was looking forward to use them. But, when I did try, I discovered that they are quantizing the models underneath. And they dont even disclose it. So I stopped using them.

The whole thing never made any sense to me - but I guess AI hype is a thing.

1 comments

I believe despite quantisation they were still extremely fast, which is still incredibly useful if you don't need high precision/accuracy (which is good enough for many use cases)