by hank2000 37 days ago
Y'all want to see instant? Check out chatjimmy.ai, it'll blow your mind. I'm not affiliated.

But the things it unlocks in a product I'm building are mind-blowing. Millisecond inference, even on much older models, will change the whole game. Enough to run inference on every. Single. API call. Without noticeable disruption. This sh*t is wild.

3 comments

Do you have more information on this? I thought Groq was fast, but this is insane.

EDIT: it’s this company https://taalas.com/products/

Yeah, if I could get double-digit ms latency out of gpt-4.1, that would be game-changing.
That for sure is instant