| HN Mirror

After reviewing what they have on their playground, this thing seems to be a scam.

They're running Qwen on a traditional LLM pipeline. The "diffusion effect", as it says there, it's just a decorative, lmao. That in itself shouldn't break the deal as I understand you have to put on a show, but, looking at the latency and timing of their outputs this is not a diffusion model, as they claim. They're also not even close to the 1,000 TPS figure they put out.

I'm surprised nobody on this forum got the slightest clue on that. I guess I should 4x my fee again :).