Hacker News new | ask | show | jobs
by MaxPock 20 days ago
How are these "capacity constrained" Chinese companies running inference without Hoppers and Blackwells ?
4 comments

Huawei Ascend AI Accellerators. DeepSeek V4 model architecture was optimized for Chinese hardware.
They can (not entirely sure how 'grey' market this is) either have subsidiaries outside of china (eg: singapore) that provide the inference and/or just rent it off the public gpu clouds.
Making their own NPUs for inference probably, you don't have to buy NVidia for inference. Google doesn't.