Hacker News new | ask | show | jobs
by lostmsu 9 days ago
They say they are using https://github.com/tile-ai/TileRT

- persistent CUDA kernel

- tiled processing with overlapping read/writes

- model designed with specific constraints in mind

1 comments

Excuse me, do aliens live among us? 17 commits, 99% Python and multiplying the speed of GLM, Deepseek V4, MiMO 2.5?
tilert is closed source, the repo is just a python wrapper that invokes the binary.