by mohsaied 360 days ago
This is the first step towards fully automated GPU performance optimization. The idea is to automatically generate GPU kernels, then automatically integrate them into vLLM/SGLang/PyTorch.