Hacker News
Large Transformer Model Inference Optimization (lilianweng.github.io)
3 points by axit 1251 days ago