Hacker News new | ask | show | jobs
by teaearlgraycold 241 days ago
Much of this is probably down to optimized transformer kernels.