Hacker News new | ask | show | jobs
Optimal Performance Without Static Graphs by Fusing Tensor Operation Streams (burn.dev)
5 points by nathanielsimard 823 days ago
1 comments

Happy to share what we have been working on lately. The blog post explores Burn's tensor operation stream strategy, optimizing models through an eager API by creating custom kernels with fused operations. Our custom GELU experiment reveal a remarkable improvement of up to 78 times on our WGPU backend.