Hacker News new | ask | show | jobs
DiffusionBlocks – Block-Wise NN Training via Diffusion Interpretation (github.com)
2 points by aanet 27 days ago
1 comments

"DiffusionBlocks, a principled framework that partitions transformers into independently trainable blocks, reducing memory requirements proportionally while maintaining competitive performance across diverse architectures and tasks."