Hacker News new | ask | show | jobs
by bdbenton5255 359 days ago
Certainly an important discovery for utilizing these models on scaled hardware. This approach could certainly be applied beyond LLMs to other types of neural networks. That would be an interesting space to explore.
1 comments

Thanks for the feedback! Yes, we believe the approach is general and applicable to other ML workloads.