Y
Hacker News
new
|
ask
|
show
|
jobs
by
bdbenton5255
359 days ago
Certainly an important discovery for utilizing these models on scaled hardware. This approach could certainly be applied beyond LLMs to other types of neural networks. That would be an interesting space to explore.
1 comments
zhihaojia
359 days ago
Thanks for the feedback! Yes, we believe the approach is general and applicable to other ML workloads.
link