Hacker News new | ask | show | jobs
by ejro 994 days ago
Interesting. This is probably a universal problem for large model training but not being discussed enough.