Hacker News
by hedgehog 940 days ago
Another thing to note: this looks to be around seven total days of training on at most 4 A100s. Not all cutting-edge work requires a data-center-sized cluster.