by suggala 5 days ago
It would take less than a month if not for the restrictions. One of the reasons is that they might be using distillation to achieve parity.
1 comment

Oh, do you not pay attention to the hardware they're allowed to buy from Nvidia? At this point, it's more a matter of being nerfed than of being able to do the magical training stuff.