Hacker News new | ask | show | jobs
by pandaforce 86 days ago
Most vision models are trained on images or conventional video codecs. There's a good reason why H200's have 7 JPEG + 7 nvdec ASICS.