Hacker News new | ask | show | jobs
by svachalek 235 days ago
Per the very short article, the solution was to pack multiple models per GPU.
1 comments

yes but that could mean a layer per model