|
|
|
|
|
by parineum
82 days ago
|
|
> but there are a lot of new ideas in terms of architecture that may warrant massive training runs I don't think the argument is that isn't true, it's that the gains from those massive training runs is diminishing. Eventually, it won't be worth it to do the run for each new idea, you'll have to bundle a bunch together to get any noticeable change. |
|