by DoctorOetker
134 days ago
It is instructive to calculate the size and requirements of a system that can pretrain a 405B-parameter transformer in ~17 days. A separate question is the expected payback time. Unless someone can demonstrate a reasonable calculation showing a sufficiently short payback period, that question stays open; and even if no one here can, we still can't exclude that big tech sees something we don't have access to (for example, the launch costs they charge third parties may differ from the launch costs they charge themselves). Suppose the payback time is in fact sufficiently short, or the commercial life sufficiently long, for this to make sense: then the scale never really mattered, because it just means sending up the system described above repeatedly.
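For anyone who wants to redo the sizing arithmetic, here is a rough sketch rather than a definitive model. It uses the common ~6 × parameters × tokens rule of thumb for dense-transformer training compute. Only the 405B parameters and the ~17 days come from the comment; the token count, per-accelerator peak throughput, utilization, and power draw are all placeholder assumptions of mine, and the function names are hypothetical. The payback helper is just capex divided by annual net revenue, with no real numbers plugged in.

```python
import math

SECONDS_PER_DAY = 86_400


def training_flops(params: float, tokens: float) -> float:
    """Approximate training compute via the ~6 * N * D rule of thumb."""
    return 6.0 * params * tokens


def accelerators_needed(params: float, tokens: float, days: float,
                        peak_flops: float, utilization: float) -> int:
    """Accelerators required to finish training within `days`."""
    sustained = peak_flops * utilization      # FLOP/s actually achieved per device
    budget_seconds = days * SECONDS_PER_DAY
    return math.ceil(training_flops(params, tokens) / (sustained * budget_seconds))


def payback_years(capex: float, annual_net_revenue: float) -> float:
    """Simple undiscounted payback period."""
    return capex / annual_net_revenue


# From the comment: 405B parameters, ~17 days.
PARAMS = 405e9
DAYS = 17

# ASSUMPTIONS (not from the comment): adjust to taste.
TOKENS = 15.6e12           # assumed training-token count
PEAK_FLOPS = 1e15          # assumed ~1 PFLOP/s peak per accelerator
UTILIZATION = 0.40         # assumed fraction of peak actually sustained
WATTS_PER_DEVICE = 1_000   # assumed draw per accelerator incl. overhead

n = accelerators_needed(PARAMS, TOKENS, DAYS, PEAK_FLOPS, UTILIZATION)
power_mw = n * WATTS_PER_DEVICE / 1e6

print(f"total compute: {training_flops(PARAMS, TOKENS):.2e} FLOPs")
print(f"accelerators:  {n}")
print(f"power:         {power_mw:.1f} MW")
```

With these placeholder inputs the result lands in the tens of thousands of accelerators and tens of megawatts. The payback question then comes down to what you feed `payback_years`, which is exactly the part none of us has good numbers for.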