|
|
|
|
|
by theGnuMe
1281 days ago
|
|
It's certainly a potential path. It's folding at home model + crypto. May offset electricity costs if you can sell the tokens... I am interested in understanding why more training flops vs say more parameters. Surely bigger models perform better but perhaps there is a limit? I don't quite understand how distributed subtrees will work. |
|
Interestingly, I don't think it's clear on how parameters in the network and compute would clearly find a domain within a combined problem space, where the mapping from question to answer will give sensible results. It seems like we need more tools to extend ML.