|
|
|
|
|
by version_five
1127 days ago
|
|
If we're not already, I'd expect to see lots of previously public content get locked down so it can be monetized. It will be interesting to see what the long run equilibrium looks like, as in will large players play ball? I'd guess that the future high value application will still have lots of fine tuning on more specialized datasets, which will continue to be a bottleneck and be considered valuable, while the large scale training data will be less important (with respect to transformer llms, who knows if there is some newer breakthrough). Same way that e.g. everybody uses CNNs pertained on imagenet, and fine tunes on application specific stuff, and there was not much of a commercial push (imo) to rush out and build a better general purpose large scale image set like imagenet. |
|