by p-e-w
1126 days ago
> Making that ocean deeper is not a trivial problem that we can just throw more compute or data at.

You can't possibly know that, given that we don't actually understand how LLMs work on a high level.

> We've pretty much tapped out that depth with GPT4

GPT-4 is three months old and you're confident that its working principle cannot be extended further? Where do you get that confidence from?

If you're familiar with other fields of AI, adding more and more layers to ResNet was the hotness for a while, but the trick eventually stopped working.