|
|
|
|
|
by Imnimo
1120 days ago
|
|
>Does the n-gram model really need all those parameters to mimic GPT-4? Yes, it does. I don't understand what this argument is supposed to demonstrate. Obviously you can compress the 8000-gram model that GPT-4 represents - GPT-4's weights are proof! |
|