|
|
|
|
|
by ACCount37
3 hours ago
|
|
Not only you could: you would also want to. The likes of Mythos show that the scaling laws are real, and you can x5/x2 the total/active params and get meaningful gains. If "inference per param" gets cheaper? Up the params and get more intelligence for the same price. |
|