|
|
|
|
|
by gremlinsinc
849 days ago
|
|
could a training model be fed the raw data or source and weights of an llm and create better functioning llms by spotting patterns and things between models? like if you could feed it all the open source models and it could create sub models off of those and maybe even a 2nd Gen 'self' instance to better train on the second set such that maybe it could find ways to get the same results with 5b model as 75b. |
|
The 75B param model simply has more complexity to work with than the 5B model.
In the same sense that: `y = mx + b` is just not as expressive as `y = ax^2 + bx + c`.