|
|
|
|
|
by thenaturalist
778 days ago
|
|
You're correctly identifying an issue that by now I think everyone is facing globally: Realizing the bottleneck to performance or improvements of LLMs isn't necessarily quantity, but inevitably quality. Which is a much harder problem to solve outside few highly standardized niches/ industries. I think synthetic data generation as a mean to guide LLMs over a larger than optimal search space is going to be quite interesting. |
|
However, if your models distribution is wrong, you’re basically going to have an even more skewed distribution in models trained using the synthetic data.
To me, it seems like the architecture is the next place for improvements. If you can’t synthesise the entirety of human knowledge using transformers, there’s an issue there.
The smell that points me in that direction is the fact that up until recently, you could quantise models heavily with little drop in performance, but recent Llama3 research shows that’s not the case anymore