Hacker News
by haldujai 1158 days ago
To my knowledge, no SOTA model has been trained on a significant proportion of synthetic data. Has this changed?

The best examples I know of are instruction-tuning sets, but those are a minute amount of data compared to the unsupervised training data.