Hacker News new | ask | show | jobs
by seydor 765 days ago
and then they use the output of chatGPT to train their open models
1 comments

which is a pity, because the models and finetunes tainted with even a minuscule amount of GPT slop are affected very badly. you can easily tell the difference between llama finetunes with or without synthetic datasets.