Hacker News new | ask | show | jobs
by permo-w 1206 days ago
I was considering making a fine-tune of da-vinci-3 based on comment chains scraped from politics subreddits with the heuristic “only use the chain if every comment has a score of 0”[1], but this seems to manage it just nicely on its own

[1] i.e. they’re having an argument and downvoting all of each other’s replies