HN Mirror


by ted537 440 days ago

I don't think it would be too hard to scrape useful data out of my LLM convos.

If human response is "That's BS", "fuck off", or something similar, mark as bad assistant message.

If human response is "huh" or "cool", mark as good assistant message.

If on ChatGPT, watch how much scrolling the user does. If there's a lot, it's somewhat likely that the LLM output something useful.

That strategy would have holes of course, but as long as it's better than guessing, something like that would be a useful heuristic.
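A rough sketch of the rules above in Python, just to show the shape of it. The phrase lists only contain the examples named in this post, and the function name, the label strings, and the scroll threshold are all made up for illustration. A real version would need a much longer phrase list and some tuning.

```python
# Hypothetical phrase lists: only the examples from the post.
NEGATIVE_PHRASES = ("that's bs", "fuck off")
POSITIVE_REPLIES = ("huh", "cool")

# Assumed cutoff for "a lot" of scrolling; not a measured value.
SCROLL_THRESHOLD_PX = 2000


def normalize(text):
    """Lowercase, unify curly apostrophes, and trim punctuation at the edges."""
    text = text.lower().replace("\u2019", "'")
    return text.strip(" \t\n.!?,\"'")


def label_assistant_message(human_reply, scroll_px=0):
    """Return a weak label for the assistant message that preceded human_reply.

    "bad"        -> the reply contains a negative phrase
    "good"       -> the reply is exactly a short positive reaction
    "maybe_good" -> no text signal, but the user scrolled a lot
    None         -> no signal at all
    """
    reply = normalize(human_reply)
    if any(phrase in reply for phrase in NEGATIVE_PHRASES):
        return "bad"
    if reply in POSITIVE_REPLIES:
        return "good"
    if scroll_px >= SCROLL_THRESHOLD_PX:
        return "maybe_good"
    return None


print(label_assistant_message("That's BS!"))
print(label_assistant_message("cool"))
print(label_assistant_message("ok, next question", scroll_px=5000))
```

Negative phrases are checked first, so a reply like "cool, but that's BS" still counts against the assistant message. Returning None for everything else keeps the unlabeled bulk out of the dataset instead of guessing.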

2 comments

londons_explore 440 days ago

This.

Even very weak human signals can be immensely valuable over large enough datasets.


DeepYogurt 440 days ago

> If human response is "That's BS", "fuck off", or something similar, mark as bad assistant message.

Marking is not a trivial task though. Use some AI system to mark it and you get a 99.something% filter maybe but whatever that remainder is leaks through. Over time your filter may get worse as a result.
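To put a number on "whatever that remainder is", here is the back-of-the-envelope arithmetic as a tiny Python sketch. The 99.5% accuracy and the one million messages are invented figures for illustration, and the function name is hypothetical. It only counts expected mislabels for a single pass and does not model the filter degrading over time.

```python
def expected_mislabeled(n_messages, accuracy):
    """Expected number of wrongly marked messages for a marker with the given accuracy."""
    return n_messages * (1.0 - accuracy)


# Assumed figures: one million messages, a 99.5% accurate marker.
leaked = expected_mislabeled(1_000_000, 0.995)
print(round(leaked))
```

Even at that accuracy, thousands of wrong labels end up in the training set on each pass.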


ehecatl42 440 days ago

I'm in the process of messing around with a new distro where things are not quite what I am used to, and the usual suspects have been pretty helpful there... except for when they just make shit up.

Grok is the only one that swore back at me. I kinda liked that. The others are way too polite. "Artificial Intelligence? Artificial Canadians, more like," my uni-going kid joked.
