Hacker News
by simianwords 50 days ago
not really. there are easy heuristics to filter out bots with good confidence. FWIW i don't see any bots posting anything in my feed

Yes, your individual feed isn't really relevant if we talk about the masses. Reddit accounts are for sale quite cheap, HN as well, X too, and so on; it's literally just a matter of means and methodology. If I wanted to make 1000 random posts today talking about a certain thing, I could.
my individual feed does matter because it shows that it is possible to curate something without bots, which is obviously what XAI would do.
congratulations, you have solved anti-scam. go make your billion since it's easy.
it's easy to solve at the offline level, where you have time to filter bots out. in fact, this is already done in pre-training by OpenAI and other companies.

you think it's hard?

Yes I think it's hard.

OpenAI has already been proven to be easily gamed through very unsophisticated poisoning (fake information in a web page + an edit to a wiki page pointing at it, fake information in a reddit post), so I'm not sure we should hold up their efforts at data cleaning as a gold standard.

https://www.sei.cmu.edu/blog/data-poisoning-in-ai-models-the...