Hacker News new | ask | show | jobs
by diggan 623 days ago
How are you so sure these users are actually bots? Just because someone disagrees with you about Russia or China doesn't mean that's evidence of a bot, no matter how stupid their opinion is.
5 comments

I don't know about anyone else, but to me a lot of bot traffic is very obvious. I don't have the expertise to be able to describe the feeling that low quality bot text gives me, but it sticks out like a sore thumb. It's too verbose, not specific enough to the discussion, and so on.

I'm sure there are real pros who sneak automated propaganda in front of my eyes with my notice, but then again I probably just think they are human trolls.

> but it sticks out like a sore thumb

Could you give some examples of HN comments that "sticks out like a sore thumb"?

> It's too verbose, not specific enough to the discussion, and so on.

That to me just sounds like the average person who feels deeply about something, but isn't used to productive arguments/debates. I come across this frequently on HN, Twitter and everywhere else, including real life where I know for a fact the person I'm speaking to is not a robot (I'm 99% sure at least).

sorry, I didn't mean to give the impression that I was talking about HN comments specifically. I was talking about spotting bot content out on the open Internet.

as for verbosity, I don't mean simply using a lot of text, but rather using a lot of superfluous words sentences.

people tend not to write in comments the way they would in an article.

Hackernews isn’t the place to bring that up regardless of your opinion. So out of context political posts should be viewed with at least some scrutiny.
That's true, but maybe there should be a meta section of the site where these topics can be openly discussed?

While I appreciate dang's perspective[1], and agree that most of these are baseless accusations, I also think that it's inevitable that a site with seemingly zero bot-mitigation techniques, where accounts and comments can be easily automated, doesn't have some or, I would wager _a lot_, of bot activity.

I would definitely appreciate some transparency here. E.g. are there any automated or manual bot detection and prevention techniques in place? If so, can these accounts and their comments be flagged as such?

[1]: https://news.ycombinator.com/item?id=41710142

We're not going to have a meta section for reasons I've explained in the past:

https://news.ycombinator.com/item?id=22649383 (March 2020)

https://news.ycombinator.com/item?id=24902628 (Oct 2020)

I've responded to your other point here: https://news.ycombinator.com/item?id=41713361

There are a few horsemen of the online community apocalypse,

1) Politics 2) Religion 3) Meta

Fundamentally - Productive discussion is problem solving. A high signal to noise ratio community is almost always boring, see r/Badeconomics for example.

Politics, religion are low barrier to entry topics, and always result in flame wars, that then proceed to drag all other behavior down.

Meta is similar: To have a high signal community, with a large user base, you filter out thousands of accounts and comments, regularly. Meta spaces inevitably become the gathering point for these accounts and users, and their sheer volume ends up making public refutations and evidence sharing impossible.

As a result, meta becomes impossible to engage with at the level it was envisioned.

In my experience, all meta areas become staging grounds to target or harass moderation. HN is unique in the level of communication from Dang.

This I agree with, off-topic is off-topic and should be removed/flagged. But I'm guessing we're not talking about simple rule/guidelines-breaking here.
How are you so sure these users are actually bots?

I stated nothing about bots. Re-read what I wrote.

Bots, trolls, foreign agents, a dear child has many names. Point is the same, name calling without evidence does nothing to solve the problem.
Ignoring there are problems doesn't solve anything either.
[flagged]
> If the account is new and promoting Ruzzian narrative by denying the reality I can be 99% sure it is a paid person copy pasting arguments from a KGB manual, 1% is a home sovieticus with some free time.

I'm not as certain as you about that. Last time the US had a presidential election, it seems like almost half the country is either absolutely bananas and out of their mind, or half the country are robots.

But reality turns out to be less exciting in reality. People are just dumb, and spew whatever propaganda they happen to come across "at the right time". Same is true for Russians as it is for Americans.

I think it's mostly a timing thing. It's one thing for someone to say something dumb, but it's another for someone to say it immediately on a new account. That, to me, screams bot behavior. Also if they have a laser focus. Like if I open a twitter account and every single tweet is some closely related propaganda point.
Anyone who’s spent any amount of time in this space can spot them pretty quickly/easily. They tend to stick to certain scripts and themes and almost never deviate.
In my experience, that's not true. Rather, people are much too quick to jump to the conclusion that so-and-so is a bot (or a troll, a shill, a foreign agent, etc.), when the other's views are outside the range of what feels normal to them.

I've written a lot of about this dynamic because it's so fundamental. Here are some of the longer posts (mini essays really):

https://news.ycombinator.com/item?id=39158911 (Jan 2024)

https://news.ycombinator.com/item?id=35932851 (May 2023)

https://news.ycombinator.com/item?id=27398725 (June 2021)

https://news.ycombinator.com/item?id=23308098 (May 2020)

Since HN has many users with different backgrounds from all over the world, it has a lot of user pairs (A, B) where A's views don't seem normal to B and vice versa. This is why we have the following rule, which has held up well over the years:

"Please don't post insinuations about astroturfing, shilling, bots, brigading, foreign agents and the like. It degrades discussion and is usually mistaken. If you're worried about abuse, email hn@ycombinator.com and we'll look at the data." - https://news.ycombinator.com/newsguidelines.html

In my research and experience, it is. I’m making no comment about bots/shills on this site, either, I’m responding to the plausibility of the original comment.
> I’m making no comment about bots/shills on this site, either, I’m responding to the plausibility of the original comment.

The original comment:

> I wonder the same about HN. Has anyone done this kind of analysis? Me good LLM

Slightly disingenuous to argue from the standpoint of "I'm talking about the whole internet" when this thread is specifically about HN. But whatever floats your boat.

Hey, I would appreciate if you could address some of my questions here[1].

I do think it's unrealistic to believe that there is absolutely zero bot activity, so at least some of those accusations might be true.

[1]: https://news.ycombinator.com/item?id=41711060

The claim is not "zero bot activity" - how would one even begin to support that?

Rather, the claim is that accusations about other users being bots/shills/etc. overwhelmingly turn out, when investigated, to have zero evidence in favor of them. And I do mean overwhelmingly. That is perhaps the single most consistent phenomenon we've observed on HN, and it has strong implications.

If you want further explanation of how we approach these issues, the links in my GP comment (https://news.ycombinator.com/item?id=41710142) go into it in depth. If you read those and still have a question that isn't answered there, I can take a crack at it. Since you ask (in your other comment) whether HN has any protections against this kind of thing at all, I think you should look at those past explanations—for example the first paragraph of https://news.ycombinator.com/item?id=27398725.

Alright, thanks. I read your explanations and they do answer some of my questions.

I'm still surprised that the percentage of this activity here is so low, below 0.1%, as you say. Given that the modern internet is flooded by bots—over 60% in the case of ProductHunt as estimated by the article, and a third of global internet traffic[1]—how do you a) know that you're detecting all of them accurately (given that it seems like a manual process that takes a lot of effort), and b) explain that it's so low here compared to most other places?

[1]: https://investors.fastly.com/news/news-details/2024/New-Fast...

From what I understand - users accuse others of being shills and bots, and are a largely wrong.

Dang and team use other tools to remove the actual bots that they can find evidence for.

So yes, there are bots, but human reports, tend to be more about disagreements, than actual bot identification.

intended's reply is correct.

Most of the bot activity we know about on HN has to do with voting rings and things like that, people trying to promote their commercial content. To the extent that they post things, it's mostly low-quality stuff that either gets killed by software, flagged by users, or eventually reported to us.

When it comes to political, ideological, nationalistic arguments and the like, that's where we see little (if any) evidence. Those are the areas where users are most likely to accuse each other of not being human, or posting in bad faith, etc., so that's what I've written about in the posts that I linked to.

There's still always the possibility that some bad actors are running campaigns too sophisitcated for us to detect and crack down on. I call this the Sufficiently Smart Manipulator problem and you can find past takes on it here: https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que....

I can't say whether or not this exists (that follows by definition—"sufficiently" means smart enough to evade detection). All I can tell you is that in specific cases people ask us to look into, there are usually obvious reasons not to believe this interpretation. For example would a sufficiently smart manipulator be smart enough to have been posting about Julia macros back in 2017, or the equivalent? You can always make a case for "yes" but those cases end up having to stretch pretty thin.

How is that different from humans? Humans have themes/areas they care more about, and are more likely to discuss with others. It's not hard to imagine there are Russians/Chinese people caring deeply about their country, just like there are Americans who care deeply about US.
A human aggressively taking a particular line and a bot doing so may be equivalent; do we need to differentiate there?
If the comment is off-topic/breaking the guidelines/rules, it should be removed, full stop.

The difference is that the bots comment should be removed regardless if the particular comment is breaking the rules or not, as HN specifically is a forum for humans. The humans comment, granted it doesn't break the rules, shouldn't, no matter how shitty their opinion/view is.

If posts make HN a less interesting place to converse I don't see why humans should get a pass & I don't see anything in the guidelines to support that view either.
C’mon. When you have an account that is less than a year old and has 542 posts, 541 of which are repeating very specific kremlin narratives verbatim, it isn’t difficult to make a guess. Is your contention that they are actually difficult to spot, or that they don’t exist at all? because both of those views are hilariously false.
I feel like you're speaking about specific accounts here, since it's so obvious and exact. Care to share the HN accounts you're thinking about here?

My contention is that people jump to "It's just a bot" when they parrot obvious government propaganda they disagree with, when the average person is as likely to parrot obvious propaganda without involving computers at all.

People are just generally stupid by themselves, and reducing it to "Robots be robotting" doesn't feel very helpful when there is an actual problem to address.

No, I'm not. And I don't/won't post any specific accounts. I'm speaking more generally - and no one is jumping to anything here, you're projecting an argument that absolutely no one is making. The original claim was that russian/chinese bots were on this platform and left. I've only been here about 1.5 years, so I don't know the validity of that claim, but I have a fair amount of experience and research in the last ten years or so on the topic of foreign misinformation campaigns on the web, so it sounds like a very valid claim, given how proliferate these campaigns were across the entire web.

It isn't an entirely new concept or unknown, and that isn't what is happening here. You're making a lot of weird assumptions, especially given the fact that the US government wrote several hundred pages about this exact topic years ago.

> and no one is jumping to anything here, you're projecting an argument that absolutely no one is making

You literally claimed "when you have accounts with these stats, and they say these specific things, it isn't difficult to guess..." which ends with "that they're bots" I'm guessing. Read around in this very submission for more examples of people doing "the jump".

I'm not saying there isn't any "foreign misinformation campaigns on the web", so not sure who is projecting here.

Ten years ago those accounts existed, too. Back then we called them "people."
Not at all - ten years ago russian misinformation campaigns on twitter and meta platforms were alive and well. There was an entire several hundred page report about it, even.