Hacker News new | ask | show | jobs
by hodgehog11 93 days ago
I'm amazed that such a simple method of detection worked so flawlessly for so many people. This would not work for those who merely used LLMs to help pinpoint strengths and weaknesses in the paper; there are separate techniques to judge that. Instead, it only detects those who quite literally copied and pasted the LLM output as a review.

It's incredible how so many people thought it was fair that their paper should be assessed by human reviewers alone, and yet would not extend the same courtesy to others.

3 comments

I'm not surprised at all. The ML research community isn't a community any more, it's turned into a dog-eat-dog low-trust fierce competition. So much more people, papers, churn, that everyone is just fending for themselves. Any moment that you charitably spend on community service can be felt as a moment you take away from the next project, jeopardizing the next paper, getting scooped, delaying your graduation, your contract, your funding, your visa, your residence permit, your industry plans etc. It's a machine. I don't think people outside the phd system really understand the incentives involved. People are offered very little slack in this system. It's sink or swim, with very little instruction or scientific culture or integrity getting passed on. The PhD students see their supervisors cut corners all the time too, authorship bullshit jockeying even in big name labs etc. People I talked to are quite disillusioned, expect their work to have little impact and get superseded by a new better model in a few months so it's all about who can grind faster, who can twist the benchmarks into showing a minimal improvement etc. And the starry eyed novices get slapped by reality into thinking this way fairly early.

To be clear this is not an excuse but an explanation why I am not surprised.

And the real punchline is that the deluge of papers barely matters, as the academic field is barely moving, and the most interesting innovations are happening on the product side.
I disagree with this. Usually the products are based on published research. This is not easily seen by the enthusiast power user base.

Of course it's only a small fraction of all papers that end up actually being used. Most are mainly about advancing careers and strengthening CVs.

I have been in both academia and industry for years, and I don't think the model you describe is true anymore. It was definitely true 10 years ago, but the situation has flipped. Now, I see really ambitious and impactful research coming out of industry labs. Academia is often lagging behind the state of the art because they lack the resources (data, compute, and skills) to compete.

Academia is also incentivized such that everyone works on the same popular topics to secure grants and citations. This is currently LLMs, where academia needs to compete with multi-billion corporations on a technology that is notoriously expensive. In effect, many researchers work on topics that are pretty non-consequential from the get go (such as N+1th evaluation dataset), but it's the only way for them to stay relevant.

I recently talked with a PI from a well-known university lab, and asked why they were doing a startup, given the ML research problems they were working on.

They said a company was the only way to get access to the compute power they needed for that research.

A startup sounds like probably a good solution, if they get paired with the right product- and business-minded people, and together they find a winning collaboration. (Edit: Or if they get acquired rapidly in the AI boom, and negotiate the right deal to enable their research longer-term.)

A lot of those industry papers are in collab with an academic lab or even often first authored by a PhD student who interns in a big tech lab.
One key reason you’re wrong is that many interesting things aren’t even getting published, they’re on the DL for years and eventually make it to public spheres and products.

Academia is just a daycare at this point, and many labs shouldn’t exists or get funding. The people who move the field aren’t necessarily the ones with the most citations, they’re usually hard at work in places that don’t publish at all.

Are you talking about just frontier LLM agent stuff or all of the scope of ICML? I wonder what your subfield is.
This is 'spam' all over again. Before spam every email was valuable and required some attention. It was a better version of paper mail in that it was faster and cheaper. But then the spam thing happened and suddenly being 'faster and cheaper' was no longer an advantage, it was a massive drawback. But by then there was no way back. I think LLMs will do the same with text in general. By making the production of text faster and cheaper the value of all text will diminish, quite probably to something very close to the energy value of the bits that carry the data.
Generally speaking people have worse impulse control than they believe they do. Once you give a tool that does most of the work for you, very very few people will actually be able to use that tool in truly enriching ways. The majority of people (even the smart ones) will weaken over time and take shortcuts.
I have a very simple solution to this but it is a bit expensive. I run two laptops, one that I talk to an LLM on and another where I do all my work and which is my main machine. The LLM is strictly there in a consulting role, I've done some coding experiments as well (see previous comments) but nothing that stood out to me as a major improvement.

The trick is: I can't cut-and-paste between the two machines. So there is never even a temptation to do so and I can guarantee that my writing or other professional output will never be polluted. Because like you I'm well aware of that poor impulse control factor and I figured the only way to really solve this is to make sure it can not happen.

This is a nice solution, but I think it speaks to just how enticing the problem is. This is the sort of tactic someone with a gambling addiction would employ. I don't say that to be rude to you: I've had to do similar things with regard to addicting infinite-scroll internet sites, and I definitely give in more than I'd like.
I am totally aware of my weakness in light of potential addiction, that's why I don't give it any chance, so you are spot on and it is not taken as rude at all.
In a similar vein, I want a text editor where pasting from an external source isn't allowed. If you try, it should instantly remove the pasted text. Copy-pasting from inside the document would still be allowed (it could detect this by keeping track of every string in the document that has been selected by the cursor and allowing pastes that match one of those strings).

It wouldn't work in every use case (what if you need to include a verbatim quote and don't want to make typos by manually typing it?), but it'd be useful when everything in the document should be your words and you want to remove the temptation to use LLMs.

The clipboard is one of the most dangerous components of any operating system when it comes to running secure environments.
This somewhat of the equivalent of "quitting cold turkey", in the sense that you remove the temptation from your reach.

The problem is that it's just much easier to un-quit and run the LLM in the same laptop you work on.

It's just so very tempting.

I think that's the only way to deal with such temptations. Kidding yourself that you are strong enough to do it 'just once' or that you can handle the temptation is foolish and will only lead to predictable outcomes. I have a similar policy to smoking, drugs, alcohol and so on, I just don't want the temptation. It helps to have seen lots of people who thought they were smart enough eventually go under (but the price is pretty high).

Oh, and LLMs are of course geared to pull you in further, they are on a continuous upsell salespitch. Drug pushers could learn a thing or two from them.

You could ssh in to the "dirty" machine ... just sayin'
Yes, I could. But I've purposefully made linking the two quite hard.
People do similar things installing an app locking their phones so they don't spend all day on them, then "oh just this once", ...
That's an excellent point. It seems likely they thought they could operate as a proper reviewer, but when the deadline came, they took the shortcut they knew they were not supposed to take.

It really does sound like an addiction when you put it this way.

I think you're framing this behaviour too generously. Laziness is one thing, lack of integrity is another, and this seems to be a straightforward case of cheating and lying.
I think it's just numbers. When one person errs it's a fault of character. When most people err, we call it a systematic fault. Why are most people overweight for the first time in history? Do most people lack the good character to restrict their diet? You could argue yes, however appeals to character won't actually solve the problem.