| HN Mirror


	by ATechGuy 281 days ago
	You rely on LLMs riddled with hallucinations for malware detection?

4 comments

jmb99 281 days ago

I'm not exactly pro-AI, but even I can see that their system clearly works well in this case. If you tune the model to favour false positives, with a human review step (that's quick), I can imagine your response time being cut from days to hours (and your customers getting their updates that much faster).

ATechGuy 280 days ago

You are assuming that they build their own models.

Culonavirus 281 days ago

He literally said "Flagged packages are escalated to a human review team." in the second sentence. Wtf is the problem here?

ATechGuy 280 days ago

What about packages that are not "flagged"? There could be hallucinations when deciding to (or not) "flag packages".

orbital-decay 280 days ago

>What about packages that are not "flagged"?

You can't catch everything with normal static analysis either. The LLM just produces some additional signal in this case; false negatives can be tolerated.

ATechGuy 280 days ago

static analysis DOES NOT hallucinate.

Twirrim 280 days ago

So what? They're not replacing standard tooling like static analysis with it. As they mention, it's being used as additional signal alongside static analysis.

There are cases an LLM may be able to catch that their static analysis can't currently catch. Should they just completely ignore those scenarios, thereby doing the worst thing by their customers, just to stay purist?

What is the worst-case scenario that you're envisioning from an LLM hallucinating in this use case? To me the worst case is that it might incorrectly flag a package as malicious, which, given they do a human review anyway, isn't the end of the world. On the flip side, you've got the LLM catching cases not yet recognised by static analysis, which can then be accounted for in the future.

If they were just using an LLM, I might share similar concerns, but they're not.
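
For illustration only, here is a rough sketch of the kind of triage being described: static analysis and an LLM each contribute a signal, either one can flag a package, and flagged packages go to a human review queue. Everything in it is an assumption made up for the example (the `Signals` fields, the `needs_human_review` helper, the 0.3 threshold, the package names); none of it comes from the vendor's actual system.

```python
from dataclasses import dataclass


@dataclass
class Signals:
    static_hit: bool   # hypothetical: a static-analysis rule matched
    llm_score: float   # hypothetical: LLM suspicion score in [0, 1]


def needs_human_review(s: Signals, llm_threshold: float = 0.3) -> bool:
    """Flag a package if either signal fires.

    A low llm_threshold deliberately favours false positives, because a
    human reviewer looks at every flagged package anyway.
    """
    return s.static_hit or s.llm_score >= llm_threshold


# Made-up packages and scores, purely for demonstration.
packages = {
    "left-pad-ng": Signals(static_hit=False, llm_score=0.05),
    "totally-safe": Signals(static_hit=False, llm_score=0.45),
    "evil-postinstall": Signals(static_hit=True, llm_score=0.10),
}

review_queue = [name for name, s in packages.items() if needs_human_review(s)]
print(review_queue)
```

In this sketch the LLM score can only add packages to the queue; it never overrides a static-analysis hit. A package that neither signal flags still slips through, which is the false-negative case argued about upthread.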

tripzilch 279 days ago

well, you've never had a non-spam email end up in your spam folder? or the other way around?

when static analysis does it, it's called a "misclassification"

wiseowise 281 days ago

> We use a mix of static analysis and AI. Flagged packages are escalated to a human review team.

“Chat, I have reading comprehension problems. How do I fix it?”

atanasi 279 days ago

Reading comprehension problems can often be caught with some static analysis combined with AI.

Mawr 281 days ago

"LLM bad"

Very insightful.
