by godelski
945 days ago
> Whats the endgame here?

I don't mean to be rude, but at least to me the sentiment of this comment comes off as asking what the endgame is for any hacker demonstrating vulnerabilities in ordinary software. There's always a cat and mouse game. I think we should all understand that given the name of this site...

The point is to perform such checks on LLMs as we would with any software. There definitely is the ability to debug ML models; it's just harder and different than with standard code. There's a large research domain dedicated to this pursuit (safety, alignment, mech interp, etc.).

Maybe I'm misinterpreting your meaning? I must be, right? Because why would we not want to understand how vulnerable our tools are? Isn't that like the first rule of tools: understanding what they're good at and what they're bad at? So I assume I've misinterpreted.