|
|
|
|
|
by NitpickLawyer
212 days ago
|
|
Meh, drama aside, I'm actually curious what would be the true capabilities of a system that doesn't go through any "safety" alignment at all. Like an all out "mil-spec" agent. Feed it everything, RL it to own boxes, and let it loose in an air-gapped network to see what the true capabilities are. We know alignment hurts model performance (oAI people have said it, MS people have said it). We also know that companies train models on their own code (google had a blog about it recently). I'd bet good money project0 has something like this in their sights. I don't think we're that far from a blue vs. red agents fighting and RLing off of each-other in a loop. |
|
I just pray incompetence wins in the right way, for humanity’s sake.