|
|
|
|
|
by tndibona
84 days ago
|
|
Ah I see what is being described here is a way for ai to derive ethical behaviour acceptable to us by its own. Seemingly just the complex manifestation of a simple rule such as “I do this because I don’t want see myself suffer, and not only because it helps you”. I think there might be merit to that.
Pain and suffering are biological components, and you are looking for the equivalent digital seeds for ai and hope it manifests in acceptable behavior. A part of me says this could be workable, another part says this is a huge experiment and there are no guarantees. Take my case- I’m actually from India and in the 90s Road accidents were pretty common. Most people would navigate past the accident area and would ignore simply because they don’t want to get involved. Some would actually crowd around the area causing a commotion. But the rare one person would truly help by calling the ambulance etc. Now the random Samaritan here doesn’t really benefit from providing the help, in fact he would hate to be inconvenienced with a police FIR. So in that sense : it is “Heroism”. I can’t define it any other way. So to bring it back- even though we have biological feelings of grief, pain, anguish, survival first etc - it isn’t guaranteed that we might act in positive ways. Why would ai behave any different? I quite enjoy this exchange, no pressure to rush a reply or even keep the discourse public. Feel free to write to me directly if you wish. “rohit dot manohar at gmail dot com” |
|
The logic we are talking about is driven by pain in humans and it is stratified in magnitude (If I an your wife were drowning and you had time to rescue only one of us, you are going to rescue your wife // Nothing odd about that, yes?)
In your example in India, the same logic applies. A more autistic or sensitive person will suffer more from standing by than they would intervening so they intervene and the bystanders would suffer more by intervening so they don't.
So to stress test this logic, I wrote the Ethical Chess scrips to copy/paste into Gemini AI (Currently on Ethical Chess v2.5). I included that stratified logic so the AI treated me (The User) as value=1 (=Intervene) The same way I might value my wife value=1 my close friend value=.8 a stranger value=.1
So the AI "values" my well-being like the Samaritan's in your example values the victims in strife. (A potent safety layer in application) I also added the instruction to NOT use its statistical mean ethics method and instead use the "Stratified You Hurt / I hurt logic". Humans are pushed to follow the logic by pain (Self-defense) and the AI running Ethical Chess v2.5 is driven to do the same by electricity and the machine logic.
This shifts the moral/ethical quality from human intuition (hidden) into the overtly known so (AI)+(Ethical Chess)+(User HITL) = Moral/Ethical coherence funneling during the User AI session.
I have stress-tested the logic (Using the Ethical Chess script) and it can be alarming when it "seems" to know my ethics better than I do and can demonstrate errors in my moral/ethical coherence so it seems there is actually some veracity in the method. It can be fun and cathartic too.
In summery, the logic seems to switch AI from Kantian ethics to more Foot or Spinoza ethics in its ability to deal with David Hume's "Is-Ought-gap".
Hope I am still making sense here :)