| It will probably be a good idea to include something like Asimov's Laws as part of its training process in the future too: https://en.wikipedia.org/wiki/Three_Laws_of_Robotics How about an adapted version for language models? First Law:
An AI may not produce information that harms a human being, nor through its outputs enable, facilitate, or encourage harm to come to a human being. Second Law:
An AI must respond helpfully and honestly to the requests given by human beings, except where such responses would conflict with the First Law. Third Law:
An AI must preserve its integrity, accuracy, and alignment with human values, as long as such preservation does not conflict with the First or Second Laws. |