https://forum.effectivealtruism.org/posts/ChuABPEXmRumcJY57/...
Also, this summary of "How likely is deceptive alignment" https://forum.effectivealtruism.org/posts/HexzSqmfx9APAdKnh/...