Red should meant manual human review without automated tools nor AI.
Green for proper AI review and tests verifying the expected input/outputs.