There's definitely a difference between yelling profanities at someone and flashing them. A video call is different from an audio call, and I believe it would be disingenuous to argue otherwise.
Is there a difference? I feel the claim that there is one is a bit perverse, and that any line one tried to draw between the two would be entirely arbitrary.