Even if it's P2P, you still need TURN servers, because participants might be behind NATs that are hard to traverse. In those cases you're basically proxying the full video traffic between both participants.
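For illustration, here is a rough sketch of what the client-side ICE configuration tends to look like when a TURN relay is offered as a fallback. The hostname, username, and credential are made-up placeholders, and `buildIceConfig` is a hypothetical helper, not part of any library. The interfaces only mirror the fields of the browser's `RTCConfiguration` that are used here. The ports are the usual defaults (3478 for STUN/TURN, 5349 for TURN over TLS), so adjust them to your own server.

```typescript
// Minimal local types; they mirror the relevant fields of the browser's RTCConfiguration.
interface IceServer {
  urls: string[];
  username?: string;
  credential?: string;
}

interface IceConfig {
  iceServers: IceServer[];
  iceTransportPolicy: "all" | "relay";
}

// Hypothetical helper: STUN is listed for direct connection attempts,
// TURN (plain and over TLS via the `turns:` scheme) as the relay fallback.
function buildIceConfig(
  host: string,
  username: string,
  credential: string,
  forceRelay: boolean = false
): IceConfig {
  return {
    iceServers: [
      { urls: [`stun:${host}:3478`] },
      {
        urls: [
          `turn:${host}:3478?transport=udp`,
          `turns:${host}:5349?transport=tcp`,
        ],
        username,
        credential,
      },
    ],
    // "relay" forces all media through the TURN server; "all" also allows direct paths.
    iceTransportPolicy: forceRelay ? "relay" : "all",
  };
}

// Placeholder values, assumed for the example only.
const config = buildIceConfig("turn.example.org", "alice", "s3cret");
// In a browser this object would be passed as: new RTCPeerConnection(config)
```

Setting `forceRelay` to `true` is a handy way to test that the relay actually works, since every packet then has to go through the TURN server.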
Have you had success with encrypting TURN servers? While trying to host my own Matrix server with E2E encryption for text, video, and voice, I found that I had to disable video/voice because Let's Encrypt and SSL weren't compatible.